Skip to content

feat(multimodal): Audio understanding #3602

Closed
@mudler

Description

@mudler

Is your feature request related to a problem? Please describe.
Projects like https://github.com/kyutai-labs/moshi and https://github.com/ictnlp/LLaMA-Omni allow to audio understanding, this is a capability that could be integrated in LocalAI as well

Describe the solution you'd like
A backend and a way for the API to understand audio

Describe alternatives you've considered

Additional context
This Issue is left open on purpose for discussing potential implementations and backends that should be integrated in LocalAI

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions