Hi,
Ultravox is a suite of open-weight models designed to keep time to first token as low as possible for audio input. Basically, they trained a fast, accurate projector that maps the output of the Whisper large-v3 encoder into Llama 3.1 LLMs, in both 8B and 70B sizes.
I think it would be a great fit for LiveKit's Agents, so it would be nice to add an example and demo for it!
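For anyone unfamiliar with the idea, here is a rough sketch of what such a projector does conceptually. It is not Ultravox's actual code: the function name `project_audio_frames`, the frame-stacking factor, the hidden size, and the random weights are all assumptions for illustration. The only fixed numbers are the 1280-wide Whisper large-v3 encoder output and the 4096-wide embeddings of an 8B Llama model.

```python
import numpy as np

def project_audio_frames(frames, w_in, w_out, stack=8):
    """Map encoder frames (T, d_audio) to LLM-sized embeddings (T // stack, d_llm).

    Hypothetical projector: stack neighbouring frames to shorten the
    sequence, then apply a two-layer MLP with a ReLU in between.
    """
    t, d_audio = frames.shape
    t_trim = (t // stack) * stack  # drop leftover frames that do not fill a stack
    stacked = frames[:t_trim].reshape(t_trim // stack, stack * d_audio)
    hidden = np.maximum(stacked @ w_in, 0.0)  # ReLU
    return hidden @ w_out

rng = np.random.default_rng(0)
d_audio, d_llm = 1280, 4096   # Whisper large-v3 encoder width, 8B Llama embedding width
d_hidden, stack = 256, 8      # assumed values, chosen only to keep the demo small

# Stand-in for real encoder output: 100 frames of random features.
frames = rng.standard_normal((100, d_audio)).astype(np.float32)
# Random weights; a real projector would be trained.
w_in = (rng.standard_normal((stack * d_audio, d_hidden)) * 0.01).astype(np.float32)
w_out = (rng.standard_normal((d_hidden, d_llm)) * 0.01).astype(np.float32)

audio_embeddings = project_audio_frames(frames, w_in, w_out, stack)
print(audio_embeddings.shape)  # (12, 4096)
```

The resulting rows can be fed to the LLM in place of text token embeddings, which is why no separate speech-to-text step is needed before the first token is generated.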