#speech-to-text + #mistral

Mistral surcharges voice AI with new models | The Deep View

www.thedeepview.com/articles/mistral-surcharges-voice-ai-with-new-models

Voxtral Realtime - A 4 billion parameter model aimed at live transcription, achieving “state of the art” transcription with 480ms latency across 13 languages. It can be configurable down to sub-200ms latency.

Performance on the FLEURS benchmark shows that Voxtral Mini Transcribe V2 performs competitively against models from Gemini and OpenAI, with the lowest diarization error rate.

#3:33 PM

speech-to-text llm/audio mistral

Wednesday, February 4, 2026

Mistral surcharges voice AI with new models | The Deep View