#llm/audio

Public notes from activescott tagged with #llm/audio

Wednesday, February 25, 2026

Wednesday, February 4, 2026

Voxtral Realtime - A 4-billion-parameter model aimed at live transcription, achieving “state of the art” transcription with 480ms latency across 13 languages. It is configurable down to sub-200ms latency.

On the FLEURS benchmark, Voxtral Mini Transcribe V2 performs competitively against models from Gemini and OpenAI, with the lowest diarization error rate.

Thursday, January 22, 2026

Wednesday, January 7, 2026

“The body cam software and the AI report writing software picked up on the movie that was playing in the background, which happened to be ‘The Princess and the Frog,’” a Heber City sergeant told FOX 13 News. “That’s when we learned the importance of correcting these AI-generated reports.”

Sunday, January 4, 2026