#llm/audio
Public notes from activescott tagged with #llm/audio
Monday, May 11, 2026
Wednesday, February 25, 2026
Free AI Voice Generator & Voice Agents Platform | ElevenLabs
Powering the best enterprises, creators, and developers. From ElevenAgents for customer experience, ElevenCreative for content creation, to the leading AI voice generator.
Wednesday, February 4, 2026
Voice emerges as AI’s next frontier as Deepgram raises $130M | The Deep View
One of Deepgram’s goals for the upcoming year is to pass the Audio Turing Test, which assesses how realistic and human-like AI-generated audio sounds.
Mistral surcharges voice AI with new models | The Deep View
Voxtral Realtime - A 4 billion parameter model aimed at live transcription, achieving “state of the art” transcription with 480ms latency across 13 languages. It can be configurable down to sub-200ms latency.
Performance on the FLEURS benchmark shows that Voxtral Mini Transcribe V2 performs competitively against models from Gemini and OpenAI, with the lowest diarization error rate.
Thursday, January 22, 2026
Simon Willison on text-to-speech
Wednesday, January 7, 2026
Build voice, video, and physical AI | LiveKit
Vapi - Build Advanced Voice AI Agents
AI police report turns Heber City officer into a frog
“The body cam software and the AI report writing software picked up on the movie that was playing in the background, which happened to be ‘The Princess and the Frog,’” a Heber City sergeant told FOX 13 News. “That’s when we learned the importance of correcting these AI-generated reports.”
Sunday, January 4, 2026
Z.ai - Inspiring AGI to Benefit Humanity
Chinese open source LLM company