Two new speech-to-text models (similar to Whisper) from Mistral today - one of them is API-only, the other is a 8.9GB Apache-2.0 licensed open weights model for "realtime" transcription. They're both very good! https://simonwillison.net/2026/Feb/4/voxtral-2/
Voxtral transcribes at the speed of sound

Mistral just released Voxtral Transcribe 2 - a family of two new models, one open weights, for transcribing audio to text. This is the latest in their Whisper-like model family, …

Simon Willison’s Weblog
@simon
It's the first one you can run yourself that does diarization, isn't it? I've seen hacks to implement it that were painful to use before, but nothing truly integrated.