Great new audio transcription model by Mistral.
Voxtral transcribes at the speed of sound. https://mistral.ai/news/voxtral-transcribe-2
#mistral #ai #europe #voxtral #audiotranscription
Voxtral transcribes at the speed of sound. | Mistral AI

Precision diarization, real-time transcription, and a new audio playground.

Speakr v0.8.0 ra mắt! 🎙️ Tính năng nhận dạng người nói (diarization) giờ chỉ cần API key OpenAI, không cần GPU hay WhisperX. Thêm REST API v1 cho tự động hoá (n8n, Zapier, Make) và UI cải thiện, theo dõi token, hỗ trợ cấu hình linh hoạt. Cập nhật Docker compose như thường. #Speakr #SelfHosted #AudioTranscription #AI #OpenAI #Docker #CôngCụ #ÂmThanh #TựChủ

https://www.reddit.com/r/selfhosted/comments/1q77qcm/speakr_v080_speaker_diarization_without_a_gpu/

Audio and Video transcription for free.

Last week, I needed a transcription of the latest episode of White Roof Radio. I for reals searched for some way to do it on my Mac. I came across OpenAI's Whisper. Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. They actually open sourced it and when you look hard enough, you find out how to use it. On Linux. Using terminal commands. Some searches proved unsuccessful, so I used my work […]

https://donburnside.com/because-this-should-be-easier/

Audio and Video transcription for free. – donburnside.com

🚀 Behold, the groundbreaking Gemini 3 Pro, now with audio transcription and... pelican benchmarks? 🦤 Because when you're at the cutting edge of AI, obviously the first thing you need to measure is how well it handles waterfowl. Thank you, Simon, for this essential update, proving once again that tech innovation is truly for the birds. 😂
https://simonwillison.net/2025/Nov/18/gemini-3/ #Gemini3Pro #audioTranscription #AIinnovation #pelicanBenchmarks #techForTheBirds #HackerNews #ngated
Trying out Gemini 3 Pro with audio transcription and a new pelican benchmark

Google released Gemini 3 Pro today. Here’s the announcement from Sundar Pichai, Demis Hassabis, and Koray Kavukcuoglu, their developer blog announcement from Logan Kilpatrick, the Gemini 3 Pro Model Card, …

Simon Willison’s Weblog
Trying out Gemini 3 Pro with audio transcription and a new pelican benchmark

Google released Gemini 3 Pro today. Here’s the announcement from Sundar Pichai, Demis Hassabis, and Koray Kavukcuoglu, their developer blog announcement from Logan Kilpatrick, the Gemini 3 Pro Model Card, …

Simon Willison’s Weblog

I'm trying to use an Elgato Stream Deck Pedal as a transcription pedal.

But which software – Mac OS, preferably #FLOSS – to use?

(Tried using Whisper to auto-transcribe interview recordings, and it takes about as much time to correct the transcript afterwards as transcribing manually with a good pedal…)

#audiotranscription #qualitativeresearch #transcription #elgato #interviewmethods

Progress on my little speech2text/transcription project:

1. You press some hotkeys.
2. You speak into your microphone.
3. You wait for approx. 10 secs. (depending on your hardware)
4. Text starts to magically appear on your screen!

 

It feels like True Magic™! 🪄 ✨

This is why I love software development! ❤️

#Speech2Text #AI #Whisper #Rust #RustLang #Audio #AudioTranscription

Multimodal Voice Intelligence with .NET MAUI - .NET Blog

Learn how to enhance your .NET MAUI apps with multimodal AI capabilities, enabling users to interact through voice using plugins and Microsoft.Extensions.AI.

.NET Blog
Parakeet-TDT-0.6b-V2 - a Hugging Face Space by nvidia

Upload or record an audio file, and get a detailed transcription with timestamps for each segment. The app handles long audio files efficiently.

Parakeet-TDT-0.6b-V2 - a Hugging Face Space by nvidia https://huggingface.co/spaces/nvidia/parakeet-tdt-0.6b-v2 (it’s good) #AI #AudioTranscription #OpenSource
Parakeet-TDT-0.6b-V2 - a Hugging Face Space by nvidia

Upload or record an audio file, and get a detailed transcription with timestamps for each segment. The app handles long audio files efficiently.