Voxtral transcribes at the speed of sound. https://mistral.ai/news/voxtral-transcribe-2
#mistral #ai #europe #voxtral #audiotranscription
Speakr v0.8.0 ra mắt! 🎙️ Tính năng nhận dạng người nói (diarization) giờ chỉ cần API key OpenAI, không cần GPU hay WhisperX. Thêm REST API v1 cho tự động hoá (n8n, Zapier, Make) và UI cải thiện, theo dõi token, hỗ trợ cấu hình linh hoạt. Cập nhật Docker compose như thường. #Speakr #SelfHosted #AudioTranscription #AI #OpenAI #Docker #CôngCụ #ÂmThanh #TựChủ
https://www.reddit.com/r/selfhosted/comments/1q77qcm/speakr_v080_speaker_diarization_without_a_gpu/
Audio and Video transcription for free.
Last week, I needed a transcription of the latest episode of White Roof Radio. I for reals searched for some way to do it on my Mac. I came across OpenAI's Whisper. Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. They actually open sourced it and when you look hard enough, you find out how to use it. On Linux. Using terminal commands. Some searches proved unsuccessful, so I used my work […]Google released Gemini 3 Pro today. Here’s the announcement from Sundar Pichai, Demis Hassabis, and Koray Kavukcuoglu, their developer blog announcement from Logan Kilpatrick, the Gemini 3 Pro Model Card, …
Trying out Gemini 3 Pro with audio transcription and a new pelican benchmark
https://simonwillison.net/2025/Nov/18/gemini-3/
#HackerNews #Gemini3Pro #AudioTranscription #PelicanBenchmark #TechNews #Innovation
Google released Gemini 3 Pro today. Here’s the announcement from Sundar Pichai, Demis Hassabis, and Koray Kavukcuoglu, their developer blog announcement from Logan Kilpatrick, the Gemini 3 Pro Model Card, …
I'm trying to use an Elgato Stream Deck Pedal as a transcription pedal.
But which software – Mac OS, preferably #FLOSS – to use?
(Tried using Whisper to auto-transcribe interview recordings, and it takes about as much time to correct the transcript afterwards as transcribing manually with a good pedal…)
#audiotranscription #qualitativeresearch #transcription #elgato #interviewmethods
Progress on my little speech2text/transcription project:
1. You press some hotkeys.
2. You speak into your microphone.
3. You wait for approx. 10 secs. (depending on your hardware)
4. Text starts to magically appear on your screen!
It feels like True Magic™! 🪄 ✨
This is why I love software development! ❤️
#Speech2Text #AI #Whisper #Rust #RustLang #Audio #AudioTranscription
via @dotnet : Multimodal Voice Intelligence with .NET MAUI
https://ift.tt/Db32W1s
#MultimodalAI #VoiceIntelligence #DotNetMAUI #AIIntegration #MobileDevelopment #VoiceCommands #AudioTranscription #OpenAI #UserExperience #Accessibility #TechInnovation #MicrosoftB…