https://cohere.com/blog/transcribe #speechrecognition #AItools #buzzwordbingo #workplaceinnovation #transcriptiontechnology #HackerNews #ngated
https://winbuzzer.com/2026/03/27/cohere-open-source-transcribe-model-tops-asr-leaderboard-xcxwbn/
Cohere's Open-Source Transcribe Model Tops ASR Leaderboard
#AI #Cohere #CohereTranscribe #SpeechRecognition #AITranscription #OpenSourceAI #HuggingFace #MultimodalAI
AssemblyAI (@AssemblyAI)
์๋ฃ ์๋ด ์์ฑ์ธ์์์ ๋ฒ์ฉ ASR์ ํ๊ณ๋ฅผ ๋ณด์ํ๊ธฐ ์ํด, Universal-3 Pro ์์ ๋์ํ๋ โMedical Modeโ๋ฅผ ์๊ฐํ๋ค. ๋จ์ผ ํ๋ผ๋ฏธํฐ๋ก ํ์ฑํํ๋ฉฐ, ์๋ฃ ์ฉ์ด ์ธ์์ ์ต์ ํ๋ ๋ณด์ ๋จ๊ณ๋ก ํน์ ์ฝ๋ฌผ๋ช ๊ฐ์ ์ ๋ฌธ ์ฉ์ด ์ค์ธ์์ ์ค์ด๋ ๊ฒ์ด ํต์ฌ์ด๋ค.

General-purpose ASR: 95%+ accuracy on a clinical consult. Also general-purpose ASR: gets "hydrochlorothiazide" wrong every time. Introducing Medical Mode โ a correction pass on top of Universal-3 Pro optimized for medical entity recognition. Enable it with one parameter.
AssemblyAI (@AssemblyAI)
์์ ์ํฌํ๋ก์ฐ์ฉ Medical Mode๊ฐ ๊ณต๊ฐ๋์์ต๋๋ค. ์ผ๋ฐ์ ์ธ ์์ฑ์ธ์ ์ ํ๋๊ฐ ๋์๋ ์์์์๋ ์ฝ๋ฌผ๋ช ๊ฐ์ ํต์ฌ ํ ํฐ ์ค๋ฅ ๋๋ฌธ์ ์ค์ฌ์ฉ์ด ์ด๋ ต๋ค๋ ๋ฌธ์ ๋ฅผ ํด๊ฒฐํ๋ ค๋ ๊ธฐ๋ฅ์ ๋๋ค.
https://x.com/AssemblyAI/status/2036822463347302652
#medicalai #speechrecognition #clinicalworkflow #asr #healthcare

Medical Mode is now available for clinical workflows. We built Medical Mode because a transcript that's 95% accurate can still be unusable in a clinical setting. Errors in general-purpose ASR are often concentrated on exactly the tokens clinicians care about most: drug names,
Chrome extension adjusts video speed based on how fast the speaker is talking
https://github.com/ywong137/speech-speed
#HackerNews #ChromeExtension #VideoSpeed #SpeechRecognition #TechInnovation #OpenSource
Hands on with AI audio generation: GAI voice, music, and sound effects
This is the second post in a series exploring the multimodal possibilities of generative AI. This series will take a detailed, hype-free look at text, image, audio, video, and code generation and explore the creative potential as well as the ethical concerns of GAI. Although Generative AI isn't a new technology, it's definitely been having a hype moment since the release of ChatGPT in November 2022. Unfortunately, the focus has been squarely on the text-based chatbot at the exclusion of [โฆ]https://winbuzzer.com/2026/03/16/ibm-granite-4-1b-speech-tops-openasr-leaderboard-xcxwbn/
IBM Granite 4.0 1B Speech Tops OpenASR Leaderboard
#AI #AIModels #IBM #SpeechRecognition #OpenSourceAI #EnterpriseAI #EdgeComputing #AITranslation #OpenASRLeaderboard
Nico Martin (@nic_o_martin)
MistralAI์ Voxtral๊ณผ Transformers.js, WebGPU ์กฐํฉ์ผ๋ก ๋ธ๋ผ์ฐ์ ์์ ์ค์๊ฐ ์์ฑ ์ ์ฌ๊ฐ ๊ฐ๋ฅํด์ก๋ค๋ ๋ฐํ์ ๋๋ค. ๋ค์ํ ์ธ์ด๋ฅผ ์ง์ํ๋ฉฐ ๋ฌธ์ฅ ์ค๊ฐ์ ์ธ์ด๊ฐ ๋ฐ๋์ด๋ ์ธ์ํ๋ ๊ธฐ๋ฅ์ ๊ฐ์กฐํ์ฌ ์น ๊ธฐ๋ฐ ASR(์๋ ์์ฑ์ธ์)์ ์ ์ง์ฐยท๋ค๊ตญ์ด ์ ์ฉ ์ฌ๋ก๋ก ์๋ฏธ๊ฐ ํฝ๋๋ค.
https://x.com/nic_o_martin/status/2032087412462022663
#mistralai #voxtral #transformersjs #webgpu #speechrecognition