Amazon Transcribe カスタム言語モデルで「お食事券」と「汚職事件」を聞き分ける
https://qiita.com/mksamba/items/6b38ac0d49eb7d060fee?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items

#qiita #AWS #Transcribe #AmazonTranscribe

Amazon Transcribe カスタム言語モデルで「お食事券」と「汚職事件」を聞き分ける - Qiita

1. 背景 業務で少しTranscribeを使っている。 Transcribeの音声認識の精度を上げる方法として、「カスタムボキャブラリ」と「カスタム言語モデル」がある。 「カスタムボキャブラリ」は、標準のTranscribeでは認識されにくい専門用語などを、個別に事前...

Qiita

#RealtimeTranscription for #PyConIT 🎙️

Whisper hallucinated on real conference audio: a problem I skipped for time: everything else to build.

🔮 Spoiler: #AmazonTranscribe doesn't generate text, it decodes it.

In the article: the choices I made, and the stories you only find when plugging things in (Linux audio is a zoo) 😄

https://alessandra.bilardi.net/diary/articles/2026-04/realtime-transcription-choices-and-stories-for-pycon-it.en

#DiaryOfALazyDeveloper #aws #transcribe #docker #fastapi

Type with your voice on Linux using this Whisper-based app - OMG! Ubuntu

Your mouth can say things faster than your hands can type them, yet voice typing is rarely used as a primary input method on desktop (most of us think

OMG! Ubuntu

CreatorCaps detects your video format and suggests the best caption style. Pick your font, highlight color, and preset.

No subscriptions, one-time unlock.

Try it on iPhone & iPad: https://apple.co/4ruCxYI

#transcribe #captions #subtitles #iOS

Cohere Transcribe: state-of-the-art speech recognition

Unmatched accuracy and speed. Transcribe converts your business’ audio data into precise text for search, analytics, and automation.

Cohere
#AI notes I had it to #transcribe a #podcast I did. I never converted it into a book. That was the idea. But I'm working with #Claude and #Gemini and #GPT now directly. I don't need a specialized AI product just for note-taking and note chatting. I can do that with Gemini and NotebookLM just fine.
Hey les guitaristes, quand vous repiquez une mélodie sur une tablature, au-delà du côté mécanique case 11, 13 corde de sol, vous le concevez comment votre repiquage ? Vous pensez notes jouées ? Intervales ? Degrés ? Juste numéros de cases ?
J'essaie de réfléchir à comment mettre à profit le temps passé à retranscrire et y mettre du liant.
Merci !
#guitare #musique #retranscription #morceau #repiquage #transcribe

Công cụ Transcribe (tx) miễn phí, chạy cục bộ với Whisper, hỗ trợ nhận diện giọng nói theo thời gian thực, phân biệt người nói (diarization) và thời gian chính xác. Hỗ trợ file, mic, âm thanh hệ thống và tích hợp Ollama để tóm tắt nội dung (tùy chọn). Hoạt động ngoại tuyến, đa nền tảng: Windows, macOS, Linux. Giao diện đồ họa và CLI tiện lợi tự động hóa.

#Transcribe #Whisper #Ollama #SpeechToText #Diarization #AI #LocalAI #CôngCụ #TríTuệNhânTạo #ThuyếtTrình

https://www.reddit.com/r/LocalLLaM

Deepseek只靠选项猜测听力答案

音频靠的是IDM自动嗅探的,哪个文件大基本哪个就是听力文件
音频转文字靠replicate的gpt-4o-transcribe
既有听力材料又有题目默认全对

第1-4 1 ❌3 ✅
第5-8 1 ❌ 3 ✅
第9-11 1 ❌ 2 ✅
第12-15 1 ❌ 3 ✅
16-25的音频缺失嗅探不到了,只能靠选项猜答案

看看deepseek能帮我考多少分

https://mstdn.feddit.social/@admin/115774431514137937

#deepseek #听力 #idm #transcribe #ai

Seriously, people, come on, if you can't be bothered to either #transcribe your screen-snapshotted #image meme-post or put #alt-text on it, maybe you should reconsider whether it's actually important enough to post. I just went past a DOZEN screenshotted-post posts that I'd have boosted if they had alt text or a transcript.