I'm working on reviving my old podcast searching system using OpenAI's Whisper engine (https://github.com/openai/whisper).

The results so far are amazing. I can run the transcription right on my Mac at roughly 5X realtime, and the accuracy is super impressive. It even gets brand names and weird words right nearly every time.
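For anyone wanting to try this, here is a minimal sketch of local transcription with the `openai-whisper` Python package. The model name (`"base"`), the helper names, and the audio path are assumptions for illustration, not details from this thread. It also assumes `ffmpeg` is installed.

```python
def format_timestamp(seconds):
    """Render a second offset as HH:MM:SS for display next to a segment."""
    total = int(seconds)
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def transcribe_segments(audio_path, model_name="base"):
    """Return (start, end, text) tuples for each segment Whisper finds.

    The import is done lazily so the helper above works without the package.
    """
    import whisper  # pip install openai-whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return [
        (seg["start"], seg["end"], seg["text"].strip())
        for seg in result["segments"]
    ]


if __name__ == "__main__":
    # "episode.mp3" is a placeholder path, not a real file from the thread.
    for start, end, text in transcribe_segments("episode.mp3"):
        print(f"[{format_timestamp(start)}] {text}")
```

Keeping the per-segment start times is what makes the transcript searchable by position in the episode later on.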

For example, this segment from The Talk Show where @marcoarment and @gruber argue about how to pronounce databases was perfectly transcribed, down to even the mispronunciations. 🤯


@_Davidsmith @marcoarment @gruber I built a transcript search Python module and an unofficial @atpfm search engine with Whisper (with an Nvidia GPU and the base model, it transcribed the ATP back catalog at 3 min per episode):

https://marcoshuerta.com/dash/atp_search/

https://github.com/astrowonk/search_transcripts

The backend is FTS5 in SQLite.
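As a rough illustration of the idea (not the actual code in search_transcripts), here is a minimal sketch of full-text search over transcript segments using FTS5 through Python's built-in `sqlite3` module. The table name, column names, and sample rows are made up for the example, and it assumes your Python's SQLite was built with FTS5 enabled.

```python
import sqlite3


def build_index(segments):
    """Create an in-memory FTS5 index from (episode, start, text) tuples."""
    conn = sqlite3.connect(":memory:")
    # Only the text column is tokenized; episode and start are stored as-is.
    conn.execute(
        "CREATE VIRTUAL TABLE transcripts "
        "USING fts5(episode UNINDEXED, start UNINDEXED, text)"
    )
    conn.executemany(
        "INSERT INTO transcripts (episode, start, text) VALUES (?, ?, ?)",
        segments,
    )
    conn.commit()
    return conn


def search(conn, query, limit=10):
    """Return (episode, start, snippet) rows, best match first."""
    rows = conn.execute(
        "SELECT episode, start, "
        "snippet(transcripts, 2, '[', ']', '...', 16) "
        "FROM transcripts WHERE transcripts MATCH ? "
        "ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()


if __name__ == "__main__":
    # Hypothetical sample segments, not real transcript data.
    sample = [
        ("ep-1", 120, "they argue about how to pronounce SQLite and other databases"),
        ("ep-2", 45, "a long discussion about laptop keyboards"),
    ]
    conn = build_index(sample)
    for episode, start, snip in search(conn, "sqlite"):
        print(episode, start, snip)
```

The `MATCH` argument uses FTS5 query syntax, so raw user input containing punctuation may need to be quoted before it is passed in.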


@marcoshuerta this is awesome! Well done!