I'm working on reviving my old podcast searching system using OpenAI's Whisper engine (https://github.com/openai/whisper).

The results so far are amazing. I can run the transcription right on my Mac at roughly 5X realtime, and the accuracy is super impressive. It even gets brand names and weird words right nearly every time.

For example, this segment from The Talk Show where @marcoarment and @gruber argue about how to pronounce databases was perfectly transcribed, down to even the mispronunciations. 🤯
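For anyone curious what the transcribe-then-search loop might look like, here is a minimal sketch, not the actual system described above. It assumes the `openai-whisper` Python package and its documented `load_model` / `transcribe` calls. The helper names (`transcribe`, `search_segments`, `format_timestamp`), the `"base"` model choice, and the plain substring matching are all illustrative assumptions.

```python
from typing import Dict, List


def transcribe(path: str, model_name: str = "base") -> List[Dict]:
    """Transcribe one audio file and return Whisper's timestamped segments."""
    # Imported lazily so the search helpers below work without Whisper installed.
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(path)
    # Each segment is a dict that includes "start", "end" (seconds) and "text".
    return result["segments"]


def search_segments(segments: List[Dict], query: str) -> List[Dict]:
    """Return the segments whose text contains the query, ignoring case."""
    needle = query.casefold()
    return [
        {"start": seg["start"], "end": seg["end"], "text": seg["text"].strip()}
        for seg in segments
        if needle in seg["text"].casefold()
    ]


def format_timestamp(seconds: float) -> str:
    """Render a position in seconds as HH:MM:SS (fractions are dropped)."""
    hours, remainder = divmod(int(seconds), 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"
```

With a hypothetical file named `episode.mp3`, usage would be roughly `hits = search_segments(transcribe("episode.mp3"), "sqlite")`, then printing `format_timestamp(hit["start"])` next to each hit's text. A real search system would more likely index the segments in a database with full-text search than scan them in memory.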

@_Davidsmith @marcoarment @gruber Is this a public service or project? I've been wanting to run Whisper on the entire corpus of ATP episodes just to be able to reference things more easily, but I haven't really sat down to do it.

@particles While nowhere near as precise as the Whisper demo that @_Davidsmith just showed, https://catatp.fm also posts automated transcripts of ATP. I used it just the other day to check out some of the RSS feeds @marcoarment mentioned in episode 417 when they weren’t in the show notes.

Thank you for sharing the link to your transcript catalogue, David. It’s a treasure trove.

catatp.fm · Unofficial Accidental Tech Podcast transcripts (generated by computer, so expect errors).

@jon Sadly, Whisper is just too good to pass up imho. It looks like catatp's author might be willing to consider it, since their about page discusses the "word error rate" of other transcription services. Whisper is just... awesome tbh.
@particles Agreed. Given my experience with ChatGPT in its first few hours of public existence (prior to its capabilities being reined in), I’m convinced it was trained on the endless abyss of YouTube content & other video, in addition to the vast sea of open podcast feeds, using Whisper.