Hey @finnvoorhees
I just came across https://github.com/finnvoor/yap

I'm not very familiar with Apple's AI/LLM frameworks, but do you know whether this could be extended to support speaker recognition (aka diarization ) ?

I'm coming from the "podcasts needs a transcript" angle 😅

GitHub - finnvoor/yap: 🗣️ A CLI for on-device speech transcription using Speech.framework on macOS 26

🗣️ A CLI for on-device speech transcription using Speech.framework on macOS 26 - finnvoor/yap

GitHub
@Jan0707 Apple doesn’t have any diarization models at the moment, but I suspect they might add some eventually