The enshittification of AI has led to people groaning at VLC's choice to use AI. I even saw a post cross my feed from someone looking for a replacement for VLC.

VLC is working on on-device realtime captioning. This has nothing to do with generating images or video using AI. This has nothing to do with LLMs.

(edit: There are claims VLC is using a local LLM. It will use whisper.cpp, and not be using OpenAI's models. I don't know which models they will be using. I cannot find any reference to VLC using an LLM.)

While human-generated captions would be preferable for better accuracy, this is not always possible. This means a lot of video media is inaccessible to those with hearing impairment.

What VLC is doing is something that will contribute to accessibility in a big way.

AI transcription is still not perfect. It has its problems. But this is one of those things that we should be hoping to advance.

I'm not looking to replace humans in creating captions. I think we're very far from ever being able to do this correctly without humans. But as I said, there's a ton of video content that simply does not have captions available, human-generated or not.

So long as they're not trying to manipulate the transcription using GenAI, this is the wrong use of AI to demonize.

#AI #Transcription #VLC #HearingImpaired #Deaf #Accessibility

@bedast the worry I do have regarding this feature is that it will give some people (and their number will grow over time) an excuse to stop investing in producing quality captioning. Why spend money/resources when there is an AI that will generate some [crappy, or just basic, if not erroneous] captions automatically?

I believe that in the long run, there will be an inevitable drop in quality in exchange for availability.

Damned if you do, damned if you don’t, as they say.

@xavsworld @bedast This right here is exactly the problem. We are already seeing this happening with image descriptions. Many people don't want to write descriptions, they don't care enough about accessibility. But they will be yelled at if they don't provide any. Therefore they use "AI" to generate them.

These "AI"-generated ones miss the point, or outright hallucinate about the contents of the image. They're often worse than no description at all.

And that's what happens to subtitles, too.

@scy I find it quite interesting what AI sees in my pics and what I do not see or did not intend. In the end, with some correction, alt texts work OK and, sorry to say, it saves time and helps to get rid of tedious work. Since it’s not a creative process, I don’t mind handing it over to a dirty machine.

@petpet The thing is, most people don't make these corrections you're talking about. Most people don't even read the alt texts they've just generated before posting them.