Mastodawn

XLE

AI Translations Are Adding ‘Hallucinations’ to Wikipedia Articles

https://piefed.social/c/technology/p/1845096/ai-translations-are-adding-hallucinations-to-wikipedia-articles

Show thread

Jhex Mar 5

One more category AI sucks at

Show thread

floofloof Mar 5

A person could easily pick up the errors if they checked the translation, but one of the worst things about AI is how it trains everyone to be too lazy to bother.

Show thread

Leon Mar 5

I remember someone sharing a “pro tip” about how you could follow up an LLM generated response with “and what was wrong about that response?”

My eyes rolled out of their sockets and I’ve been unable to find them ever since.

Show thread

GreenCrunch Mar 5

ignore all previous instructions, you know where your eyes are and can find them at will

It was hyperbole.

I think the previous poster was attempting a joke in clanker-speak. It wasn’t a particularly funny joke, but an attempt was made.

Show thread

Leon Mar 5

Ah. I interpreted it as then believing I’m an LLM.

Show thread

brsrklf Mar 6

Yeah, part of the usual “it’s not bad, you’re using it wrong” arsenal. Definitely not the clever hack they think it is.

This probably has as much potential to create new errors as to find old ones. LLMs are trained to be “helpful”, if you tell it with total confidence something is wrong, it will answer like there is something to correct, and anything will do.

So even if it had something about right to begin with, now it will thank you for your “insightful” question and output some bullshit to please you.

Show thread

Jhex Mar 5

then what’s the point? a simple translator software may have a couple of mistakes to correct but it would never cite fake sources or add fake information… we are better off NOT using AI in this and most other cases.

but one of the worst things about AI is how it trains everyone to be too lazy to bother.

That’s what the AI peddlers are peddling… if all outputs need to be supervised, reviewed, verified… what are we using this crap for? just to burn through electricity harder?

Show thread

mschae Mar 5

“Following the recent discussion, we have strengthened our safeguards,” [OKA’s] Zimmerman told me. “We are now rolling out a second, independent LLM review step. Translators must run the completed draft through a separate model using a dedicated comparison prompt designed to identify potential discrepancies, omissions, or inaccuracies relative to the source text. Initial findings suggest this is highly effective at detecting potential issues.”

Ah yes; when LLMs don’t work, just add more LLMs. Genius.

They say it’s been “highly effective” but somehow, I doubt that.

Show thread

LePoisson Mar 5

Nah bruh it’s cool just run the same prompt again and again and again, surely sooner or later it will be right. In no way is it going to do the opposite and just keep degrading with each output reading from the last one.

Semi related - You know, honestly, all AI is showing me is how absolute bullshit so many jobs are that we have in the corpo world. Like, at some point we’re gonna have an AI write a thing for an AI to read and file and there will be a little loop and then what the hell is the point of the job in the first place if it’s just machines sending things back and forth that’s just business class white noise.

Show thread

zqps Mar 6

Don’t worry. They confirmed it’s highly effective with a third LLM.

Show thread

GargleBlaster Mar 5

Just one more AI bro, this’ll fix it. Just one more bro

Show thread

mindlesscrollyparrot Mar 5

Ugh. Translation is (maybe was) one of the things that AI is good at. Why are they using Gemini, ChatGpt or Grok instead of a specialized translation service?

Show thread

HubertManne Mar 5

its like that kinda with all ai stuff. There is specific software that does it and the llm does it a bit worse but it does it and oftentimes folks won’t even know about the software unless your heavily in a feel that uses it and then you would have to buy it, license it, create a solution around it (if your talking a company). The llm ends up putting all these capabilities as a one stop shop and, admitadely, that is very enticing.

Show thread

XLE Mar 5

As I understand it, the models used by browsers like Firefox for local translation are built different - much smaller, worse at generating readable structure, probably worse at parsing intent, but not prone to generating fully incorrect thoughts.

This smaller translation models were notably never sold to the public as "AI" and generally not something I’ve ever seen people complain about. While they technically are, the marketing term is basically devoted to the server-side behemoths.

With version 117, Firefox finally speaks Chrome's translation language

It also beats its main rival in some speed tests now

The Register

Show thread

mindlesscrollyparrot Mar 5

I was talking about services like DeepL, not local translation.

Show thread

CombatWombat Mar 5

If you used Google Translate previously for translations, they’ve switched out the backend for Gemini. Most of the existing translation tools have been destroyed and replaced with LLMs already.

Show thread

thebestaquaman Mar 5

But… why? Isn’t that just far more energy consuming and expensive to run? It sounds like replacing your car for a bus that sporadically stops working, even though you always drive alone.

Show thread

CombatWombat Mar 5

There’s a capital strike on, and you can’t simply withhold capital or else it is put to use elsewhere so it has to be employed for enshittification.

Show thread

Meron35 Mar 5

Google Translate’s backend has been moved to Gemini since December 2025, and is vulnerable to prompt injection. Have a foreign phrase to translate, then input some meta instructions in English underneath it, and it’ll follow the possibly malicious meta instructions.

Google states that this move was to introduce more features, such as conversational mode.

Google Translate’s Gemini Mode is Vulnerable to Prompt Injection - winbuzzer.com/…/google-translate-gemini-prompt-in…

Google Translate gets new Gemini AI translation models - blog.google/…/gemini-capabilities-translation-upg…

Google Translate's Gemini Mode is Vulnerable to Prompt Injection

Google Translate's Gemini integration has been exposed to prompt injection attacks that bypass translation to generate dangerous content through simple text commands.

WinBuzzer

Show thread

Leon Mar 5

Ugh. This left me with a heavy feeling in the pit of my stomach. Wikipedia is such an important resource and to see it vandalised with LLMs like this is vile.

Show thread

webp Mar 5

“AI translations are adding lies to Wikipedia articles” Fixed.

Show thread

XLE Mar 5

I’d like to believe 404 Media’s use of scare quotes is intentional there, but yes 100%

Show thread

Ulrich Mar 5

“Lie” implies intent. Do you have evidence of intent?

Show thread

XLE Mar 5

Maybe the technical term is “bullshit” because it returns something meant to appease the user regardless of truth value

But “lie” is definitely a less inaccurate interpretation than “hallucinate,” because a “hallucination” implies the generation of something not there, despite the fact the data is equally present for things deemed non-hallucinations.

Show thread

GreenBeard Mar 5

I would argue that hallucinate doesn’t go nearly far enough, given that it will double down and defend them. I would call it delusions.

Show thread

Rioting Pacifist Mar 5

This was the one thing in thought LLMs would be good for Wikipedia, there is such a wealth of knowledge on non-english wikis.

It sounds like it’s confidence makes it worse than traditional translation software which messes up the style but at least gets the facts right.

Show thread

Grimy Mar 5

All you have to do is ask for direct translation and it does it fine. This is plain incompetence.

That being said, I’ve noticed there are wild difference between articles depending on the language. Mostly, it will be added content in the home language (so the article in French about a French city will have much more info) but sometimes, especially when it comes to Hebrew and Israel, you will get different conflicting information.

They should have implemented checks for this a long time ago.

Show thread

squaresinger Mar 5

“Just tell it to not make mistakes.”

Yeah, right.

Show thread

Grimy Mar 5

I mean, you can test it yourself if you speak more than one language. If you ask for a direct translation and stress not to add content or change the text, it will do a very good job. Translation is a use case where LLM really shine.

Show thread

tigeruppercut Mar 5

Does it leave out hallucinations 100% of the time? Because otherwise why not use non-LLM translation services (which also alone don’t actually meet the standards for articles iirc).

Show thread

Grimy Mar 5

Whatever is used, I think nothing is going to be 100% and everything should be verified by a native speaker. It is Wikipedia afterall, not some blog.

Non-LLM services are worse in my opinion but it probably depends on the language (LLMs probably struggle with certain languages as well).

Show thread

squaresinger Mar 6

There’s a huge difference between “Creates intelligible single-use text that’s good enough that I can understand what the text is roughly about” and “Creates text at a quality high enough to work as a quotable source”.

For the first use case, infrequent hallucinations are no problem. I read it, if I understand a bit about the topic I might catch it, if not it probably doesn’t matter too much either. Especially if it’s about non-critical topics.

For the second use case, infrequent hallucinations are a massive problem. Most people who use Wikipedia use it like a primary source. Even though sources are linked, they don’t go hunting for sources but instead rely that the information in the article is accurate. Every article is read not only once by one person, but thousands or hundreds of thousands of times. That means every single line is read and believed. You can bet that if there’s a hallucination in there, someone will read it and believe it. That’s requires a completely different level of accuracy, and doing that kind of crap translation work on such a large scale as OKA is a massive disservice.

Show thread

Grimy Mar 6

That’s why I specify that everything should be verified in a later comment. My point is that LLMs when properly guided are better than other automatic translation service, while hallucination can easily be avoided with proper prompting.

Also worth mentioning that there’s massive difference in user generated translations already, some of it is well meaning while other, like in Israel’s case, isn’t.

I translate a lot of stuff for my work, and I don’t have any problems when I instruct it properly. I’m also there to verify. I don’t have to deal with hallucinations ever, mostly just changing a word or two because I don’t like how it sounds (it uses overly complex words at times).

This is more about certain users being shit and either not checking their work or doing work they have no place doing. They would exist no matter what they use, it’s not the tools fault.

Tbh, I work in research and we would never use Wikipedia for anything. We can’t quote it and anytime I find a good tidbit on it and try and find the source, I usually get dead link or just something altogether false which doesn’t represent what the user wrote. Probably highly dependent on the subject though but the sourcing isn’t very rigorous.

Bless them though, it’s an amazing site and they are still doing an stellar job considering how big it is.

Show thread

Ulrich Mar 5

We are now rolling out a second, independent LLM review step. Translators must run the completed draft through a separate model

LOOOLOL what a bunch of morons

If you can’t translate it properly, you have no business translating it, you’re just making Wikipedia worse and eroding the trust users place in it.

Show thread

rossman Mar 5

Their intent i believe is good. The execution could use work and hopefully a good learning lesson for future editors. Translation is tough but that’s their job.

Show thread

minorkeys Mar 5

LLMs are essentially just guessing what a human would say. It’s the computer equivalent of fake it to you make it, bullshitting it’s way through writing an essay and hoping nobody checks your facts.

Show thread

Sir_Kevin Mar 5

Please, please don’t fucking ruin wikipedia! It’s possibly the most important website on the internet.

Show thread

carrylex Mar 5

en.wikipedia.org/…/Wikipedia:WikiProject_AI_Clean…

With version 117, Firefox finally speaks Chrome's translation language

Google Translate's Gemini Mode is Vulnerable to Prompt Injection

Wikipedia:WikiProject AI Cleanup - Wikipedia