Mastodawn

endrift 🏳️‍⚧️May 26

LLM translation models are going great

Show thread

endrift 🏳️‍⚧️May 26

The word for finger isn't even in this sentence

Show thread

endrift 🏳️‍⚧️May 26

I think people really underestimate how fragile LLMs are for auto-translation. You can put complete garbage into it where none of the words are real words and still get out plausible-sounding "translations" just because the LLM sees it as "close" to a real sentence, and then translates what it thinks that "close" sentence is based, once again, on what seems "close".

The whole benchmarking approach really does not help with this since benchmarks rarely include testing for failures. You need to test that garbage-in is recognized as garbage, otherwise you get garbage-out too.

Show thread

Alice

@endrift yuuup

the previous (2016+) google translate approach already handled grammar well, and also didn't have this problem, at least not as much. If you feed it nonsense, it would usually just transliterate it or sometimes keep it as is

it's noticeably worse since they switched to an llm, now it will just happily output a wrong answer instead

"llms are good for translation" my ass

Show thread

yuki - queen of the snow May 27

@alice @endrift i feel like llms are “good” at translation only because companies have thrown billions at those systems. if we spent as much on actual MT systems, we’d have better results, because they’re specialised systems that don’t try to do absolutely everything