LLM translation models are going great
The word for finger isn't even in this sentence

I think people really underestimate how fragile LLMs are for auto-translation. You can put complete garbage into it where none of the words are real words and still get out plausible-sounding "translations" just because the LLM sees it as "close" to a real sentence, and then translates what it thinks that "close" sentence is based, once again, on what seems "close".

The whole benchmarking approach really does not help with this since benchmarks rarely include testing for failures. You need to test that garbage-in is recognized as garbage, otherwise you get garbage-out too.

@endrift yuuup

the previous (2016+) google translate approach already handled grammar well, and
also didn't have this problem, at least not as much. If you feed it nonsense, it would usually just transliterate it or sometimes keep it as is

it's noticeably worse since they switched to an llm, now it will just happily output a wrong answer instead

"llms are good for translation" my ass
@alice @endrift i feel like llms are “good” at translation only because companies have thrown billions at those systems. if we spent as much on actual MT systems, we’d have better results, because they’re specialised systems that don’t try to do absolutely everything