Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.
I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.

@Gargron Back in my pretentious high schooler days I read Dante's Comedy and, don't know where from, the version I found was one that was like... fully translated as poetry, like in the Italian* original, going as far as trying to replicate the rhyme structure

No computer's ever going to pull off anything even remotely that mad

@VileLasagna @Gargron oh that’s probably John Ciardi’s translation

@maco @Gargron I don't quite think so, it wasn't in English either =P

But cool to know there are others like it. It made for a super interesting read!

@maco @VileLasagna @Gargron oh, I would have assumed the Dorothy L Sayers translation. Good lord, are there two rhyming translations into English?
@kludgekml @maco @VileLasagna @Gargron
I think are quite a few such translations ( even Clive James had a go ! ), but afaik none that attempt to replicate the same rhyming scheme Dante used - which may well be impossible in English.

@kludgekml @maco @VileLasagna @Gargron I've got one by the Rev Francis Carey, which (I think) attempts to follow the rhyming. It's been a while since I tried reading that one, though.

(I primarily bought the volume because it's got loads of full-page plates of Doré illustrations).

@VileLasagna @Gargron I've read a few translations (into English) of the Divine Comedy, and the one that attempted to replicate the terza rima was 19th century, and (to me) almost unreadably difficult to deal with. Victorian translators weren't all that good, by modern standards, in my (admittedly fairly limited) experience.
@Gargron I'm willing to guess that machine translation of prose may serve two uses: firstly, as an assist for human translators (by preparing a very rough first cut, which they then have to refine), and secondly, as an assist for human editors in figuring out which foreign-language-works to pay a human translator (with or without AI assistance) to work on (translation costs money: knowing where to spend it is important). But those are assistive roles, not human-replacing ones.

@cstross @Gargron I have a friend who worked for years as a translator (English to French) but in recent years he found that he was no longer being asked to translate but to "post-edit" machine translations. It was taking him just as long, paying him less, and destroying his soul.

He now works as a tour guide.

@ccferrie @cstross @Gargron

I have a friend who used to work as a translator. Just like your friend, she hated being given machine-translated texts to polish up.

Edit: typo.

@ccferrie

THIS!

Same (on a much smaller case, as translating is "just" my 2nd job on the side, and it's mostly "just" roleplaying games) happens to me as well. I'd never take on such an offer.

@cstross @Gargron

@ccferrie @cstross @Gargron

Your friend should have asked for more money, as that was clearly to clean up someone's mess.

As you do when asked to clean stupid code.

#consulting #pricing

@ccferrie @cstross @Gargron my wife translated French to Chinese. She now works on exports inspections from France to Africa.
@ccferrie @cstross @Gargron Yep. This is human translators' situation. At least in "major" language pairs where there is masses of data.

@cstross @Gargron
Machine translations are more of a hindrance than a help, for translators. If you don't know both languages well, having an automated dictionary lookup could possibly be useful - but if you're a translator, and especially a translator of fiction, having a nuanceless draft will only take more time to figure out. And it will be irritating time, because reading mistranslations is a pain. Editing one's own drafts is hard enough!

As to B: Editors rely on readers, reviews /...

@cstross @Gargron
and the sort of awards that give a book the sheen of "worth reading".

My cred: I've translated more books than I can carry. Both fiction and technical.
My current position is that use of AI in translation is malpractice.

@Shunra @cstross @Gargron
My go-to example is the Esperanto translation of Alice Through The Looking Glass published by Evertype, which has 5 different translations of "Jabberwocky", each of which is absolutely "correct" and each of which is totally different from each other. Even the names of the poem differ.
In each case one can see the decisions the translator made balancing meter, rhyme, meaning, implications & nuance of the text, based on what it meant to them; how can a computer do that?
@HighlandLawyer @Shunra @cstross @Gargron i’ve always found the wealth of translations of ‘the little prince’ to be fascinating, and the way folks can trace back certain choices, showing that x translation was actually based off of y translation instead of the original french, &c.
@cstross @Gargron The first causes far more trouble than it saves. Easier just to do the work yourself. The second is true. Source: dad, who translated most King, Grafton, and Ludlum to Finnish.

@cstross @Gargron My (very not translator) impression is that human translators who have worked from rough machine translations, say that it’s harder than just translating the text.

Also, today I was in a work info session, where the talks were translated by some MS PoS thing, from Finnish to English. The results were horrendous, if hilarious. It might get better but I don’t really know why. Good simultaneous interpretation is kind of a human-level problem, really. Context matters!

@cstross @Gargron

I feel pretty dumb telling this to the master, but translating a literary work is much more than changing one word for another. Even it you keep all the meaning, it gets weird and doesn't flow; each language has its own rhythm and cadence. A good translator frequently has to completely rewrite a paragraph to keep the sense, the emotions and the flow of the story. Even worse, he needs to make it faithful to the original, which having intermediate versions can make harder.

I'm not a professional translator, but I have tried to translate some public domain stories, and found that automatic translation is a hindrance. I had to rewrite nearly all, looking always to the original version. It was too easy to drift far from it and get the text and the author absolutely distorted.

It is a work of art and love, not something a machine can do at all. Not even a part of it.

@cstross @Gargron chiming in with "every translator I know hates this"

And also my brief attempt at using speech to text in making subtitles for something gave me stuff that was just off enough in timing and correctness it took me more time to fix than doing it entirely by hand.

@cstross

I sometimes translate text (bilingual German-English). And I have edited a machine-translated book, and let me tell you it does NOT save time.

I basically had to go back and rewrite almost every sentence. It was so bad the publisher actually put me on the cover as translator.

DeepL is good for short non-fiction stuff, and even that needs to be polished if you want to use it for anything serious.

Do not rely on anything AI. Never.

@Gargron

@cstross @Gargron

As someone that has done some translation work, a machine translation template has ended up being more of a hinderance to me than help, it makes my brain lock into a pattern of the choices made by it, and it ends up being a way more clunky and weird feeling text than something I translated from the start.

@sotolf @cstross @Gargron i guess people assume the same thing about code too :)
@Gargron Crunchyroll also experiments with machine translated subtitles for Japanese anime, and the community (anglophone or not) is not happy with it.
@Tacas @Gargron This! The community is furious
@geolaw @Tacas @Gargron their subtitles for the English dub are also terrible in a way that can only be machine generated without any oversight
Besides "misheard" words, missing punctuation and names changing their spelling it is also never clear who says what or when a different person starts talking
I hate it so much (and I'm at least able to notice when they are wrong since I'm not deaf and can understand the audio in most cases. It must be worse for others)
@Larymir @Tacas @Gargron Not to mention, the spoiler effect of announcing the real name of the mysterious figure cloaked in shadow
@Larymir Not specifying who says what is a normal thing to do, even when it's made for hard of hearing/deaf people. That's unfortunately not only a genAI feature. @geolaw @Tacas @Gargron

@Gargron - I think the most recent translated book series I read was Liu Cixin's Remembrance of Earth's Past trilogy,* translated into English by Ken Liu and Joel Martinsen. I don't know a word of Chinese, but I can't imagine machine translation doing these great works any justice.

* Later adapted into the Netflix series 3 Body Problem

@funcrunch @Gargron

* Later TERRIBLY BADLY adapted into the Netflix series 3 Body PROBLEM

If you like the books, watch the Chinese original. Unfortunately it's only the first book so far.

@Gargron As someone who translates into English, I appreciate you saying this.

It's true there's a lot of very good literature in English, but I'm pretty convinced that the majority of anglophones don't know what they're missing by not reading translated books.

@Gargron plus doesn't everyone do the test of translate then translate back? They are garbage in a way that even a machine could recognize as garbage.
@virgilpierce @Gargron
There's an old joke from the 1960s about machine translation of the saying "the spirit is willing, but the flesh is weak" from English to Russian and then back again.
The result was "the vodka is good but the meat is rotten."

I've heard that one in German with some equivalent of "but the steak is not quite done".

@gdinwiddie @virgilpierce @Gargron

@gdinwiddie I quoted this a number of times over the past few decades :) (I remembered it as "the spirit is strong", BTW) @virgilpierce @Gargron
@Szescstopni The sentence I referenced is a bible verse.

@gdinwiddie Yes, and there a few different translations into Polish :)

"duch wprawdzie pełen chęci, ale ciało — słabe."

"duch wprawdzie ochoczy, ale ciało - słabe."

I wrote ho I remembered it for the record. And the vodka was strong, not good. Translations are a fascinating rabbit hole.

@Szescstopni Maybe it was “the vodka was strong” in English, also. I was a child when I heard my father tell that joke.
@Gargron I've just bought an English translation of Mikhail Bulgakov's The Master and Margarita. I wondered which was the best translation, and found a website with multiple translations of the opening page. It was really interesting how different they were. The one by Diana Burgin's and Katherine Tiernan O'Connor was the clear winner for me.
@Gargron Many times when I land on an auto-translated site I have to change the language to english because I don't even understand what's supposed to mean.
@Gargron Native speaker: I think you’re right. Though I have seen warnings recently about “new translation” editions on Amazon that are just LLM trash.
@Gargron @mastodon.social I absolutely agree.
On the other hand, although I'm a native spanish speaker, I've read a couple of books in english.
I think that US pleople don't even consider reading in any language but english.
@cktodon I've seen a work of Terry Pratchett "translated" (by a human though) from British English to US English. To even have the idea this could be useful enrages me.

@Gargron I've read translations of Haruki Murakami's novels in English and my native Danish - and I've found the latter *far* better. I can't judge the fidelity to the originals because I don't speak Japanese, but at least my reading experience with the Danish translations were a lot better - and I've probably read at least ten times as much English in my life as Danish.

I learned a while ago that the Danish translator of most (possibly all) Murakami's books has lived in Japan, knows Murakami personally, and talks to him about her translation work. And, well, the level of care put into those translations really shows.

@Gargron thank you. When writing the German subtitles for No Other Land I had to basically do it all by hand because the context window was so narrow it got everything wrong that could be lost in translation.
@Gargron machine learning can utilize all written text to help understand language and make translations better?? So basically my plagiarism machine is justified. Got em.

@ErikUden @Gargron I have seen a screenshot of a redirect link translated as "Nimm mich da".

Context matters.

@ErikUden @Gargron I work for Swiss Broadcast Company. Our devs did a wonderfull job in this regard. I get autotranslated subtitles that are amazingly good. It ain't literature but very good. It's a two tier system that joins the captions, then translation and then reconstructing the captions. Translation is done by Claude. Langs are not that big of a challange (DE FR IT EN). Only Rumantsch is a challange. Claude 3.5(!) Is pretty darn good though. Claude 4+ not so much
@decurtins @ErikUden @Gargron One Apertus focus is being multilingual, it may do a better job with Rumantsch. https://www.swiss-ai.org/apertus
Apertus | Swiss AI

Swiss AI
@slowenough @ErikUden @Gargron not yet. But there is something in the works 😊
@decurtins @ErikUden @Gargron
The things is that only people who speak the translated language sufficiently well, can assess the validity of the translation. So in specialist's hands the machine translation has some value. Still a nogo for literature.
Edit: typos
@Gargron Machine translated UIs are even worse a crime. LLMs don't have the slightest idea of the context of some random button, and (looking at Microsoft's German UI translations recently) seem to choose the worst possible word to drop into that.
@galaxis @Gargron Or Google. Last week I stumbled upon an Google admin interface where the checkbox with the English label "Enforcement" was translated in French with the equivalent of "Activation". It was about 2FA, and those both words doesn't mean at all the same thing in that context!
@tdelmas @galaxis @Gargron Google Maps keeps asking if I want to “navegar a la página principal” — go to the homepage—after I drop a friend off. I understand that in some contexts, that’s how “go home” is translated, but…no.
@maco @galaxis @Gargron thanks for the laughs 😂
@tdelmas This doesn't mean it was translated with an LLM, though. Very often human translators, especially in tech, have to translate a single label with no context around it, usually they try to be generic enough that the label can be used in many places unchanged. But sometimes that exact label has a slightly different meaning n different contexts, like in your example.
@galaxis @Gargron To be fair, it happens with human translations too: I fixed some translations in open-source projects in which the translation interface only showed the text, but not the context, and the previous translator translated it wrongly. Example: Wikipedia had "Large (width)" (Largo) translated to "Huge" (Grande). If you check the edit history for this entry in Wikimedia, that's my name fixing this issue. But, sure, it's mostly common in machine translations, as I commented in some other toot in this thread.