TIL that Kenyan workers have been used so much to train AI systems, that standard writing by Kenyan people is often flagged as AI generated while it is not (which means that they can get discriminated for jobs / exams etc)
https://marcusolang.substack.com/p/im-kenyan-i-dont-write-like-chatgpt

Edit: many people raised below that the article is talking about texts written in very classically trained English detected as AI generated, which is the case for many Kenyans. It is documented that many Kenyan workers have been hired to train LLMs, but I made an assumption that it was the reason for this detection while it may not be. Sorry about that, thanks for the feedback (and feel free to continue the discussion here)

I'm Kenyan. I Don't Write Like ChatGPT. ChatGPT Writes Like Me.

I'm calm. I'm calm. I promise.

this man's mind

@tek
> "Kenyan workers have been used so much to train AI systems"

Where in the linked post does it say that?

@ki @tek it rather says thay tons of old colonial british english babble was ingested by the slop machine because there is no copyright on it anymore.
@f4grx @ki @tek since when do they care about copyright?
@elexia @f4grx @tek
they always did, as long as it is their copyright