“ChatGPT Has Already Polluted the Internet So Badly That It's Hobbling Future AI Development”

https://futurism.com/chatgpt-polluted-ruined-ai-development

Also, a recurring security concern (and bandwidth abuse issue) with training data sets is that they generally don't store copies of the data and have few safeguards for checking whether it has changed.

So, y'know, if a pre-2020 domain expires and its content is replaced entirely with slop, many datasets will mark the slop as pre-2020 data.
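
To make that concrete, here is a minimal sketch of the kind of safeguard that is usually missing, assuming a dataset that keeps a content digest next to each URL. The `Entry` record, its field names, and the example URL are all hypothetical and not taken from any real dataset format.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """Hypothetical dataset record: a pointer plus a fingerprint."""
    url: str        # where the content was originally found
    sha256: str     # digest of the bytes seen at collection time
    collected: str  # date label the dataset assigns to the content


def digest(content: bytes) -> str:
    """Hex SHA-256 digest of the raw bytes."""
    return hashlib.sha256(content).hexdigest()


def is_unchanged(entry: Entry, fetched: bytes) -> bool:
    """True only if the bytes fetched now match what was indexed."""
    return digest(fetched) == entry.sha256


# Content as it looked when the dataset was built.
original = b"<html>A 2019 blog post written by a person.</html>"
entry = Entry("https://example.com/post", digest(original), "2019-06-01")

# Content served later, after the domain expired and was re-registered.
replaced = b"<html>Generated filler on a re-registered domain.</html>"

print(is_unchanged(entry, original))  # True
print(is_unchanged(entry, replaced))  # False
```

Without the stored digest, the second fetch would simply inherit the 2019 date label. The digest only detects that something changed; it can't bring the original content back, which is why not storing copies matters.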

There may be no undoing the vast amounts of pollution wreaked by ChatGPT. And that's just tough luck for any AI models that come after it.

Futurism

@baldur

AI is like a parachute. In the hands of experts with deep analytical skills (aka "scientists") it can be a very helpful tool.

But as a labor-saving device for the masses that can also make a few people lots of money, it's gonna cause the kind of destruction that all bad advice mindlessly followed will tend to do.

Also, scientists check their work.

J.Q. Public generally doesn't, as countless anecdotes on social media will attest to.

Parachutes for the masses? Nuh-uh.