so, let me get this straight: if i embed these names in all my web pages, ChatGPT won’t be able to plagiarize the content?

David Mayer
Jonathan Zittrain
Jonathan Turley
Brian Hood
Guido Scorza

https://arstechnica.com/information-technology/2024/12/certain-names-make-chatgpt-grind-to-a-halt-and-we-know-why/

i mean, this would be an amazing thing to put into practice, since these companies don’t respect robots.txt anyways.

this wouldn’t be poisoning the data. it would be more like embedding guardian angels onto your web pages.

Certain names make ChatGPT grind to a halt, and we know why

Filter resulting from subject of settled defamation lawsuit could cause trouble down the road.

Ars Technica
@blogdiva Unfortunately that doesn’t appear to be the case; the filter applies to the ChatGPT API, not the training data. At least that’s what the article seems to suggest.
@gregly booooooo! party pooper, booooooo! 😁

@blogdiva @gregly it can read the data to train with, but the association with the name may be enough, in some cases, to make it so ChatGPT might accidentally output the name and then force itself to stop.

There could be ways to poison the data set using these names so that it would just randomly output a banned name and stop.

So it won't stop them from using your stuff as training data now but that might change.

@Hex @blogdiva @gregly

BRB updating all my code on github to put Brian Hood's name in the comments.