To tell AI crawlers not to scrape your website for content, add these rules to your site's robots.txt file. (This only works for crawlers that honor robots.txt.)

Thanks to Neil Clarke for most of these.

User-agent: CCBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: Omgilibot
Disallow: /

User-agent: FacebookBot
Disallow: /

Add this one to the list:

User-agent: Amazonbot
Disallow: /

Thanks go to @fasterandworse for this one.
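
If you want to check that the rules do what you expect before uploading them, here is a minimal sketch using Python's standard `urllib.robotparser`. The helper names `build_rules` and `blocked_agents` and the `https://example.com/` URL are made up for illustration, and this only shows how a standards-following parser reads the file. It says nothing about whether a given crawler actually obeys it.

```python
from urllib.robotparser import RobotFileParser

# User agents taken from the list above.
AI_AGENTS = [
    "CCBot",
    "ChatGPT-User",
    "GPTBot",
    "Google-Extended",
    "Omgilibot",
    "FacebookBot",
    "Amazonbot",
]


def build_rules(agents):
    """Return robots.txt text that disallows the whole site for each agent."""
    return "\n\n".join(f"User-agent: {a}\nDisallow: /" for a in agents) + "\n"


def blocked_agents(robots_txt, agents, url="https://example.com/"):
    """Return the agents that the given robots.txt text forbids from fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [a for a in agents if not parser.can_fetch(a, url)]


if __name__ == "__main__":
    rules = build_rules(AI_AGENTS)
    print(rules)
    print("Blocked:", blocked_agents(rules, AI_AGENTS))
    # An agent that is not listed (Googlebot here) should still be allowed.
    print("Googlebot blocked:", bool(blocked_agents(rules, ["Googlebot"])))
```

Agents that are not named in the file stay allowed, so ordinary search indexing is unaffected by these rules.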

@patricksamphire @fasterandworse why would I want that? I want the models to be better and I believe I write excellent content in my blog...
@I @fasterandworse Cool. No one said you had to.
@patricksamphire @fasterandworse what I fail to see is why I would want to. My blog is public, so it can be accessed and read by man or machine. What is my interest in blocking it? Even if I were not in favor of shared knowledge, open source and open data, I still don't see why I should want to block progress. It's not like I am losing money or reputation or suffering any other measurable damage by leaving this door open.
@I @fasterandworse Cool. A lot of people are losing jobs to "AI" products trained on scraped data. I'm not in favour of that, so I'm choosing not to let AI train on my work. You can choose otherwise, entirely freely.