To stop AIs scraping your website for content, add this to your robots.txt file on your website.

Thanks to Neil Clarke for most of these.

User-agent: CCBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: Omgilibot
Disallow: /

User-Agent: FacebookBot
Disallow: /

It won't stop everything, but it does cover a lot of the major ones.
@patricksamphire any of these even read and parse the robots.txt at all?
@_Nec @patricksamphire well if they don’t and it ever comes out, you auto-win the resulting lawsuit and can retire on your own island 😂
@jkbecker @patricksamphire Is that regulated by FCC? Lawsuit on what base?
(always found "robots.txt" a hilarious joke, is it actually legally binding?)
@_Nec @jkbecker No, it's not legally binding. However, all the bots listed actually do obey it.