If you've been blocking AI from scraping your website, there's another one to add. This time it's Google.
I've updated my post on the subject.
https://neil-clarke.com/block-the-bots-that-feed-ai-models-by-scraping-your-website/
If you've been blocking AI from scraping your website, there's another one to add. This time it's Google.
I've updated my post on the subject.
https://neil-clarke.com/block-the-bots-that-feed-ai-models-by-scraping-your-website/
Great post, thanks for all the reserach. Maybe add FacebookBot to block Meta’s efforts?
“FacebookBot crawls public web pages to improve language models for our speech recognition technology.”
@clarkesworld Also I added a link from my somewhat popular #Django robots.txt post: