Speaking of which, hot new robots.txt entry just dropped:
User-agent: GPTBot
Disallow: /
@olivierlacan I just do this:
echo "Blocking IP addresses..."
echo "[OpenAI egress ranges]"
iptables -A INPUT -s 23.102.140.112/28 -j DROP
iptables -A INPUT -s 23.98.142.176/28 -j DROP
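If you can't touch the firewall, the same check can be done in application code. A minimal sketch, assuming the two CIDR ranges quoted above are still current (they may well not be) and using a made-up helper name, `is_blocked`:

```python
import ipaddress

# Ranges copied from the iptables rules above; treat them as an assumption,
# since published egress ranges can change over time.
BLOCKED_RANGES = [
    ipaddress.ip_network("23.102.140.112/28"),
    ipaddress.ip_network("23.98.142.176/28"),
]


def is_blocked(client_ip: str) -> bool:
    """Return True if client_ip falls inside any of the blocked ranges."""
    address = ipaddress.ip_address(client_ip)
    return any(address in network for network in BLOCKED_RANGES)


if __name__ == "__main__":
    for ip in ("23.102.140.120", "23.98.142.192", "8.8.8.8"):
        print(ip, "blocked" if is_blocked(ip) else "allowed")
```

A /28 only covers 16 addresses, so this is a very narrow filter; it does nothing about crawlers coming from other networks.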
How can I add that to Mastodon?
@olivierlacan
So:
User-agent: GPTBot
Disallow: /
User-agent: CCBot
Disallow: /
User-agent: ChatGPT-User
Disallow: /
Any others I'm missing? What about the browser.ai and axiom.ai crapola? Are they good netizens that can be kept at bay via robots.txt?
It would be nice to intercept them and send them to a script generating a page with a few MB of "you're full of shit".
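Before trusting a robots.txt like the one above, it's easy to sanity-check which user agents it actually turns away. A rough sketch using Python's standard `urllib.robotparser`; the helper name `blocked_agents` is made up here, and this only tells you what a well-behaved crawler should do, not what any given bot really does:

```python
from urllib.robotparser import RobotFileParser

# The three rules from the post above.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /
"""


def blocked_agents(robots_txt, agents, path="/"):
    """Map each user agent to True if robots_txt forbids it from fetching path."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: not parser.can_fetch(agent, path) for agent in agents}


if __name__ == "__main__":
    agents = ["GPTBot", "CCBot", "ChatGPT-User", "Mozilla/5.0"]
    for agent, blocked in blocked_agents(ROBOTS_TXT, agents).items():
        print(f"{agent}: {'blocked' if blocked else 'allowed'}")
```

Anything not listed, like a regular browser user agent, stays allowed because there is no catch-all `User-agent: *` rule.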
@olivierlacan There are at least 3 people here with a similar idea... a script to feed it an infinite number of generated, cross-linked nonsense pages, each linking to more of the same, wasting resources and poisoning the dataset.
They don't even need to be anything advanced. Any basic Markov library spitting out vaguely (but unhelpfully) semi-coherent nonsense would do. It just needs to look enough like language while being garbage in HTML tags.
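Something like the idea above can be sketched in plain Python with no Markov library at all. Everything here is an assumption for illustration: the seed text, the `/maze/` URL prefix, and the function names. Each page is derived from its ID alone, so the "infinite" site needs no storage:

```python
import html
import random
from collections import defaultdict

# Any text works as seed material; it must contain more words than `order`.
SEED_TEXT = (
    "the crawler reads the page and the page links to another page "
    "and another page reads like language but the language says nothing "
    "and nothing links to the crawler again"
)


def build_chain(text, order=1):
    """Map each run of `order` words to the words seen right after it."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        chain[tuple(words[i:i + order])].append(words[i + order])
    return dict(chain)


def babble(chain, length, rng):
    """Walk the chain to produce `length` words of plausible-looking nonsense."""
    keys = list(chain)
    state = rng.choice(keys)
    out = list(state)
    while len(out) < length:
        followers = chain.get(state)
        if not followers:
            # Dead end: jump to a random state and keep going.
            state = rng.choice(keys)
            out.extend(state)
            continue
        out.append(rng.choice(followers))
        state = tuple(out[-len(state):])
    return " ".join(out[:length])


def nonsense_page(chain, page_id, links=5, words=200):
    """Build an HTML page of babble that links to more generated pages."""
    rng = random.Random(page_id)  # same ID always yields the same page
    body = html.escape(babble(chain, words, rng))
    items = "".join(
        f'<li><a href="/maze/{rng.getrandbits(32):08x}">'
        f"{html.escape(babble(chain, 3, rng))}</a></li>"
        for _ in range(links)
    )
    return (
        "<!doctype html><html><head><title>Notes</title></head>"
        f"<body><p>{body}</p><ul>{items}</ul></body></html>"
    )


if __name__ == "__main__":
    print(nonsense_page(build_chain(SEED_TEXT), "start"))
```

Hooking this up to a web server, and routing only the unwanted user agents to it, is left out; that part depends entirely on your setup.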
Please! Create or boost!

@olivierlacan it seems weird to post this with a link to OpenAI that requires login.
No thank you.