You know what, you know what the best part of all this is, I read this
https://xaselgio.net/posts/26.poisoning-knowledge/
and I went wow, that sounds like a pain in the ass, good luck with that.
...
I HAVE A WEBSITE
I literally read that whole thing and did I take any preemptive measures, did I have a trap set up for the bots, did I even have a .htaccess that would at least block some of them, did I bollocks
You know when you see nature documentaries and the lion takes down one of the antelopes and all its mates just stand around watching it get eaten like four feet away, going wow, that looks like a pain in the ass, good luck with that
THAT'S ME, THAT IS.
Oh and you know how like eighty percent of y'all have your own websites, hey guys the lion's eating me maybe you might wanna do something about it because it'll probably have you for dessert
Antelopes watching their mate get et and going "Wow, look at that," that's you that is
See everyone going "Wow my websites keep going offline because of AI bots"
Me: oh no that's awful I'm so sorry that's happening to 🎶yooooouuuuuu and not meeeee🎵
My own website goes offline because of AI bots: me doing surprised Pikachu face
Me, just a head sticking out of the lion's mouth: 🦌 It happened to me WAY sooner than I thought it would!
You, eyes inches away watching me get et: 😰 Yeah, I see that! Wow!
🦌 Like it happened REALLY suddenly, I wish I'd prepared
😰 Yeah, that would've been smart, huh!
🦌 Have you done anything to get ready for this thing that's definitely gonna happen to you?
😰 I'll get right on that!
🦌 *muffled voice from 🦁's belly* Yeah, like, soon!
😰 It's on the list!
🦌 Like, now maybe!
😰 Definitely before the end of the month! Wow, that's rough buddy
🦌 yeah wow
🦁 buuurrrrp
We're all the main character in our own stories but have you considered that there are many genres of story
Mine is apparently A Cautionary Tale
I am a slacker: I tail my log to see what IP addresses are hitting me, then look up what ASN range each one is from and feed that into fail2ban for a while.
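For anyone who wants the "see what's hitting me" step without eyeballing tail output, here is a minimal sketch. It assumes the client IP is the first whitespace-separated field of each line (as in the common/combined access log format); the function name and the sample lines are made up for illustration.

```python
from collections import Counter


def count_hits(log_lines):
    """Count requests per client IP.

    Assumption: the IP is the first whitespace-separated field on each line.
    """
    hits = Counter()
    for line in log_lines:
        fields = line.split(maxsplit=1)
        if fields:  # skip blank lines
            hits[fields[0]] += 1
    return hits


# Made-up sample lines, not real log data.
sample = [
    '216.73.216.62 - - [01/Jan/2025:00:00:01 +0000] "GET /a HTTP/1.1" 200 512',
    '216.73.216.62 - - [01/Jan/2025:00:00:02 +0000] "GET /b HTTP/1.1" 200 512',
    '14.176.135.196 - - [01/Jan/2025:00:00:03 +0000] "GET /c HTTP/1.1" 200 512',
    '',
]
top_talkers = count_hits(sample).most_common()
```

On a real server you would pass an open file instead of the sample list, e.g. `count_hits(open("access.log"))`, and look at `.most_common(20)`. The ASN lookup and the fail2ban part are separate steps and are not covered here.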
One of my sites they like scraping is a moin site.
Moin has a MonthCalendar macro where you can click through to a different day, and then to another day, and so on forever. I've decided any address indexing the calendar page after 2100 or before 1950 should be blocked.
I just need to write a fail2ban rule for that.
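Not the fail2ban rule itself, but the check it would encode can be sketched like this. Big assumption: the calendar day pages show up in the request path as `/YYYY-MM-DD` at the end of the path. The actual URLs on that Moin site may look different, so the regex is hypothetical, as are the function name and the example paths.

```python
import re

# Hypothetical URL shape: the path ends in /YYYY-MM-DD, optionally followed
# by a query string or fragment. Adjust to whatever the real log shows.
DATE_IN_PATH = re.compile(r"/(\d{4})-\d{2}-\d{2}(?:[?#]|$)")


def is_out_of_range(path, min_year=1950, max_year=2100):
    """Return True if the path names a calendar day before min_year or after max_year."""
    match = DATE_IN_PATH.search(path)
    if match is None:
        return False  # not a calendar day page, leave it alone
    year = int(match.group(1))
    return year < min_year or year > max_year


# Made-up example paths.
examples = {
    "/Calendar/2101-01-01": is_out_of_range("/Calendar/2101-01-01"),
    "/Calendar/2024-06-01": is_out_of_range("/Calendar/2024-06-01"),
    "/Calendar/1949-12-31": is_out_of_range("/Calendar/1949-12-31"),
}
```

No human is browsing a wiki calendar for the year 2500, so anything that trips this is almost certainly a crawler following the next-day links.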
@alex has gotten much more automated with his blocking scripts
@alienghic @ifixcoinops @alex the issue with block lists is it’s trivial to spawn more robots (maybe not forever with IPv4 addresses but anyway)
Instead of this “captcha” bullshit where I have to tick boxes with more than 10% motorcycle only, what if we positively identified humans and had a cryptocurrency-like scheme for maintaining the database?
Call it HumanCoin. It’s proof of stake. You can spend coins to browse websites, invite people to the system, or dob in someone for being a suspected robot. If a sufficient threshold is reached for a suspect ID, all their coins are burnt. The coins of whoever invited them are also burnt. A bonus is paid to the reporters (but not enough to incentivise abuse by bots).
I’ve thought about this for far too short a time for it to possibly work.
The heavy handed solution is to block by IP address ranges.
For instance 216.73.216.0/22 is Anthropic.
They're very aggressive; this is how many times one of their addresses shows up in today's access.log:
8436 216.73.216.62
They've got a couple of IP addresses they use, so off with the whole ASN.
The problem is there's a bunch of free apps and VPNs that make money by using residential IPs as proxies, so you can also get vast numbers of requests spread all over a Vietnam telco, like
14.176.135.196 is in the range 14.160.0.0/11 VNPT-VN
and was behaving suspiciously.
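Checking whether an address falls inside a range you've already decided to block doesn't need any bit-twiddling by hand; Python's standard `ipaddress` module does it. A minimal sketch using the two ranges mentioned above (the list name `BLOCKED` and the function name are made up for illustration):

```python
import ipaddress

# The two ranges quoted in the posts above.
BLOCKED = [
    ipaddress.ip_network("216.73.216.0/22"),
    ipaddress.ip_network("14.160.0.0/11"),
]


def matching_network(ip, networks=BLOCKED):
    """Return the first blocked network containing ip, or None if there is none."""
    addr = ipaddress.ip_address(ip)
    for net in networks:
        if addr in net:
            return net
    return None


hit_a = matching_network("216.73.216.62")
hit_b = matching_network("14.176.135.196")
miss = matching_network("192.0.2.1")  # documentation-only address, not in either range
```

This only tells you that an address is inside a range; whether a whole /11 of residential customers deserves blocking is the judgment call the posts above are wrestling with.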
In the story of someone who runs things on the internet, I'm just hoping my genre is "god's own fool."
@ifixcoinops oh, I thought it was “Welcome to Jackass”
(not giving a free character assessment, I mean in the sense of the viewer experiencing schadenfreude)
@ifixcoinops >.>
yeah idk i been reading this like 'yup you're right i should definitely get on that soon. i'll move it higher up the list.' :P