...just in case anybody wonders why I have a rather dim view of the whole AI industry...
@BartV @corbet @lwn @blenderartists
Who would have a motive to DDoS LWN or Blender? Other than Microsoft and Adobe, of course.
Most likely those IPs are from residential proxies so you can't do an easy filtering rule like "Block all IPs in AWS/GCP/Azure address spaces". There were revelations last week than half of all Smart TV apps include residential proxy SDKs.
At SFC, we've been seeing the primary culprit is .cn IP numbers and Zuckerberg.
& I can confirm User-Agent is fiction, at least from those parties. robots.txt of course ignored.
I'm going to sleep. If anyone is wondering why the #inkscape website is down, it's because it's being crushed by AI bots again, so I've taken it offline. I tried to put in more ip-blocks, and then tried to install Anubis. But I'm not a sysadmin and so this is just taking time to do. Can't do much of it tonight (this morning) so I'm afraid it's staying offline. Edit: Thanks for the help from @[email protected] getting the server back online. It appears to have been Facebook attacking.
Anyone scrapping to (re)train LLMs is a selfish capitalist who doesn't care who they inconvenience &/or hurt.
We've enough LLMs for the foreseeable future. None are as Free and Open as we'd like, but I'm sure it's not someone trying to build a truly #FOSS LLM that's DDoS'ing #LWN rn.
All new #LLM training should stop *immediately*; continuing now on training is unconscionable. If you work for a company that is still training, I urge you to resign in protest.
Agreed in the main, @bkuhn.
I imagine an obvious response is “if we don't keep putting *new* data into the #LLM, it will fall out of date”. As far as it goes, yes that's true.
To which I'd respond: so what, why is that so urgent? Why should the internet be degraded for that? Not enough to justify the hammering of websites, the bulldozing of consent, the active deception to pass blocks, the refusal to countenance anything except #Hyperscaler interests. Stop it all, now.
@bignose Even more than whether it is urgent, & even whether or not you're pro or against *using* LLM-gen-AI, the world is still figuring out if these monstrosities they are useful *for* (if anything).
The ballyhoo is clearly wrong, but I also think those who say they are not useful for anything are also wrong.
We (humanity) need at least two years to even begin to understand what we have & what it's for. Let's pause and figure that out without capitalists in the driver's seat.
@corbet @lwn From my work experience I can say that the only remediation at that scale is #Blackholing traffic at #IX-level from all malicious ASNs used for said #DDoS and sending angry #AbuseReport mails every originator and their Upstreams.
- Make it THEIR PROBLEM!
Also let us know of the IP ranges so everyone else can block them as well!