Cloudflare announces AI Labyrinth, which uses AI-generated content to confuse and waste the resources of AI Crawlers and bots that ignore “no crawl” directives.

https://programming.dev/post/27302847


That’s just BattleBots with a different name.
You’re not wrong.
Ok, I now need a screensaver that I can tie to a Cloudflare instance that visualizes the generated “maze” and a bot’s attempts to get out.
You should probably just let an AI generate that.
They should program the actions and reactions of each system to actual battle bots and then televise the event for our entertainment.
Then get bored when it devolves into a wedge meta.
Somehow one of them still invents Tombstone.
Putting a chopped-down lawnmower blade in front of a thing, and having it spin at hard drive speeds, is honestly kinda terrifying…
No, it is far less environmentally friendly than rc bots made of metal, plastic, and electronics full of nasty little things like batteries blasting, sawing, burning and smashing one another to pieces.
Will this further fuck up the inaccurate nature of AI results? While I’m rooting against shitty AI usage, the general population is still trusting it and making results worse will, most likely, make people believe even more wrong stuff.
The article says it’s not poisoning the AI data, only providing valid facts. The scraper still gets content, just not the content it was aiming for.
and the data for the LLM is now salted with procedural garbage. it’s great!
@ladel @XeroxCool You could probably add a separate routine to poison the data.
Until the AI generating the content starts hallucinating.
if you’re dumb enough to trust a large language model because someone told you “iTs Ai!” no amount of facts will be of great utility to you.
That take would be more digestible if I wasn’t stuck on the same planet as those people.
im saying they want to be lied to. it would be disrespectful to offer them the truth.

Thank you for catching that. Even reading through again, I couldn’t find it while skimming. With the mention of X2 and RSS, I assumed that paragraph would just be more technical description outside my knowledge. Instead, what I did home in on was

“No real human would go four links deep into a maze of AI-generated nonsense.”

Leading me to be pessimistic.

If you’re dumb enough and care little enough about the truth, I’m not really going to try coming at you with rationality and sense. I’m down to do an accelerationism here. fuck it. burn it down.

remember; these companies all run at a loss. if we can hold them off for a while, they’ll stop getting so much investment.

The problem I see with poisoning the data is the AIs being trained for law enforcement hallucinating false facts used to arrest and convict people.

that’s the entire point of laws, though, and it was already being used for that.

giving the laws better law stuff will not improve them. the law is malevolent. you cannot fix it by offering to help.

Law enforcement AI is a terrible idea and it doesn’t matter whether you feed it “false facts” or not. There’s enough bias in law enforcement that the data is essentially always poisoned.
Law enforcement doesn’t convict anyone, that’s a judge’s job. If a LEO falsely arrests you, you can sue them, and it should be pretty open-and-shut if it’s due to AI hallucination. Enough of that and LEO will stop it.
More likely they will remove your ability to sue them, if you are talking about the USA and many other countries.

They aren’t poisoning the data with disinformation.

They’re poisoning it with accurate, but irrelevant information.

For example, if a bot is crawling sites relating to computer programming, or weather, this tool might lure the crawler into pages related to animal facts, or human biology.

@melpomenesclevage @XeroxCool “Move fast, break things,” can also be applied to bad things.
@XeroxCool @Tea If they don’t attract the worm they’ll never learn. That is, GOOD!

while allowing legitimate users and verified crawlers to browse normally.

What is a “verified crawler” though? What I worry about is, is it only big companies like Google that are allowed to have them now?

I assume a crawler which adheres to robots.txt
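For anyone wondering what "adheres to robots.txt" looks like from the crawler's side, here's a minimal sketch using Python's standard `urllib.robotparser`. The rules, the `ExampleAIBot` name, and the `may_fetch` helper are all made up for illustration; none of it comes from the Cloudflare article.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: one named bot is banned entirely,
# everyone else is only kept out of /private/.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def may_fetch(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt allows user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("ExampleAIBot", "SomeOtherBot"):
        for path in ("/page", "/private/notes"):
            url = "https://example.com" + path
            print(agent, path, may_fetch(ROBOTS_TXT, agent, url))
```

A polite crawler runs a check like this before every request. The catch is that it's purely voluntary, which is why it can't be the whole story behind "verified".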
I would love to think so. But the word “verified” suggests more.
IP verification is a not uncommon method for commercial crawlers
Googlebot and Other Google Crawler Verification | Google Search Central | Documentation | Google for Developers

You can check if a web crawler really is Googlebot (or another Google user agent). Follow these steps to verify that Googlebot is the crawler.

Google for Developers
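The check that Google page describes is a reverse DNS lookup on the connecting IP, a check that the hostname is under a Google domain, then a forward lookup to confirm the hostname resolves back to the same IP. A rough sketch of that, where `is_verified_crawler` and the two lookup helpers are my own names and the default resolvers only handle IPv4:

```python
import socket

# Domains Google documents for its crawlers' reverse DNS names.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com", ".googleusercontent.com")


def reverse_lookup(ip: str) -> str:
    """PTR lookup: IP address -> hostname."""
    return socket.gethostbyaddr(ip)[0]


def forward_lookup(host: str) -> list:
    """A lookup: hostname -> list of IPv4 addresses."""
    return socket.gethostbyname_ex(host)[2]


def is_verified_crawler(ip, suffixes=GOOGLE_SUFFIXES,
                        reverse=reverse_lookup, forward=forward_lookup):
    """Forward-confirmed reverse DNS check for a claimed crawler IP."""
    try:
        host = reverse(ip)
    except OSError:  # socket.herror / socket.gaierror are OSError subclasses
        return False
    # The user agent string is ignored on purpose: anyone can fake it.
    if not host.lower().rstrip(".").endswith(suffixes):
        return False
    try:
        # The hostname must resolve back to the IP that connected.
        return ip in forward(host)
    except OSError:
        return False
```

The resolvers are parameters so the logic can be tried without touching real DNS. It also shows why this kind of "verified" favours operators who publish stable hostnames or IP ranges, which in practice means the larger ones.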
I dunno. I don’t find any sympathy with any of these fuckers though. this is not a generally useful technology, it is not something the average person ever needs to see, and honestly, just fuck em. Fuck anyone messing with open source to engorge the garbage dispenser.

Any accessibility service will also see the “hidden links”, and while a blind person with a screen reader will notice if they wander off into generated pages, it will waste their time too.

Also, I don’t know about you, but I absolutely have a use for crawling X, Google maps, Reddit, YouTube, and getting information from there without interacting with the service myself.

yeah. it’s pretty fucked. hopefully it’s temporary.

so do we make everything inaccessible to everyone, or just inaccessible to disabled people? we don’t have a way to include them yet. we should work on it, but we are not the ones who fucked accessibility.

yeah. search engine web crawlers are a public service. they are responsible. but we are in a conflict. we must struggle tooth and nail against capital for every nice thing.

I’d assume they’re using aria tags to hide the links from screen readers, at least that’s what the article seems to imply.
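To make that guess concrete: if the decoy links carried `aria-hidden="true"`, assistive tech (or any client that respects the attribute) could skip them. This is only a sketch of that idea. The sample markup, `VisibleLinkCollector`, and `visible_links` are hypothetical, and I don't know what markup Cloudflare actually emits.

```python
from html.parser import HTMLParser


class VisibleLinkCollector(HTMLParser):
    """Collect hrefs of <a> tags that are not marked aria-hidden="true"."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attr = dict(attrs)
        # Skip links explicitly hidden from the accessibility tree.
        if attr.get("aria-hidden") == "true":
            return
        if attr.get("href"):
            self.links.append(attr["href"])


def visible_links(html: str) -> list:
    collector = VisibleLinkCollector()
    collector.feed(html)
    return collector.links


# Hypothetical page: one real link, one hidden decoy link.
SAMPLE = (
    '<a href="/docs">Docs</a>'
    '<a href="/maze/1" aria-hidden="true" tabindex="-1" rel="nofollow">more</a>'
)

if __name__ == "__main__":
    print(visible_links(SAMPLE))
```

A scraper could do the same filtering, of course, so this would only protect screen reader users, not keep the maze secret.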
Cloudflare isn’t the best at blocking things. As long as your crawler isn’t horribly misconfigured you shouldn’t have many issues.
this is such a fucking stupid situation: we finally got a somewhat faster internet, and these bots messing with each other are hogging the bandwidth.
nothing can be improved while capitalism exists; all improvement will be seized and used to oppress.
Lost on Lemmy?
no, responding to a post about exactly that thing.

He is not wrong. Unless people start to take steps, the dependency on tech will be used to chain most of us. Granted, these chains will be the kindest and gentlest chains seen in a long time.

Social revolution lives on in decentralized services, like this; the true battles will be later though. This year is a mild warm-up. I can’t imagine the challenges that await many.

That’s not really relevant here. This is more of a “genie is out of the bottle and now we have to learn how to deal with it” situation. The idea and technology of bots and AI training already exist. There’s no socioeconomic system that is going to magically make that go away.
I don’t need it to not exist. I need it to stay the fuck out of everyone’s lives unless they work in a lab of some kind.
everyone remembers Tamagotchi, they were like a digital houseplant.
cool, but where do you get them?
the used market

so you’re saying it’s a niche toy you can get if you really want one, but nobody’s pushing that shit on you, and if you never want to talk about one again in your life, you can probably do that?

I would like if large language models were in this position. I don’t think you understand the degree to which our productive capacity and infrastructure are committed to this technology. it’s a lot. basically all the cutting edge computer chips being made are specialist chips for processing large (whatever) models, including the engineering to back it. that means everything else is bumped down a generation or five.

then there’s the amount of electricity being put towards these things. we are in a climate disaster, we do not have green energy, and these things are drawing power in the high GW/low TW range.

water is also a lot more precarious than a lot of people want to think about. these things are using lots and lots of good drinkable water to cool those specialist chips, and then it just gets thrown out, because I assume it’s cheaper than cooling the hot water back down.

You can still buy new ones. Take a nap.
somehow I get the impression all this defense of Tamagotchis is not about Tamagotchis.
I think the point you’re missing is that without the monetary incentive that arises under capitalism, there would be very little drive for anyone to build these wasteful AI systems. It’s difficult to imagine a group of people voluntarily amassing and then using the resources necessary for “AI” absent the desire to cash in on their investment. So you’re correct that an alternative economic system won’t “magically” make LLMs go away. I think it unlikely, however, that such wasteful nonsense would be used on any meaningful scale absent the perverse incentives of capitalism.

It’s difficult to imagine a group of people voluntarily amassing and then using the resources necessary for “AI” absent the desire to cash in on their investment.

No imagination necessary.

I mean Dmitry Pospelov was arguing for AI control in the Soviet Union clear back in the 70s.

The Soviet scientific programme on AI: if a machine cannot ‘think’, can it ‘control’? | BJHS Themes | Cambridge Core

Cambridge Core
Just another way the state-capitalist Soviet Union was closer to capitalism than socialism.
How can authority not exist? That’s staggeringly broad
given what domains we’re hosted on; i think we’ve both had a version of this conversation about a thousand times, and both ended up where we ended up. do you want us to explain hypothetically-at-but-mostly-past each other again? I can do it while un-sober, if you like.
Not who you responded to but yeah I want to hear a drug fuelled rant I don’t even care what topic
Not the one you responded to but how about the tiredness-fuelled rant that I replied to the other person with.
then don’t ask for that version, dear. I did say ‘by request’.
This is the most boring drug fuelled rant I’ve heard in my life
you haven’t, but it would be. that’s why I offered to add the drugs.
I… Don’t think I’ve heard anyone claim authority shouldn’t exist.