Traffic sources to my #SelfHosted #Gitea instance. You can clearly see where the real visits are and where the AI scrapers are. Last time I checked, they weren’t triggering any analytic events. They are definitely improving.
#aiscrapers #ai #llm #LLMs #aislop #homelab #selfhost #selfhosting
30% of web search traffic goes through #AI now.
The same folk who pontificate about lost web traffic will gleefully tell you they are blocking "#Aiscrapers".
#AIBots may lead to the end of the internet as we know it
In recent weeks, #OpenDemocracy’s website has been repeatedly brought down by an army of bots. We’re not the only ones
Matthew Linares
20 February 2026
Excerpt: "Slater explained that 'the traffic often arrives through anonymous residential IPs', referring to residential proxy networks that route internet traffic through intermediary servers using IP addresses assigned by internet service providers to real homeowners. This, he said, makes it 'hard to distinguish ‘normal users’ from automated collection'. [That's not right and needs to be changed!!!]
"'We're being forced into permanent defence mode. #ResidentialProxyNetworks let #AIScrapers hide in plain sight, rotate identities, and extract data at scale. That shifts real costs onto projects that exist to serve people, not feed training pipelines.'"
Read more:
https://www.opendemocracy.net/en/ai-chatbots-scraper-bots-chatgpt-website-offline-change-internet/
#AISucks #AI #DataMining #Internet #Websites #TechNews #AI #ArtificialIntelligence #BigTech #TechBros
Happy to see some updates on AI.ROBOTS.TXT : « A list of AI agents and robots to block » 🤖 🚫
Webspace Invaders - Matthias Ott
(…) In their hunger for data to train their large language models, companies from all over the world are systematically harvesting every word I’ve ever published, feeding it into their language models to keep them fresh – and the side effect, the collateral damage, is that Kevin in Montreal now can’t read my articles because my hosting provider decided the solution was to block Canada and half the rest of the world. I can't help getting really, really angry about all this and what it does to the web I used to love.
#ai #aiScrapers #collateraldamage #exploitation #otemporaomores #Web

There’s something happening on the Web at the moment that almost feels like watching that old arcade game Space Invaders play out across our servers. Bots and scrapers marching in formation, attacking our servers wave after wave, systematically requesting page after page, relentlessly filling their data stores while we watch our access logs fill up.
Looks like a new player leads the #ipv4games leaderboard. Who else but a #proxy provider, closely followed by another one of its kind. Not sure this is the best advertisement for a company that would like to appear "legit".