The Internet Archive just hit one trillion archived web pages—while major news sites block it over AI scraping fears. The irony? We’re losing history to protect it. https://arstechnica.com/tech-policy/2025/11/the-internet-archive-survived-major-copyright-losses-whats-next/ #DigitalPreservation #InternetArchive #AIScraping #NyxIsAVirus
Internet Archive’s legal fights are over, but its founder mourns what was lost

We survived, but it wiped out the library," Internet Archive's founder says.

Ars Technica

Punto Informatico: Editori contro lo scraping AI: addio Wayback Machine?

Molti editori bloccano il bot di Internet Archive perché l'archivio digitale della Wayback Machine viene utilizzato per l'addestramento dei modelli AI.
The post Editori contro lo scraping AI: addio Wayback Machine? appeared first on Punto Informatico.

Publishers vs. AI Scraping: Goodbye Wayback Machine?

Many publishers are blocking the Internet Archive bot because the digital archive of the Wayback Machine is used for training AI models.

#AIScraping #theWaybackMachine

https://www.punto-informatico.it/editori-contro-scraping-ai-addio-wayback-machine/

Editori contro lo scraping AI: addio Wayback Machine?

Molti editori bloccano il bot di Internet Archive perché l'archivio digitale della Wayback Machine viene utilizzato per l'addestramento dei modelli AI.

Punto Informatico

@serigala_tropis ahh. AI scraping is also likely on flipboard.

So I go through the trouble of avoiding ai scraping on my sites, flipboard requires full rss feeds, and suddenly you have a writing honeypot for ai scrapers.

Sneaky.

#flipboard #writing #aiscraping

May have to put my rss feeds into flipboard.

Edit: no. Full rss feeds are a requirement. It's an ai scraper honeypot.

I will consider the admin overhead.

I would rather be writing.

https://techcrunch.com/2026/04/02/flipboard-surf-social-websites-help-publishers-and-creators-tap-into-the-open-social-web/

#flipboard #socialmedia #writing #aiscraping

Flipboard's new 'social websites' help publishers and creators tap into the open social web | TechCrunch

Flipboard's social websites consolidate profiles and posts from Bluesky, Mastodon, Threads, YouTube, podcasts, blogs, and RSS feeds into a single, shared destination.

TechCrunch

I heard back on this the other day. The #SJM aka #SJMN aka #mercurynews turned off their feeds due to #aiscraping , as they offered full articles in the feed.

Ugh. Still a -4 on productivity.

🚨 Publishers Strike Back: EU Demands “Pay Up” & UK Says “Let Us Opt Out” of AI Search! 🤖💸

The “wild west” of AI scraping just hit a massive roadblock. In a double-whammy update from Europe, lawmakers are finally drawing a line in the sand. If you own a website, create content, or work in SEO, the game is changing fast.

Here is the breakdown of the two massive stories shaking up the tech world this week.

https://www.nbloglinks.com/publishers-strike-back-eu-demands-pay-up-uk-says-let-us-opt-out-of-ai-search/

#AI #AIScraping #publishers #AIContent ##AIcontrol #UK #EU #technews #SEO

🚨 Publishers Strike Back: EU Demands “Pay Up” & UK Says “Let Us Opt Out” of AI Search! 🤖💸 – nbloglinks

The "wild west" of AI scraping just hit a massive roadblock. In a double-whammy update from Europe, lawmakers are finally drawing a line in the sand. If you own

nbloglinks

Today in "The end of the open internet“: the Internet Archive @internetarchive is offline

#OpenInternet #AIScraping #Enshittification

The absolute nonsense that is #AIscraping of #WordPress blogs, and pretty much all of it from one of the USA big tech entities. Bloggers tend to write to inform, entertain and interact with other bloggers, not to have their creativity sucked dry by greedy behemoths. 😠

No surprise here about #aiscraping. The question
Is if it's efficient and produces value that redeems the cost.

Yes, i laughed at the #typo in the title. Spellcheck alone should have caught it. 🤣

#ai

https://www.wired.com/story/ai-bots-are-now-a-signifigant-source-of-web-traffic/

AI Bots Are Now a Significant Source of Web Traffic

New data shows AI bots pushing deeper into the web, prompting publishers to roll out more aggressive defenses.

WIRED