The absolute nonsense that is #AIscraping of #WordPress blogs, and pretty much all of it from one of the USA big tech entities. Bloggers tend to write to inform, entertain and interact with other bloggers, not to have their creativity sucked dry by greedy behemoths. 😠

No surprise here about #aiscraping. The question
Is if it's efficient and produces value that redeems the cost.

Yes, i laughed at the #typo in the title. Spellcheck alone should have caught it. 🤣

#ai

https://www.wired.com/story/ai-bots-are-now-a-signifigant-source-of-web-traffic/

AI Bots Are Now a Significant Source of Web Traffic

New data shows AI bots pushing deeper into the web, prompting publishers to roll out more aggressive defenses.

WIRED

There is an ugly truth in this. I block ai scraping on my sites, but I am not blind to the fact that ai scraping can still happen.

It's not an industry that pays more than lip service to social responsibility.

They don't even need the data. That is how misguided it is.

#ai #aiscraping

https://www.axios.com/2026/02/02/iab-ai-accountability-publishers-act

Ad lobby seeks law to protect publishers from AI scraping

"Free riding isn't just unfair. It's stealing," IAB CEO David Cohen said.

Axios

Zero surprised that Google is scanning Gmail emails for AI training. 😕

#google #degoogle #gmail #ai #aiscraping

https://www.classaction.org/news/thele-v.-google-llc

Thele v. Google, Llc.

A class action lawsuit alleges Google is tracking consumers' communications without consent after it 'secretly turned on' Gemini AI for all users.

Lawsuit: Reddit caught Perplexity “red-handed” stealing data from Google results https://arstechni.ca/cJ4u #Perplexity.ai #googlesearch #webscraping #aiscraping #Perplexity #Policy #google #reddit #AI
Lawsuit: Reddit caught Perplexity “red-handed” stealing data from Google results

Scraper accused of stealing Reddit content “shocked” by lawsuit.

Ars Technica

Although the bland the "A.I." generated voice is detestable in the extreme the overall concept is mildly amusing if unoriginal, plus, there's a special treat for #DoctorWho fans in trying to fathom where the contents of the Laser Tracking Room were illicitly scraped from...

https://www.youtube.com/watch?v=sZkB11pO9R8

#cats #Caturday #AIart #TARDIS #copyright #AIscraping

Pay-per-output? AI firms blindsided by beefed up robots.txt instructions. https://arstechni.ca/tpDy #reallysimplylicensing #rslstandard #aicrawlers #aiscraping #AItraining #Policy #google #openai #meta #RSS #xAI #AI
Pay-per-output? AI firms blindsided by beefed up robots.txt instructions.

“Really Simple Licensing” makes it easier for creators to get paid for AI scraping.

Ars Technica
Reddit blocks Internet Archive to end sneaky AI scraping

The Internet Archive confirmed it’s in ongoing discussions with Reddit after block.

Ars Technica

(⁠⌐⁠■⁠-⁠■⁠) Perplexity is using stealth, undeclared crawlers to evade website no-crawl directives

https://blog.cloudflare.com/perplexity-is-using-stealth-undeclared-crawlers-to-evade-website-no-crawl-directives

#perplexity #aisearch #aiscraping

Perplexity is using stealth, undeclared crawlers to evade website no-crawl directives

Perplexity is repeatedly modifying their user agent and changing IPs and ASNs to hide their crawling activity, in direct conflict with explicit no-crawl preferences expressed by websites.

The Cloudflare Blog