Google-Agent joins the crawler list as AI browsing gets an official identity: Google on March 20 added Google-Agent to its user-triggered fetchers list, formalizing a new user agent for AI systems like Project Mariner that navigate the web on behalf of users. https://ppc.land/google-agent-joins-the-crawler-list-as-ai-browsing-gets-an-official-identity/ #GoogleAgent #AIBrowsing #WebCrawlers #ArtificialIntelligence #ProjectMariner
FYI: Anthropic clarifies what its three web crawlers do - and how to block them: Anthropic today updated its crawler documentation, detailing ClaudeBot, Claude-User, and Claude-SearchBot - what each collects and what blocking them means for site visibility. https://ppc.land/anthropic-clarifies-what-its-three-web-crawlers-do-and-how-to-block-them/ #WebCrawlers #SEO #DataPrivacy #ClaudeBot #SiteVisibility
Facebook's Fascination with My Robots.txt

Facebook is requesting my robots.txt thousands of times per hour.

Random Notes

🚀 Akamai’s latest data shows a sharp rise in AI training bots and content‑fetching crawlers since July. These bots are reshaping web traffic patterns, stressing infrastructure and raising privacy questions. How will developers and open‑source projects adapt? Dive into the numbers and what they mean for the future of machine‑learning pipelines. #AIBots #WebCrawlers #BotTraffic #MachineLearning

🔗 https://aidailypost.com/news/akamai-data-shows-ai-training-bots-contentfetching-bots-rise-since

NiemanLab: News publishers limit Internet Archive access due to AI scraping concerns. “When The Guardian took a look at who was trying to extract its content, access logs revealed that the Internet Archive was a frequent crawler, said Robert Hahn, head of business affairs and licensing. The publisher decided to limit the Internet Archive’s access to published articles, minimizing the chance […]

https://rbfirehose.com/2026/01/30/niemanlab-news-publishers-limit-internet-archive-access-due-to-ai-scraping-concerns/
How I protect my forgejo instance from AI Web Crawlers

This article describes my nginx configuration and my strategy for preventing web crawlers from taking down my instance while still serving most people with minimal friction.

Search Engine Roundtable: OpenAI Scales Up Crawling & Bots For The Holidays. “OpenAI is reportedly scaling up its crawling infrastructure for the holiday shopping season. The folks at Merj noticed OpenAI adding a lot of new IP ranges for its bots and crawlers.”

https://rbfirehose.com/2025/12/01/search-engine-roundtable-openai-scales-up-crawling-bots-for-the-holidays/

Inside the web infrastructure revolt over Google’s AI Overviews

Cloudflare CEO Matthew Prince is making sweeping changes to force Google’s hand.

Ars Technica