Snuck in a quick, but important https://probes.dev feature and added the User-Agent header to be a good web citizen.

`User-Agent: probes (+https://probes.dev)`

#HTTP #UserAgent #Monitoring

Probes — Uptime & Performance Monitoring

Website and API uptime and performance monitoring for solo developers and small teams. DNS, TCP, TLS, and TTFB timing breakdowns to the millisecond. $1/site/month. Free trial, no credit card required.

Probes
User-Agent情報についての分析から分かること - Qiita

はじめに 三菱電機の大塚です。 三菱電機 情報技術総合研究所では、製品開発時のセキュリティ対策にフィードバックする目的で、複数種類のハニーポットを設置・運用しています。 今回はIoT家電を標的としたサイバー攻撃を観測するハニーポット「IoT家電ハニーポット」で観測した攻撃...

Qiita
🤡 Ah, the #nostalgia of reliving the 90s with Cloudflare's demand for a "better" user-agent, because who doesn’t love a modern web obstacle course? Congrats on turning #WebGL into a fingerprinting playground—next up, #CAPTCHA that asks for your favorite smells! 🕵️‍♂️🔍
https://hacktivis.me/articles/cloudflare-turnstile-webgl-fingerprinting #Cloudflare #useragent #HackerNews #ngated
Cloudflare Turnstile requiring fingerprintable WebGL

@GNUmatic Buchung 1 ging ohne Probleme, bei Buchung 2 hat die Bahn mich, trotz Login und vorausgegangenen Fahrkartenkauf aufgrund des bekannten Problems geblockt. 💩 Was für ein Saftladen #DB #Bahn #Linux #UserAgent #SecurityCircus
Seltsam: Webseite der Deutschen Bahn sperrt Linux-Nutzer aus

Ein kurioser Vorfall verwirrte Besucher der DB-Webseite. Nutzer von Linux-Betriebssystemen stießen auf eine unsichtbare Mauer.

Torben Kopp

So, with Google announcing "Search is going full-AI, we won't be sending traffic to the original sites any more", someone else pointed out that this eradication of the traditional search-engine compact - we let you crawl our sites to create your index, and you send visitors to our sites when relevant - means that we can, and should, block all of Google's crawlers now. If they're going to just take, take, take and give nothing back, why let them access your content at all?

But this is cute. Besides the fact that Google documents that some of their crawlers ignore robots.txt, there's this bit of fun. On this page (https://developers.google.com/crawling/docs/robots-txt/create-robots-txt), they link to "the Google list of user agents" (https://developers.google.com/crawling/docs/crawlers-fetchers/overview-google-crawlers).

However, that links to 3 separate pages of them, and *each of those pages explicitly states that is not comprehensive, but only the ones they commonly get questions about*. And of course, none of the "User-triggered fetchers" obey robots.txt, along with some others.

So Google isn't even reporting the full list of user-agents that can be used to stop their crawling.

That is some bullshit.

#Google #crawler #RobotsTxt #UserAgent #bullshit #antisocial #web #search #WebSearch #LLM #AI

Create and Submit a robots.txt File | Google Crawling Infrastructure  |  Crawling infrastructure  |  Google for Developers

A robots.txt file lives at the root of your site. Learn how to create a robots.txt file, see examples, and explore robots.txt rules.

Google for Developers
Bestimmte User-Agents zu blockieren, muss ich nicht verstehen, oder? https://www.heise.de/news/Deutsche-Bahn-Keine-Auskunft-unter-Linux-11300742.html #db #linux #useragent #browser
Deutsche Bahn verweigert Auskunft für Linuxer

Die Deutsche Bahn sperrt Linux-User aus der Webseite aus, eine Fehlermeldung warnt vor Bot-Verdacht. User-Agent ändern hilft.

heise online

@heiseonline

Ahh so, na das erklärt alles. Ich war diese Woche auch betroffen.

Ist das ein Fehler in der Software oder will die Bahn die Leute in die Datenschutz-unfreundliche Bahn-App zwingen?

#deutschebahn #linux #webentwicklung #useragent #digitaleausgrenzung

🙄 Oh, the joys of browsing a "high-entropy" article where you need a PhD in user-agent etiquette just to read about alloys! 🔓🤖 Spoiler: It's like being told to #RSVP before you crash a party you never wanted to attend. 🥳📜
https://en.wikipedia.org/wiki/High-entropy_alloy #highentropy #useragent #alloys #techhumor #browserissues #HackerNews #ngated
High-entropy alloy - Wikipedia