🤡 Ah, the #nostalgia of reliving the 90s with Cloudflare's demand for a "better" user-agent, because who doesn’t love a modern web obstacle course? Congrats on turning #WebGL into a fingerprinting playground—next up, #CAPTCHA that asks for your favorite smells! 🕵️‍♂️🔍
https://hacktivis.me/articles/cloudflare-turnstile-webgl-fingerprinting #Cloudflare #useragent #HackerNews #ngated
Cloudflare Turnstile requiring fingerprintable WebGL

@GNUmatic Booking 1 went through without problems; on booking 2, Deutsche Bahn blocked me because of the known issue, even though I was logged in and had just bought a ticket. 💩 What a shoddy outfit #DB #Bahn #Linux #UserAgent #SecurityCircus
Strange: Deutsche Bahn website locks out Linux users

A curious incident confused visitors to the DB website. Users of Linux operating systems ran into an invisible wall.

Torben Kopp

So, with Google announcing "Search is going full-AI, we won't be sending traffic to the original sites any more", someone else pointed out that this eradication of the traditional search-engine compact - we let you crawl our sites to create your index, and you send visitors to our sites when relevant - means that we can, and should, block all of Google's crawlers now. If they're going to just take, take, take and give nothing back, why let them access your content at all?

But this is cute. Besides the fact that Google documents that some of their crawlers ignore robots.txt, there's this bit of fun. On this page (https://developers.google.com/crawling/docs/robots-txt/create-robots-txt), they link to "the Google list of user agents" (https://developers.google.com/crawling/docs/crawlers-fetchers/overview-google-crawlers).

However, that links to 3 separate pages of them, and *each of those pages explicitly states that it is not comprehensive and lists only the ones they commonly get questions about*. And of course, none of the "User-triggered fetchers" obey robots.txt, and neither do some others.

So Google isn't even reporting the full list of user-agents that can be used to stop their crawling.

That is some bullshit.

#Google #crawler #RobotsTxt #UserAgent #bullshit #antisocial #web #search #WebSearch #LLM #AI
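
To make the point about incomplete lists concrete, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content, the URL, and the agent name `SomeUnlistedFetcher` are made up for illustration; they are not taken from Google's documentation. It only shows how a compliant parser treats an agent you did not name, and says nothing about fetchers that ignore robots.txt altogether.

```python
from urllib.robotparser import RobotFileParser

# Illustrative robots.txt: block two named agents, allow everyone else.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: *
Allow: /
"""


def allowed_agents(robots_txt, agents, url="https://example.org/article"):
    """Return {agent: may_fetch} as a robots.txt-compliant client would see it."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


# "SomeUnlistedFetcher" is a hypothetical name standing in for any
# crawler that is missing from the published list.
result = allowed_agents(
    ROBOTS_TXT, ["Googlebot", "Google-Extended", "SomeUnlistedFetcher"]
)
for agent, ok in result.items():
    print(f"{agent}: {'allowed' if ok else 'blocked'}")
```

An agent you cannot name falls through to the `*` group, so per-agent rules only work for names you know. The fallback is `User-agent: *` with `Disallow: /`, which also shuts out every other compliant crawler.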

Create and Submit a robots.txt File | Google Crawling Infrastructure  |  Crawling infrastructure  |  Google for Developers

A robots.txt file lives at the root of your site. Learn how to create a robots.txt file, see examples, and explore robots.txt rules.

Google for Developers
I don't have to understand why anyone would block certain user agents, do I? https://www.heise.de/news/Deutsche-Bahn-Keine-Auskunft-unter-Linux-11300742.html #db #linux #useragent #browser
Deutsche Bahn refuses to give Linux users information

Deutsche Bahn locks Linux users out of its website; an error message warns of suspected bot activity. Changing the User-Agent helps.

heise online

@heiseonline

Ah, I see, well that explains everything. I was affected this week too.

Is this a bug in the software, or does Deutsche Bahn want to force people into the privacy-unfriendly Bahn app?

#deutschebahn #linux #webentwicklung #useragent #digitaleausgrenzung

🙄 Oh, the joys of browsing a "high-entropy" article where you need a PhD in user-agent etiquette just to read about alloys! 🔓🤖 Spoiler: It's like being told to #RSVP before you crash a party you never wanted to attend. 🥳📜
https://en.wikipedia.org/wiki/High-entropy_alloy #highentropy #useragent #alloys #techhumor #browserissues #HackerNews #ngated
High-entropy alloy - Wikipedia

When you have to fake your User-Agent because Zeit.de blocks Firefox with "CrawlerDetected" via some dumb script they found on GitHub.

Of course this is all just their desperate attempt at reducing traffic from slop-bots, and it ends up blocking real users too.

Hope someone reads the logs sometimes, but I doubt it.

#Enshittification #Slop #DieZeit #UserAgent
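
For illustration, a rough sketch of how a User-Agent filter can end up blocking real users. The patterns below are invented; nothing here is known about the scripts Zeit.de or Deutsche Bahn actually run. The second rule is a deliberately overbroad assumption ("Linux desktops are probably bots") chosen to show the failure mode.

```python
import re

# Hypothetical blocklist, NOT taken from any real site.
SUSPECT_PATTERNS = [
    re.compile(r"bot|crawler|spider", re.IGNORECASE),  # self-declared crawlers
    re.compile(r"X11; Linux"),                         # overbroad platform guess
]


def is_blocked(user_agent: str) -> bool:
    """Return True if any suspect pattern matches the User-Agent string."""
    return any(p.search(user_agent) for p in SUSPECT_PATTERNS)


# Typical Firefox User-Agent strings (version numbers are just examples).
FIREFOX_LINUX = (
    "Mozilla/5.0 (X11; Linux x86_64; rv:128.0) Gecko/20100101 Firefox/128.0"
)
FIREFOX_WINDOWS = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:128.0) "
    "Gecko/20100101 Firefox/128.0"
)

print("Firefox on Linux blocked:  ", is_blocked(FIREFOX_LINUX))
print("Firefox on Windows blocked:", is_blocked(FIREFOX_WINDOWS))
```

The same browser passes or fails depending only on a string the client controls, so a scraper that sends the Windows string gets through while a real Linux user does not.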

Even in the era where all major desktop environments are going Wayland-only, web browsers will ensure we never get rid of X11's traumatic memory, huh? 🫠 https://bugzilla.mozilla.org/show_bug.cgi?id=2027556

Par for the course for the ecosystem where everybody pretends to be everybody because the web is a never-ending collection of hacks…

#UserAgent #X11 #Xorg #Firefox #browsers #Wayland

2027556 - User agent says "X11" while running natively on Wayland on Linux

RESOLVED (nobody) in Core - Widget: Gtk. Last updated 2026-04-28.
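
A small sketch of what the bug title describes, assuming the usual shape of a Firefox-on-Linux User-Agent string (the version number is an example, and `platform_tokens` is a helper name made up here). It just splits the parenthesised platform part into its tokens.

```python
def platform_tokens(user_agent: str) -> list[str]:
    """Return the semicolon-separated tokens inside the first (...) group."""
    start = user_agent.find("(")
    end = user_agent.find(")", start)
    if start == -1 or end == -1:
        return []
    return [token.strip() for token in user_agent[start + 1:end].split(";")]


# Example string; per the bug title, a native Wayland session reports the same "X11".
UA = "Mozilla/5.0 (X11; Linux x86_64; rv:128.0) Gecko/20100101 Firefox/128.0"

tokens = platform_tokens(UA)
print(tokens)
```

The first token is a fixed label rather than a report of the actual display server, which is why sniffing it tells a site nothing about Wayland versus X11.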