#Development #Obituaries
Farewell to robots.txt (1994-2025) · “You were too good for this world.” https://ilo.im/167q2b

_____
#SearchEngine #InternetArchive #AI #Content #Website #RobotsTxt #RFC9309 #WebDev #Backend

Obituary: Farewell to robots.txt (1994-2025)

Henning Fries bids farewell to the voluntary compliance protocol that civilized the internet.

heise online

I’ve made a little something, so I thought I’d share.

Gort is a robots.txt parser and evaluator. It implements RFC 9309.

More details in the ReadMe: https://github.com/pointlessone/gort
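
For anyone wondering what "parser and evaluator" means in practice, here is a minimal Python sketch of the matching rule RFC 9309 describes: the longest matching path wins, and Allow wins a tie. This is not Gort's API (the ReadMe covers that). The names `parse_groups`, `is_allowed`, `SAMPLE`, and `ExampleBot` are made up for illustration, user-agent matching is simplified to an exact case-insensitive comparison, and the `*` and `$` wildcards are left out.

```python
SAMPLE = """\
User-agent: *
Disallow: /private/
Allow: /private/press/
"""


def parse_groups(text):
    """Collect (user_agents, rules) groups from robots.txt text."""
    groups = []
    agents, rules = [], []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        key, value = (part.strip() for part in line.split(":", 1))
        key = key.lower()
        if key == "user-agent":
            if rules:  # a rule line already closed the previous group
                groups.append((agents, rules))
                agents, rules = [], []
            agents.append(value.lower())
        elif key in ("allow", "disallow") and agents:
            rules.append((key == "allow", value))
    if agents:
        groups.append((agents, rules))
    return groups


def is_allowed(groups, user_agent, path):
    """Longest matching rule wins; Allow wins ties; no match means allowed."""
    ua = user_agent.lower()
    # Prefer groups that name this agent, otherwise fall back to "*".
    matched = [rules for agents, rules in groups if ua in agents]
    if not matched:
        matched = [rules for agents, rules in groups if "*" in agents]
    best_len, best_allow = -1, True
    for rules in matched:
        for allow, pattern in rules:
            if not pattern or not path.startswith(pattern):
                continue  # plain prefix match only in this sketch
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    groups = parse_groups(SAMPLE)
    for path in ("/private/press/kit.html", "/private/notes.txt", "/index.html"):
        print(path, is_allowed(groups, "ExampleBot", path))
```

With the sample above, the press kit is allowed because `/private/press/` is a longer match than `/private/`, the notes file is disallowed, and the index page is allowed because no rule matches it.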

#Ruby #rubygem #release #robotstxt #robots_txt #rfc9309

GitHub - pointlessone/gort: robots.txt parser and evaluator


Setting up /robots.txt, not because it helps, but because being crabby in compliance with an RFC is satisfying.

Who knows of other unsavory crawlers besides ChatGPT and Twitterbot?

https://rossabaker.com/configs/website/webcrawlers/
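
A rough idea of what such a file can look like, and how a compliant crawler would read it, sketched with Python's standard `urllib.robotparser`. This is not the configuration from the linked page. The user-agent tokens `ChatGPT-User` and `Twitterbot`, the `example.org` URLs, and the `FriendlyBot` name are assumptions for illustration, so check each crawler's own documentation for the token it actually honors.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical policy: turn two named crawlers away, let everyone else in.
ROBOTS_TXT = """\
User-agent: ChatGPT-User
User-agent: Twitterbot
Disallow: /

User-agent: *
Allow: /
"""

policy = RobotFileParser()
policy.parse(ROBOTS_TXT.splitlines())  # parse in memory, no network request

if __name__ == "__main__":
    for agent in ("ChatGPT-User", "Twitterbot", "FriendlyBot"):
        print(agent, policy.can_fetch(agent, "https://example.org/posts/1"))
```

Only crawlers that choose to honor the file are affected, which is the "voluntary compliance" part.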

#RFC9309 #RobotsTxt

Ross A. Baker: Webcrawler configuration

Defines a robots.txt file that sets access policies for compliant web crawlers, following the Robots Exclusion Protocol.

GitHub - crawler-commons/crawler-commons: A set of reusable Java components that implement functionality common to any web crawler
