Ok, hear me out; Stupid idea:

RBL for AI scrapers, but with BGP. I setup an auto-peering bgpd, and fill in the /32 | /128 of hosts i find scraping across my AS (+allow trusted entities to do the same).

Setup a remote peering with the host and nullroute that stuff.

Thoughts?

@tfiebig Sounds like route leaking hilarity ready to happen.

DNSBL for this sort of stuff likely will be coming soon. I've not looked into whether web servers are currently leveraging such things.

mod_access_dnsbl for Apache

A module to control access to web content based on the client's IP address being in certain DNSBLs