Could I get some admins of bigger instances to look in their access.log and see whether anything besides the good-for-nothing scraper is using "Apache-HttpClient" as part of its user agent, or whether any actual fedi software uses it?

See: https://wolfi.ee/@jase/statuses/01KHB81TRFKKXC1C72M8Z8GFYD

I'm going to add that user agent to the blocklist of my #BadBotBlocker, as I have nothing but the public timelines scraper using it, but I want to be sure first.
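
For admins who want to check, here is a rough sketch of one way to tally the matching user agents. It assumes the access log uses the common "combined" format, where the user agent is the last double-quoted field on each line; the function names and the example path in the note below are made up for illustration, so adjust them to your setup.

```python
import re
from collections import Counter

# Assumption: "combined" log format, with the user agent as the last
# double-quoted field on the line. Custom formats need another pattern.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')


def tally_user_agents(lines, needle="Apache-HttpClient"):
    """Count each distinct user agent string that contains `needle`."""
    counts = Counter()
    needle = needle.lower()
    for line in lines:
        match = UA_PATTERN.search(line)
        # Case-insensitive substring check on the user agent field only.
        if match and needle in match.group(1).lower():
            counts[match.group(1)] += 1
    return counts


def scan_log(path, needle="Apache-HttpClient"):
    """Print matching user agents from the log at `path`, most frequent first."""
    with open(path, encoding="utf-8", errors="replace") as log:
        for agent, hits in tally_user_agents(log, needle).most_common():
            print(f"{hits:8d}  {agent}")
```

Calling something like `scan_log("/var/log/nginx/access.log")` (path is only an example) lists every distinct user agent containing "Apache-HttpClient" with its hit count, which should make it easy to spot anything that looks like real fedi software rather than the scraper.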

#Scraper #ScraperNoScraping #FediAdmin #MastoAdmin #SysAdmin #Fediblock

Wolfie Jase is lewd your friendly neighborhood terrorist (@jase@wolfi.ee)

Sensitive content: Public timeline fedi scraper, using Apache-HttpClient in its user agent

wolfi.ee

@justdude @regendans @fluepke My personal alternative, the one that got me away from Cloudflare, is running my backend server (in my case now home-hosted on a Pi) tunneled through Tailscale behind a VPS that acts as an nginx reverse proxy and Squid forward proxy, with https://git.wolfi.ee/jase/nginx-bad-bot-blocker and https://git.gammaspectra.live/git/go-away.

#Cloudflare #BadBotBlocker #GoAway #Anubis

nginx-bad-bot-blocker

Nginx Bad Bot Blocker, customised for fedi instances and made more Tor-friendly. https://github.com/mitchellkrogza/nginx-ultimate-bad-bot-blocker

Wolfejo: You wouldn't download a wolf... right?
Dear ClaudeBot and friends, please FOAD.
More and more abusive bots are finding a place in my bad bot blocker configs.
#bots #ai #artificalintelligence #badbotblocker