A little .htaccess kung-fu later, and if everything works, GPTBot is now blocked.
Put Amazonbot on the list as well, since it was creeping around my website.
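The post doesn't show the actual .htaccess rules, so here is only a rough sketch of the same idea in Python: match the User-Agent header against the two crawler names and answer with a 403. The names `BLOCKED_AGENTS`, `is_blocked` and `block_bots` are made up for illustration, and the regex is an assumption about how one might match these bots.

```python
import re

# Assumed pattern: case-insensitive match on the crawler names from the post.
BLOCKED_AGENTS = re.compile(r"gptbot|amazonbot", re.IGNORECASE)


def is_blocked(user_agent):
    """Return True if the User-Agent string mentions a blocked crawler."""
    return bool(user_agent) and BLOCKED_AGENTS.search(user_agent) is not None


def block_bots(app):
    """Hypothetical WSGI middleware: 403 for blocked crawlers, pass the rest through."""
    def middleware(environ, start_response):
        if is_blocked(environ.get("HTTP_USER_AGENT", "")):
            start_response("403 Forbidden", [("Content-Type", "text/plain")])
            return [b"Forbidden\n"]
        return app(environ, start_response)
    return middleware
```

Wrapping an existing WSGI app is then just `app = block_bots(app)`. Matching on User-Agent only stops bots that identify themselves honestly.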
Today in web application hosting land: Amazon's Amazonbot, which crawls to train the LLM behind Alexa, seems to be using AI spew to generate potential crawl target URLs, rather than behaving like a proper crawler and only following actual links that have been offered.
It's been hammering a WordPress events calendar site with concurrent requests from multiple networks for events in the year 2271.
I don’t add web crawlers to my shitlist often, but #amazonbot just made the cut. It has been issuing 1 request/s to pages behind “nofollow” for quite a few minutes. Based on the about page for the bot, #Amazon didn’t bother implementing robots.txt rate limiting or honoring nofollow in a meta tag. 🤦‍♂️
From now on, it’ll get 403s.
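For contrast, a minimal sketch of what honoring robots.txt rate limiting looks like on the crawler side, using Python's standard `urllib.robotparser`. The robots.txt content, the `example.org` URLs and the 10-second delay are assumptions for illustration; `Crawl-delay` is a non-standard extension that each crawler may or may not support.

```python
import urllib.robotparser

# Hypothetical robots.txt a site owner might serve (not taken from the post).
ROBOTS_TXT = """\
User-agent: Amazonbot
Crawl-delay: 10
Disallow: /calendar/
"""

parser = urllib.robotparser.RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# A polite crawler would wait this many seconds between requests.
delay = parser.crawl_delay("Amazonbot")

# ...and would check every URL before fetching it.
calendar_ok = parser.can_fetch("Amazonbot", "https://example.org/calendar/2271-01/")
about_ok = parser.can_fetch("Amazonbot", "https://example.org/about")
```

None of this is enforced by the server; it only works if the bot chooses to read and follow it, which is why the 403 is the fallback.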
Latest technical news:
#modoboa 2.2.0 has been running on email hosting since yesterday, with many changes. Send-only emails are coming soon. Unfortunately, one particular bug that was bothering us is still there.
#searx was removed entirely. There was no time to maintain it and the debian package isn't being updated as it should be. And its users were mostly bots with targeted queries meant to manipulate the real search engines. (warez, etc.)
#ai #bots + #amazonbot have been blocked completely from our web hosting machines.
#Amazonbot. Really aggressive. Do wonder about the value tbh.
I have asked Alexa questions and had the site returned which was nice, but outside of that, where’s the click? Where’s the sign up? Where’s the sale? Where’s the ad impression?
ASN blocked.
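Blocking an ASN in practice means blocking every IP prefix it announces. A rough sketch of that check with Python's standard `ipaddress` module follows; the prefixes below are reserved documentation ranges standing in for a real prefix list, and `BLOCKED_PREFIXES` and `in_blocked_asn` are hypothetical names.

```python
import ipaddress

# Placeholder prefixes (documentation ranges), NOT any real network's list.
BLOCKED_PREFIXES = [
    ipaddress.ip_network("192.0.2.0/24"),
    ipaddress.ip_network("198.51.100.0/24"),
    ipaddress.ip_network("2001:db8::/32"),
]


def in_blocked_asn(client_ip):
    """Return True if client_ip falls inside any blocked prefix."""
    addr = ipaddress.ip_address(client_ip)
    # Only compare against prefixes of the same IP version.
    return any(addr in net for net in BLOCKED_PREFIXES if net.version == addr.version)
```

The real prefix list would have to come from routing data for the ASN in question and be refreshed now and then, since announcements change.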