Someone is building a "global fediverse post indexer" that:

* scrapes the public APIs so it can't be blocked via defederation
* uses a bunch of dynamic IPs so it can't be banned at network level (hilariously, the author redacted this part and forgot that the edit history can be viewed by anyone)
* can be blocked by server admins via robots.txt, but they're planning to publish which instances are opting out (right now this is "open for debate")
* can be blocked by users by disabling indexing in the profile settings (!) or adding a specific hashtag to their bio (!!)

There's ZERO mention of opt-in, a lot of pushback against anyone who dares calling this thing a scraper ("we're using public APIs, so we're not a scraper") and the inevitable "we got complaints only from people who have something to hide".

With this attitude, I wonder how they're going to respond to the first GDPR compliant they're inevitably going to receive, it'll be fun 🍿

@rfc1459 There's no easy answer on that one. It's a "it depends". Depends if it is a company or an individual. Depends on where they are based. Depends where the scripts are running from (jurisdiction is a swamp).

Most likely, they will get away with it for a very long time. Enough to fill Tb of data, index it and make money out of it