This person is building a tool (zeitgeist dot blue) that scrapes your ActivityPub feed to generate daily LLM-driven summaries. https://alpaca.gold/@seldo/116286295716964968
@drahardja Wondering if the @moderators here have a policy about this (I couldn't find anything in About, Privacy, or Community Hub).
@haljor @moderators I would support discovering the IP addresses and/or user-agent of the client they use to scrape feeds, and blocking them.
@drahardja @haljor @moderators We don’t have a policy on this, but this particular tool will respect standard Mastodon opt-out tags in bios, like #noindex/#nobots/#noai, and also the “don’t index” flag on user profiles. This seems pretty reasonable to me (not speaking for SFBA on that last part).

@neuralgraffiti @haljor @moderators I’m not sure many people know about those tags; I certainly didn’t.

IMO the “don’t index” preference is orthogonal to, or at least a superset of, opting out of LLM summarizing. I don’t mind my posts being indexed/searchable, but I don’t want them to be fed into the LLM meat grinder.

@drahardja @neuralgraffiti @moderators I didn't know about these tags either, and I'm not finding documentation on how to use them. I also don't see "don't index" specifically in Preferences, unless that's implied by one of the checkboxes.

Is there more information on these tags somewhere?

@haljor It’s this setting.