This person is building a tool (zeitgeist dot blue) that scrapes your ActivityPub feed to generate daily LLM-driven summaries. https://alpaca.gold/@seldo/116286295716964968
@drahardja Wondering if the @moderators here have a policy about this (I couldn't find anything in About, Privacy, or Community Hub).
@haljor @moderators I would support discovering the IP addresses and/or user-agent of the client they use to scrape feeds, and blocking them.
@drahardja @haljor @moderators We don’t have a policy on this, but this particular tool will respect standard Mastodon opt-out tags in bios, like #noindex/#nobots/#noai, and also the “don’t index” flag on user profiles. This seems pretty reasonable to me (not speaking for SFBA on that last part).
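
For anyone wondering what "respecting" those signals could look like in practice, here is a minimal sketch in Python. It is not the tool's actual code, which isn't shown in this thread. It assumes the scraper has an account object shaped like the Mastodon API's Account entity, with an HTML `note` field for the bio and a boolean `noindex` field; the function name `has_opted_out` and the tag set are hypothetical.

```python
import re
from html import unescape

# Assumption: the three informal opt-out tags mentioned in this thread.
OPT_OUT_TAGS = {"noindex", "nobots", "noai"}


def has_opted_out(account: dict) -> bool:
    """Return True if the account signals it does not want to be processed.

    `account` is assumed to look like a Mastodon API Account entity:
    `noindex` is the profile-level flag, `note` is the bio as HTML.
    """
    # Profile-level "don't index" flag.
    if account.get("noindex"):
        return True

    note = account.get("note") or ""
    # Turn paragraph and line breaks into spaces so words don't run together.
    text = re.sub(r"</p>|<br\s*/?>", " ", note)
    # Drop remaining markup; hashtags are often rendered as
    # '#<span>tag</span>', so removing tags rejoins '#' and the tag name.
    text = unescape(re.sub(r"<[^>]+>", "", text))

    # Hashtags are case-insensitive, so compare in lowercase.
    tags = {t.lower() for t in re.findall(r"#(\w+)", text)}
    return bool(tags & OPT_OUT_TAGS)
```

A scraper following this sketch would call `has_opted_out(account)` before reading any of that account's posts and skip the account when it returns `True`. Note that this is opt-out by design: an account that sets nothing is treated as fair game.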

@neuralgraffiti @haljor @moderators I’m not sure many people know about those tags; I certainly didn’t.

IMO the “don’t index” preference is orthogonal to, or at least a superset of, opting out of LLM summarization. I don’t mind my posts being indexed/searchable, but I don’t want them to be fed into the LLM meat grinder.

@drahardja @neuralgraffiti @moderators I didn't know about these tags either, and I'm not finding documentation on how to use them. I also don't see "don't index" specifically in Preferences, unless that's implied by one of the checkboxes.

Is there more information on these tags somewhere?

@haljor @drahardja @moderators The tags are a convention that’s been around for a while, but isn’t official.

The indexing profile option is described in the docs here: https://docs.joinmastodon.org/user/preferences/#misc

@neuralgraffiti @haljor @moderators I think they should be opt-in, like search. Instead of noai, it should be yesai.

Consent should be opt-in.