This person is building a tool (zeitgeist dot blue) that scrapes your ActivityPub feed to generate daily LLM-driven summaries. https://alpaca.gold/@seldo/116286295716964968
@drahardja Wondering if the @moderators here have a policy about this (I couldn't find anything in About, Privacy, or Community Hub).
@haljor @moderators I would support discovering the IP addresses and/or user-agent of the client they use to scrape feeds, and blocking them.
@drahardja @haljor @moderators We don’t have a policy on this, but this particular tool will respect standard Mastodon opt-out tags in bios, like #noindex/#nobots/#noai and also the “don’t index” flag on user profiles. This seems pretty reasonable to me (not speaking for SFBA on that last part).
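
For readers wondering what "respecting" those opt-outs might look like in practice, here is a minimal sketch. It is not the tool's actual code. It assumes the account arrives as a dict shaped like Mastodon's account JSON, with the bio as HTML in `note` and the profile flag in `noindex`; both field names and the helper `has_opted_out` are assumptions for illustration.

```python
import re
from html import unescape

# Community-convention opt-out tags named in the thread (not an official standard).
OPT_OUT_TAGS = {"noindex", "nobots", "noai"}


def has_opted_out(account: dict) -> bool:
    """Return True if the account signals it does not want to be processed.

    Assumes `account` resembles Mastodon's account JSON: `note` holds the
    bio as HTML and `noindex` holds the "don't index" profile flag.
    """
    # The profile-level flag wins outright when it is set.
    if account.get("noindex") is True:
        return True

    bio_html = account.get("note") or ""
    # Hashtags in bios are often wrapped in <a>/<span>; drop those tags
    # without adding whitespace so "#" stays attached to the tag text.
    text = re.sub(r"</?(?:span|a)\b[^>]*>", "", bio_html)
    # Replace remaining tags (paragraphs, line breaks) with spaces.
    text = re.sub(r"<[^>]+>", " ", text)
    text = unescape(text)

    # Compare case-insensitively, since people write #NoAI, #noai, etc.
    tags = {tag.lower() for tag in re.findall(r"#(\w+)", text)}
    return not OPT_OUT_TAGS.isdisjoint(tags)
```

A scraper following this convention would call `has_opted_out(account)` before reading any of that account's posts and skip the account entirely when it returns `True`.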

@neuralgraffiti @haljor @moderators I’m not sure many people know about those tags; I certainly didn’t.

IMO the “don’t index” preference is orthogonal to, or at least a superset of, opting out of LLM summarizing. I don’t mind my posts being indexed/searchable, but I don’t want them to be fed into the LLM meat grinder.

@drahardja @neuralgraffiti @moderators I didn't know about these tags either, and I'm not finding documentation on how to use them. I also don't see "don't index" specifically in Preferences, unless that's implied by one of the checkboxes.

Is there more information on these tags somewhere?

@haljor It’s this setting.

@haljor @drahardja @moderators The tags are a convention that’s been around for a while, but isn’t official.

The indexing profile option is described in the docs here: https://docs.joinmastodon.org/user/preferences/#misc

@neuralgraffiti @haljor @moderators I think they should be opt-in, like search. Instead of noai, it should be yesai.

Consent should be opt-in.

@drahardja @haljor @moderators That’s a fair point. I’m not sure we want to be in the position of committing to find and block all tools that come around from an operational perspective, but we are always open to community feedback, and obviously AI tools are a rapidly changing landscape. We’ve had some internal debates on how to handle them.

Frankly, and not to derail, the various social media laws being passed around the US are of much greater concern to me.