Thinking about how to enable full-text searching of a useful subset of Fediverse posts in a way that is controlled by post authors and has humane unsurprising defaults.

The current yes/no binary toggle under Settings/Other (that most people have never seen) probably isn't going to do the job.

The problem needs a long-form write-up (coming), and is controversial, but I’m pretty sure won’t go away, so the community should get in front of it.

@timbray This sounds to me like the social graph interoperability debate and how classic web ideas clash with modern ideas around privacy.

Regardless of how useful it is to others, people have the right to opt out of their data being used. The GDPR effectively bans social graph interoperability between services given this principle. Mastodon seems to ignore this which is itself interesting.

The same philosophy should apply to being able to search people’s content.

@carnage4life @timbray
If we take "authors right to control" to the logical extreme, we'll find a need to include an Access Constraint or Rights Grant in each ActivityPub message.

For web pages, we've long used a robots.txt to control *who* may crawl a site and we've used page-specific"noindex" metatags, to control *what* may be done with crawled content. Does the SocialWeb need more granular control?

robots.txt: https://developers.google.com/search/docs/crawling-indexing/robots/intro
noindex: https://developers.google.com/search/docs/crawling-indexing/block-indexing

Robots.txt Introduction and Guide | Google Search Central  |  Documentation  |  Google for Developers

Robots.txt is used to manage crawler traffic. Explore this robots.txt introduction guide to learn what robot.txt files are and how to use them.

Google for Developers
@bobwyman @timbray It’s not really an extreme. I can choose permissions ranging from Only Me to Friends to Public on Facebook. So this is a fairly mainstream and normalized privacy construct.
@carnage4life @timbray
If I send a message to "Friends," some of whom use instances that implement full text search, I'd like my Friends, but only my Friends, to find my messages using their instance's search system. This seems to imply that the search engine must filter results based on a combination of my constraints plus knowledge of who is viewing the results. To make this possible, my distribution constraints must be kept with each message.