#Mastodon #Fulltext #Search
This is @kissane's deeply empathetic post where she listens to the concerns of people who have left Mastodon: https://mas.to/@kissane/110793942888550843
And @siderea's thoughtful thread about the different things folks mean by "finding your people": https://universeodon.com/@siderea/110794572555824059
This week, I went over to Bluesky and asked people who'd left Mastodon why they left, and lots of people told me. I grabbed the replies and crunched them and wrote up a summary. I think it's really interesting and often kind of wrenching. https://erinkissane.com/mastodon-is-easy-and-fun-except-when-it-isnt #meta
@krohne enshrining some third-party rando in the signup flow is not the way to do this.
the way to do this is to have a standard way to explicitly mark posts as searchable, the same way that Mastodon and several other Fedi servers already have a `discoverable` flag that explicitly marks accounts as searchable. that way, *any* search engine has a way of knowing what people have volunteered for.
https://github.com/mastodon/mastodon/pull/23808 unfortunately, the effort to patch this into Mastodon has stalled out.
NOTE: This PR does not implement the search functionality itself, just a framework to express explicit consent for posts to be indexed Mastodon has historically refused to provide full-text search ...
@krohne if we don't build something like this, "is this post set to public visibility" is the only intent signal to work with, and searching all public posts surprises Mastodon users.
(and only Mastodon users. "public = searchable" is the rule for full-text search implementations on most other Fedi servers that have it, except for GotoSocial.)
@vyr
> the way to do this is to have a standard way to explicitly mark posts as searchable
Man, the problem I have with this is I don't want to mark my posts either universally searchable or not universally searchable. I want to be able to pick and choose which search engines I allow to index my content!
As a side note, part of why I would like to have that option is that I can imagine a little social movement where people in the Fediverse decide to make a point of not letting Google Search index their posts, gravitating some other search engine, thereby giving that search engine a commercial advantage over Google Search, at least for searching the Fediverse.
@siderea @krohne yeah well don't get any of your rhetoric on my gender, thanks, i just got it the way i like it π
anyway, controlling how posts get exposed to web scrapers like Google is a related but different issue. specifically, you first have to somehow have enough leverage with Google to get them to agree to whatever scheme you propose, because they don't care about ActivityPub as such. they only see web pages, and those web pages are rendered by a dozen kinds of Fediverse server, and right now, even clicking the "hide me from search engines box" doesn't work 100% on your local instance, let alone once your posts leave it.
@mattswift @krohne there's always the possibility of an "opt-either" onboarding UX. consider the following:
"i want to be visible" on the left half of the screen vs. "i want to be quiet" on the right. one click defaults.
left one turns on discoverable and searchable flags, sets your default visibility to public, and disables the opt-out for web crawlers.
right one leaves the flags off, sets your default visibility to unlisted, tells web crawlers to ignore you, hides your social graph, and requires follow request approval.
anyone who wants further customization can go digging for it.
@krohne @kissane @siderea It's the opt-out part I'd oppose.
If opt-out full-text search had existed from the beginning, I'd have no objection. People would effectively opt in when they registered their account, and could disable it immediately if they wished. But enabling it after the fact for hundreds of thousands of users, including those who are expressly against it, seems insupportable.
At the very least it should be opt-in for existing accounts.
@krohne @kissane @siderea I think this needs to be *opt-in* at the server level by having the server register with an external service. I'm sure I don't understand the technical details but I think my single-user Mastodon instance data limits would be swamped just by the external *requests* for full-text searches.
Also, this can't just be a Mastodon thing. It would have to be a modification to the ActivityPub standard, right?
@krohne Yeah, it probably would require a centralized index but that would somehow need to take into account which servers are federated and which are not. Doing a full text search and being presented with harmful posts (racist, anti-trans, pro-nazi, etc.) from Truth Social or Gab or whatever isn't going to fly. As well, having people from defederated instances reach posts through a central repository is also going to be problematic.
Centralization itself may be anathema to federation.
Surfacing unsavory content has been an issue with some of the fedi search implementations that have come and gone. We would want to see any search using the standard @oliphant blocklists at a minimum.
As for having people from blocked instances (or just anyone on the internet) being able to find your posts, that is a legitmate tradeoff. I would hope being able to block search at the post/user/instance level would help with this.