Mastodawn

Hrefna (DHC)Feb 26, 2023

This idea that somehow search engines _can_ arbitrate "truth" is just so… not how any of this works or could even conceivably work.

The reason that search engines "backstop" with wikipedia is because wikipedia is a giant curated and mostly-audience-appropriate collection of knowledge.

Knowing what is "true" is so incredibly nontrivial.

Show thread

Shannon Clark Feb 26, 2023

@hrefna @futurebird there is also this vast disconnect - a lot of search engines and people broadly seem to think that there is a single “rightl answer to any given query.

When, j would argue, in nearly all cases there is not. That every query (and the person making the query) has a context which may or may not be known to the search engine and which can make the best answer/link for their context differ from others.

This is true even if seemingly obvious questions “how many hours in a day?”

Show thread

myrmepropagandist Feb 26, 2023

@Rycaut

I'm becoming increasingly resigned to the fact that I'm going to need to write my own search engine.

It was never reasonable to expect any service much less a free one to really do full text search; the data are too big.

That's the sad truth behind all this suggesting & dumbing down of results. Sometimes even species names get "corrected" to something more popular.

But if I want to be able to have everything I've read and written searchable I'm just gonna need to do it myself.

Show thread

Nire Bryce Feb 26, 2023

@futurebird
@Rycaut

if it's just things you write and you scrape your accounts' posts as you write them, you can probably use elasticsearch. but that only works going forward, finding all your old posts would be harder

(though you could probably use ES to search your Twitter etc archive)

I'm not deeply familiar with ES but I'm pretty sure this is what it's meant for, though getting the data into the right shape might take work

Show thread

Platonides

@futurebird having available a copy of "everything you have read and written" would be the hard part (e.g. is there even a browser addon that archives locally every page you visit?).

Once you have the data, you could easily grep it. And move to more indexed forms of searching if needed.

@NireBryce

Show thread

Golda Feb 27, 2023

@platonides @futurebird @NireBryce i think a bookmarklet that indexes might be a quick start