Why is literally every search engine variously but fundamentally fucking Broken, now? Yes even whatever one you were about to suggest.

And no it's not Just SEO/Marketing/Ad-Targeted shit on the search side, because even DDG is having these problems, and they're quite fucking scrupulous about these things.

Also, that still wouldn't explain the behaviour I'm talking about, even when accounting for SEO embedding on the site side. Again: not returning results that absolutely should come up in the search parameters, but returning those results with different, possibly less-well-related parameters.

This started before the "A.I." integrations, but, as I noted a BUNCH of times, now, is only going to get worse with their advent. So that's something to look forward to.
@Wolven Speculation: could queries by AI be starting to impact the search recommendation algorithms?
@Wolven Ad driven content farming has ruined the web. Search can't keep up and (for the most part) doesn't care or actively encourages it.
@Wolven My personal favorite has been when I search for two words, and I only get results with one of the words. It's bad enough that any time I open a page, I make sure to search in the page for each of the search terms I was originally looking for to verify that it's even close to a reasonable result.

@worr @Wolven
I'll look up how to do something in a particular 3D software package, and Google will helpfully provide links telling me how to do that thing in an entirely different bit of software. It's ... not useful. (Packing *everything* in quotation marks ends up being necessary...)

Although recently it found vague *synonyms* for every word in my search query - including the software's name itself (Substance Designer).

I'm sure there's lots of very clever code doing this, but AAAARGH.

@worr @Wolven
(This is for really, really specific searches - in a landscape that should still be unpolluted by endless SEO nonsense. For more generalised stuff? An even louder AAAARGH.)
@Wolven Yes they are all fucked. No idea. Itโ€™s terrible.

@Wolven

Here, let me

No, wait.

Maybe it is a conspiracy so people do not learn things.

@SpaceLifeForm @Wolven Big Data is conspiring with Big Search to keep away the Truth of Big Tech. or.. whatever
@Wolven because we have failed to instil good metadata practices, built social media systems resistant to going back through time,
and polluted links with various tracker crap tacked on to the end.
and it drives me bonkers. which is why i archive everything i see and read, and use emacs to access it again.
@Wolven what do you mean by broken?
@dudley Specifics depend on the engine, but in all cases not returning results that absolutely should come up in the search parameters, but returning those results with different, possibly less-well-related parameters. Bad.
@Wolven @dudley yeah it's definitely really bad out there. Even ddg's for sure getting worse. Not an endorsement or recommendation, but there's paid search engines which claim to do better, but ๐Ÿคทโ€โ™€๏ธ how accurate that really is. https://kagi.com/ is one I've seen a couple people on fedi talk about.
Kagi Search - A Premium Search Engine

Better search results with no ads. Welcome to Kagi (pronounced kah-gee), a paid search engine that gives power back to the user.

@Wolven @dudley

Hmmm. Seems weird ddg would do that.

@Wolven The goal of search engines today is to drive sales. They mostly catalog retail sites in an attempt to sell you something related to your search.
@Wolven I canโ€™t tell if itโ€™s that thereโ€™s too much noise and not enough signal out there on the wild internet or if itโ€™s that the search engines are more worried about something other than returning useful results (monetization, presumably?), but yeah, Iโ€™m near to pining for an old Yahoo-style directory maintained by some sharp librarians and giving up on the machine-driven indexing/searching entirely!
@Wolven don't worry, it'll get worse. Especially when AI fully saturates it with seemingly but not quite entirely incorrect results. Yay technology!
@Wolven from a business perspective a search engine doesn't have to return the best answer, just the worst answer the consumer still accepts. And the worse the answer, the more likely they'll need to look again (because the first answer wasn't satisfying), meaning they'll look at more ads.

@Wolven I'm a librarian & have noticed this for about 5 years. It's like the results are best if you want a basic explanation, "what is Ryan Bergara's age?" It performs worse if you want broad information to research: "Student-Centered Policies" library.

I'm better versed in database searches, & noticed those are also declining in relevant results as they are laying off people. My theory: machine indexing without human oversight. Human intervention is what brings relevancy.

@somcak @Wolven and I think the big search engines are looking to optimize metrics that favor easy but common searches over rare but tricky searches.

@Wolven I think the best working theory is that the internet itself is full of more and more crap.

Link/clickbait farms are full of human produced but very cheap dreck are optimized to trick search engines and run rampant now. Plus their operators work hard to play a cat-and-mouse game with the search engines over the rankings.

Meanwhile most quality content moved behind paywalls and is either not indexed or downranked.

@Wolven the gist of it is Google and co. got greedy and started showing answers to things instead of the links, going so far as to show full recipes and summaries of wikipedia articles, driving traffic away from those sites (which are largely ad-supported).

In response huge swaths of the web started requiring authentication to see anything.

Then social media platforms realized it was better for their data harvesting to make their stuff not externally searchable, and newer ones made stuff inaccessible by default (discord, insta, etc).

It turned the web into more giant walled gardens.

Then Google seems to have stopped investing in search so the SEO brigade ran wild. And every other search engine tries to be Google in one way or other.

@Wolven
Yeah. Iโ€™d been grudgingly tolerating Googleโ€™s intrusive profiling because the results were betterโ€ฆuntil they werenโ€™t.

Iโ€™ve been enjoying Kagi, fwiw. Their results are hardly perfect either, but they have a lot less BS โ€” more like Google circa 2012. And no onerous data gathering.

@Wolven AFAICT it's because keyword search failed years ago, and all the search engines have been moving to semantic vector search instead. The same underlying tech as LLMs, actually - training a model to vectorize a document (or fragment), then doing nearest-k neighbor search.

Which means the results are inherently less specific than they should be

@Wolven my theory is that all the engineering resources got moved onto future opportunities work (like AI) and that a single sustaining engineer keeps various search engines going these days
@Wolven I am limping along with DDG but not just for privacy reasons. Gurgle now yields complete shit results unless I am logged in so Alphabet can look up who it is talking to. Shitification has been picking up speed. Download times for mere news pages stretch out to noticeable pauses while all the goddamned ad videos load. There is no free service.

@Wolven The first time I noticed google had gotten really bad was a decade ago when they decided that for every search, location data was the most important factor, so a pet shop one mile away that sold supplies for snakes would come up ahead of python the programming language.

They dialed that back a bit, but it never fully recovered.

@Wolven @CassandraZeroCovid I think the @RikerGoogling bot may be basically our future, weโ€™re just going to have to post all our dumb queries in public and no one will ever be able to say โ€œjust fucking google itโ€ ever again.
@Wolven @carnage4life I pay for Kagi and I am quite happy with almost all search results. And no ads, which is heaven.
@Wolven Search engines seem to assume that I'm not actually searching for what I searched for then they try to "guess" what I really want.

@meshaiman @Wolven my own theory is that they've been using the AI for so long that it has learned to guess what people mean when they enter a misspelled or vague or tangentially related search term, which regular users often do.

But when you know exactly what you're looking for it still thinks you're guessing. Maddening stuff.

@Wolven I even noticed this on a large internal wiki, where a search for something like โ€œinsurance benefitsโ€ started returning some random personโ€™s onboarding notes rather than the official documentation. This wiki used to be spot on!
@Wolven
I think it's because of cost optimization by the search engines. Handling tail queries requires bigger indexes. Returning more specific results to multi word queries takes more processing. That all costs money. And they can make about the same amount by just returning generic results.
@Wolven Because they are all centralised. I wrote an article years back on federated search, but it didn't get any traction.
@Wolven I can only find what I need on google images. The web search is broken. Sometimes I can't find any good solution for an error I have ,but I stumble upon one when I am looking for something else.

@Wolven Search on google products seems to be terrible across the board? Youtube will literally give me three results related to a search, then 12 other videos under a "viewers also watched" or "you might also like" tab.

GIVE ME WHAT I SEARCHED FOR, ACTUALLY?

@Wolven Maybe we can buy AltaVista for a hundred bucks and re-launch it. We'd b rich.

@Wolven

SEO

Plus . . if the search engine returned information that was actually helpful, then they couldn't show you MORE ADS . . as you keep trying (forever) to refine your search query to continue trying to find anything helpful