RE: https://social.treehouse.systems/@ariadne/116213132813239860

Read what Ariadne is writing about LLMs. This all tracks with my intuition, that OpenAI et al are a big grift.

You categorically do NOT need millions or billions to train a useful LLM that can communicate in human language. LLMs are good at language, it's in the name!

The reason these companies are burning massive amounts of money and using increasingly massive models is they've taken "look, this tech makes for a cute chatbot that can do useful stuff" and turned it into "if we make it bigger it'll be SMARTER!"

And the thing is, that's true... to a point. When you stop treating the LLM as a language model and start trying to turn them into an all-knowing entity that has memorized the entirety of human knowledge and can do anything you prompt it for all with the same model (or a few collaborating models), you quickly hit diminishing returns. And you end up with a thing that's kind of smart (not really) and kind of knows everything (not really) and convinces everyone to throw insane amounts of money at you because you're fundamentally using the technology for something it wasn't intended for.

The way we fight back is with small home-grown "LLMs" (SLMs?) that run on a MacBook and train on a few GPUs and training/fine tuning them for specific purposes.

The whole AIBro approach of just using prompting and in-context learning with a single all-powerful model is just patently absurd.

@lina the "bigger is smarter" philosophy you mentioned reminds me of a YouTube video I watched yesterday about a mall that became worthless almost immediately after being built because it followed a similar philosophy - "bigger can't fail." It ended up changing hands some 7 or so times over its 30 years, its value cratering every time. Recently it was greenlit for destruction because the land was worth more than the property.
@lina I think a mall is rather similar to an online service - both are high-risk investments, so they require both quick buy-in and continued usage in order to stay afloat. Perhaps most importantly, they require converting "window shoppers" into long-term customers. I personally hope and believe that most current users of corporate LLMs are window shoppers, and will not be converted into useful customers when they start charging more. I guess we'll see soon enough.
The Most Abandoned Mall in Ohio You Can Still Walk Into

YouTube