Honestly, the thing that will probably kill LLMs the hardest is someone writing a small language model that runs in JavaScript in a browser and hits comparable benchmarks.

Why bother with all those GPUs and all that energy usage if your Raspberry Pi could get comparable results?

@soatok

If anyone is thinking about smol models, they should go sniff around Hugging Face Smol Models Research first. https://huggingface.co/HuggingFaceTB

Having said that though, I know some like the idea of a smol model, but then they get annoyed when the usability tradeoff is lack of general knowledge/needing to do tool use. Witness the reception of OpenAI's gpt-oss-20b for example.
