Honestly, the thing most likely to kill LLMs is someone writing a small language model that runs in JavaScript in a browser and hits comparable benchmarks.
Why bother with all those GPUs and all that energy usage if your Raspberry Pi could get comparable results?