Unused Ryzen 9 / 32gb / RTX3080 so stuck Ubuntu on it and testing a few local LLMs.
Gemma4:12b is pretty impressive, anything else I should check out?
Unused Ryzen 9 / 32gb / RTX3080 so stuck Ubuntu on it and testing a few local LLMs.
Gemma4:12b is pretty impressive, anything else I should check out?
Google should have named the 4 QAT series Gemma 4.1. Most people use quantized models (for a good reason!), and QAT, as verified by WebBrain’s benchmarks, is significantly superior to the original model, just like Qwen 3.6 is superior to Qwen 3.5.
https://www.webbrain.one/blog/gemma-4-31b-qat-planner-benchmark
#LocalLLM
#AI #BrowserAutomation
#WebBrain #OpenSource #LLM #Gemma4 #Gemma #Qwen
RE: https://social.wildeboer.net/@jwildeboer/116775461671762518
I've known about this back in early 2024 and it was pretty awesome first time I tried it, albeit with a really small model since I only had a 1050 Ti, Ryzen 3 1300X and 16 gigs of ram back then. I was getting like < 5 tk/s.
I mean it's to be expected but it's better than nothing when we don't have internet which happens sometimes.
This stuff's been out there for a while that I'm a little surprised people are only catching up with local stuffs
🔥 We just published our Q4 local planner benchmark comparing local AI models for browser automation:
• DiffusionGemma-26B-A4B-it: 0.35s median, 84% accuracy — fastest!
• Gemma 4 12B Coder: 0.40s median, 84% accuracy
• Cohere North-Mini-Code 1.0: 0.38s median, 84% accuracy
All three tied on accuracy but DiffusionGemma was the fastest.
Full benchmark: https://www.webbrain.one/blog/local-planner-q4-june-2026
2am. Triggered two model pulls, a 70B load, a cluster of cloud API agents, and seven daemons. All at once. 96GB unified memory.
Kernel panic.
Not 'do the models fit in RAM?' — fragmentation, in-flight buffers, filesystem cache, kernel allocations. All sharing the same pool. All spiking together.
Two queues. Local-heavy: serial. Cloud API: bounded parallel. Never cross-mix.
The question is, is building a NAS next or a local LLM for my home assistant next.
Can your machine run a #localllm https://llmfitcheck.com/