wrote a blog post trying to understand how local LLMs work so I can (in part 2) run a couple and squeeze as much performance out of them as possible on meager hardware

it's 1800 words, no pictures, very boring, i don't blame you if you don't read it, i wrote it mostly for my own amusement

https://blog.decryption.net.au/posts/local_llms_1.html

My Notes on Running LLMs Locally (Part 1)

As Rage Against The Machine wisely said, know your enemy.

decryption's blog
@decryption this is exactly the level of detail i needed, thank you
@decryption I run Qwen 3 and Jan code 4B parameter LLMs on my i7 + RTX 2060. Quite nice and doesn't heat up my GPU
@decryption can I get my LLM to summarise it?
@decryption This was great, thanks!
@wallamba glad someone found it useful :)
writing part 2 now and damn, anything remotely useful is expensive - best bet is a tidy Ryzen AI Max+ 395 with 128GB of RAM off AliExpress for $3500-ish