I've been playing around with these llamafiles that collapse the whole local AI stack (weights + llama.cpp + runtime) into a single, multi-platform executable. Just download and run.
Really impressed by this Mozilla project, and glad to see momentum picking up again.
