[ANN] PharoInfer is a fully in-image inference engine for Pharo. It loads a GGUF model file directly from disk and drives llama.cpp through UFFI — there is no HTTP server, no Ollama bridge, and no subprocess. Talk to the model straight from the image. https://github.com/pharo-llm/pharo-infer



