I just got llama2.c running on the Milk-V Duo. I compiled it with the official Milk-V toolchain and used the smallest model, stories15M; generation took about 10 minutes or so (I didn't time it).
However, this runs only on the CPU. With the built-in NPU we might get faster speeds, but that is outside the reaches of my brain.
A small developer exploring Homelabbing, Self-hosting, Retro computing & Electronics. I do Web development & Linux (servers).
I sometimes make digital art including 3D Modeling and 2D Designs.
| Website | https://imagineee.web.app |
| GitHub | https://github.com/imagineeeinc |
| Blog | https://tilde.green/~imagineee/blog |
| Keyoxide | https://keyoxide.org/6EFE93E6F8EE03FD3BD73E2060617F79669E74DA |

