iPhone 17 Pro Demonstrated Running a 400B LLM
iPhone 17 Pro Demonstrated Running a 400B LLM
Only way to have hardware reach this sort of efficiency is to embed the model in hardware.
This exists[0], but the chip in question is physically large and won't fit on a phone.