Run a 1T parameter model on a 32gb Mac by streaming tensors from NVMe
Link: https://github.com/t8/hypura
Discussion: https://news.ycombinator.com/item?id=47504695
Run a 1T parameter model on a 32gb Mac by streaming tensors from NVMe
Link: https://github.com/t8/hypura
Discussion: https://news.ycombinator.com/item?id=47504695