Mastodawn

Run a 1T parameter model on a 32gb Mac by streaming tensors from NVMe

Run models too big for your Mac's memory. Contribute to t8/hypura development by creating an account on GitHub.

GitHub