Run a 1T parameter model on a 32gb Mac by streaming tensors from NVMe

https://github.com/t8/hypura

#HackerNews #Run #a #1T #parameter #model #on #a #32gb #Mac #by #streaming #tensors #from #NVMe #https://github.com/t8/hypura #MachineLearning #Tensors #NVMe #Mac #Optimization

GitHub - t8/hypura: Run models too big for your Mac's memory

Run models too big for your Mac's memory. Contribute to t8/hypura development by creating an account on GitHub.

GitHub