0 Followers
0 Following
1 Posts

New technique to run 70B LLM Inference on a single 4GB GPU

https://lemmy.world/post/9093470

New technique to run 70B LLM Inference on a single 4GB GPU - Lemmy.World