RT @__tinygrad__: tinygrad wird das C für dich schreiben. Unser neuer Treiber kompiliert alle Interaktionen mit der GPU in C, sodass die CPU im laufenden Betrieb fast nichts mehr tut. Elon Musk (@elonmusk) SpaceX hat fast die V1.0 eines in-house entwickelten AI-Trainingsstapels in C fertiggestellt, der exakt auf 220k GB300s mit 800G-NICs abgebildet ist und stark von Pipeline-Parallelismus Gebrauch macht, um so nah wie möglich an die Bare-Metal-Leistung heranzukommen. Der potenzielle Geschwindigkeitsvorteil gegenüber JAX bei großen Trainingsläufen liegt über einer Größenordnung. — https://nitter.net/elonmusk/status/2059884150187053488#m

mehr auf Arint.info

#AI #CProgramming #GPU #SpaceX #tinygrad #arint_info

https://x.com/__tinygrad__/status/2060064595465912434#m

GitHub - quantbagel/gtinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️

You like pytorch? You like micrograd? You love tinygrad! ❤️ - GitHub - quantbagel/gtinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️

GitHub

At long last, the blog post I've been working on for what seems like forever is finished!

https://cprimozic.net/blog/growing-sparse-computational-graphs-with-rnns/

It's packed with lots of really cool stuff: ML #interpretability, #grokking, #tinygrad, #graphviz, and more

Growing Bonsai Networks with RNNs

A summary of my research and experiments on growing sparse computational graphs by training small RNNs. This post describes the architecture, training process, and pruning method used to create the graphs and then examines some of the learned solutions to a variety of objectives.

Casey Primozic's Blog

Did some #machinelearning
#benchmarks on my 7900 XTX GPU and wrote up notes: https://cprimozic.net/notes/posts/machine-learning-benchmarks-on-the-7900-xtx/

TL;DR performance on AMD GPUs is pretty bad, but it's likely a software issue.

I tested the raw hardware using some scripts from #tinygrad, and the raw FLOPS are excellent.

Machine Learning Benchmarks on the 7900 XTX

I recently upgraded to a 7900 XTX GPU. Besides being great for gaming, I wanted to try it out for some machine learning. It’s well known that NVIDIA is the clear leader in AI hardware currently. Most ML frameworks have NVIDIA support via CUDA as their primary (or only) option for acceleration. OpenCL has not been up to the same level in either support or performance. That being said, the 7900 XTX is a very powerful card.

Casey Primozic's Notes

Speaking of #tinygrad, I've been hanging out in that Discord server for the past month or so

Very unique project. The creator and maintainer geohot is an interesting person for sure.

Lots of good ideas, and the only place I've actually seen all the way from the top to the bottom of a GPU-powered ML stack before

Not a good place to go to learn though; very much a do-it-yourself vibe and noobs aren't tolerated at all

I've been making more progress on the sparse RNN training and visualization

Working on the blog post now. Lots of cool stuff went into this from custom activation functions, custom regularizers, the new machine learning library #tinygrad, #graphviz, #webgl, and more

Here, it learned a gated 3-state state machine coupled with other neurons that perform a different boolean operation depending on the current state

#rnn #machinelearning

Have you ever tried https://open-assistant.io/chat ? Two days ago I was listening to George Hotz at the Lex Fridman #podcast and he was talking about putting a 65b 🦙 model in the Tiny box-a PC assembled by the Tiny corp- that would run #tinygrad. I highly recommend listening to him. He ran out of #twitter, he tries to make #selfdriving affordable to everybody and he's into open sourcing #AI. Here is the #podcast -> https://youtu.be/dNrTrx42DGQ
Sign Up - Open Assistant

Conversational AI for everyone. An open source project to create a chat enabled GPT LLM run by LAION and contributors around the world.

the tiny corp raised $5.1M

Here we go again. I started another company. The money is in the bank.

the singularity is nearer

Making good progress with my visual search feature in TinyUX.

The simple neural net is stored on the device. It is now only trained on letters and numbers. Will need to train for all icons.

I use the same interface to draw content for the neural network to train on. Still quite number of hours of work ahead for creating that training content.

#neuralnet #ux #icons #pytorch #reactnative #tinygrad #tinyux