Mastodawn

Trying to read the microGPT source code.
(https://karpathy.github.io/2026/02/12/microgpt/)

I love the idea of the project, but IMHO its readability suffers from:
- putting code on one line that would be more readable across multiple lines (probably to stay under 200 lines)
- using two- or three-character variable names, e.g. wo for weightsOutput, x for any vector (mathematical convention? copied from the original GPT-2 code?)
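
To illustrate the two complaints, here is a hypothetical contrast that is not taken from the microgpt source: the same matrix-vector product written once in a terse one-line style and once with descriptive names spread over several lines. The function and variable names (`linear`, `apply_linear`, `weight_rows`, `input_vector`) are invented for this sketch.

```python
# Terse style: short names, whole body on one line (hypothetical example).
def linear(x, w): return [sum(wi * xi for wi, xi in zip(wo, x)) for wo in w]


# Expanded style: the same computation with descriptive names.
def apply_linear(input_vector, weight_rows):
    """Multiply a weight matrix (a list of rows) by an input vector."""
    output_vector = []
    for weight_row in weight_rows:
        # Dot product of one weight row with the input vector.
        dot_product = sum(
            weight * value for weight, value in zip(weight_row, input_vector)
        )
        output_vector.append(dot_product)
    return output_vector


# Both versions compute the same result for the same inputs.
example_input = [1, 2]
example_weights = [[1, 0], [0, 1], [1, 1]]
terse_result = linear(example_input, example_weights)
expanded_result = apply_linear(example_input, example_weights)
```

The expanded version is several times longer, which is the trade-off a strict line budget would push against.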

#microGPT #programming #style

microgpt

Musings of a Computer Scientist.

Habr Mar 5

Training a neural network online

A short article about how, impressed by the Habr article "Possibly the most ingenious Python code to date: dissecting Andrej Karpathy's 200-line microgpt", I decided to test it all in practice and ported the code to an online format with the help of a neural network.

https://habr.com/ru/articles/1007114/

#llmмодели #llmприложения #microgpt #javascript

Erik Jonker Mar 2

Such a fun demonstrator by Karpathy, MicroGPT.
http://karpathy.github.io/2026/02/12/microgpt/
#ai #microgpt #karpathy

Hacker News Mar 1

Microgpt explained interactively

https://growingswe.com/blog/microgpt

#HackerNews #Microgpt #Interactive #Explanation #Technology #AI #Learning

MicroGPT explained interactively

Walk through Karpathy's 200-line GPT from scratch. Tokenize names into integers, watch softmax convert scores to probabilities, step through backpropagation on a computation graph, explore attention heatmaps, and see a tiny model learn to generate plausible names.
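
The first two steps named above, turning names into integers and turning scores into probabilities with softmax, can be sketched in a few lines of dependency-free Python. This is a minimal illustration under stated assumptions, not the code from microgpt or from the linked walkthrough: the helper names (`build_vocab`, `encode`, `softmax`), the `<end>` marker, and the three sample names are all invented here.

```python
import math


def build_vocab(names):
    """Map each distinct character to an integer id, plus one end marker."""
    chars = sorted(set("".join(names)))
    stoi = {ch: i for i, ch in enumerate(chars)}
    stoi["<end>"] = len(chars)  # hypothetical end-of-name token
    return stoi


def encode(name, stoi):
    """Turn a name into a list of integer tokens, terminated by the end marker."""
    return [stoi[ch] for ch in name] + [stoi["<end>"]]


def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    highest = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(score - highest) for score in scores]
    total = sum(exps)
    return [value / total for value in exps]


names = ["emma", "ava", "mia"]  # made-up sample data
stoi = build_vocab(names)
tokens = encode("ava", stoi)
probs = softmax([2.0, 1.0, 0.1])
```

With these three names the vocabulary is the five letters a, e, i, m, v plus the end marker, so `tokens` is a list of four integers, and `probs` preserves the ordering of the input scores while summing to 1.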

sayzard Mar 1

Anders Marksen (@andersmarksen)

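Andrej Karpathy has released the ultra-lightweight GPT project 'microgpt' (link shared). It is an open-source/research result that aims for an LLM implementation so lightweight it would have been hard to imagine 20 years ago, and it can be seen as an innovative attempt that raises the possibility of use in low-resource environments or embedded applications.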

https://x.com/andersmarksen/status/2028051693271670964

#microgpt #andrejkarpathy #opensource #tinyllm

Anders Marksen (@andersmarksen) on X

imagine having this 20 years ago @karpathy's microgpt (https://t.co/Xf98fWWdCb)

X (formerly Twitter)

Jan D Mar 1

Tutorial on building a mini "AI" that generates plausible names after training on a list of 32,000 names.

https://karpathy.github.io/2026/02/12/microgpt/

#LLM #tutorial #microGPT

sayzard Mar 1

Justin H. Johnson (@BioInfo)

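A 'Full GPT' example implemented in 200 lines with no dependencies has been released: it includes a tokenizer, autograd, a transformer, the Adam optimizer, training, and inference. Karpathy has compressed years of ML knowledge into a single file, which is drawing attention from an educational and open-source perspective (link included).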

https://x.com/BioInfo/status/2028014151541629016

#microgpt #karpathy #opensource #gpt

Justin H. Johnson (@BioInfo) on X

200 lines. No deps. Full GPT: tokenizer, autograd, transformer, Adam, training, inference. Karpathy compressed a decade of ML education into a single file. https://t.co/vElkjs3RNn cc @karpathy @rasbt @cwolferesearch @natolambert

Hacker News Mar 1

Microgpt

http://karpathy.github.io/2026/02/12/microgpt/

#HackerNews #Microgpt #AI #Technology #GPT #Innovation #Machine #Learning

sayzard Feb 18

VictorSanchez (@VictorSanchez)

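Announces the launch of 'microGPT Studio', crediting @karpathy as the inspiration. The tool lets you train and visualize a small GPT in the browser and uses JavaScript, Rust/WASM, and WebGPU. A demo link is provided, giving a practical example of browser-based model training and visualization.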

https://x.com/VictorSanchez/status/2024038753950453794

#microgpt #browserml #webgpu #wasm #javascript

VictorSanchez (@VictorSanchez) on X

Huge thanks to @karpathy for the inspiration behind this project. I just launched microGPT Studio: train and visualize a tiny GPT in the browser (JavaScript, Rust/WASM, and WebGPU). Demo: https://t.co/FH95R3qr1D

sayzard Feb 16

vixhaℓ (@TheVixhal)

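An introduction to the 'microgpt.c' project, written with inspiration from Karpathy's microgpt. It is pure C code of about 600 lines that implements fully manual forward and backward propagation, with no external libraries or dependencies at all, so it runs on plain computation alone.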

https://x.com/TheVixhal/status/2023010781391163784

#microgpt #microgpt.c #c #opensource

vixhaℓ (@TheVixhal) on X

Inspired by @karpathy microgpt, I built microgpt.c with fully manual forward and backward propagation. It is about 600 lines of pure C with no external libraries or dependencies, just raw computational power.
