Just say “Wait…” – and your LLM gets smarter?!
We explain how just 1,000 training examples + a tiny trick at inference time = o1-preview level reasoning. No RL, no massive data needed.
🎥 Watch now → https://youtu.be/XuH2QTAC5yI
s1: Simple test-time scaling: Just “wait…” + 1,000 training examples? | PAPER EXPLAINED

YouTube