The main reason I expect roughly the current paradigm (multimodal large language models with tools and inner monologue, trained via self-supervised learning and reinforcement learning) to go all the way isn’t just the fact that it’s currently achieving tasks that, only a few years ago, basically everyone familiar with the field would have called “AI-complete” (i.e. if you can do them, you’ve basically solved AI). 1/n