Where does an AI math agent get its ability, the model or the orchestration around it? In the first large-scale test of formal proof search on open problems, an agent closed 9 of 353 Erdős problems in Lean. In its own ablation, a plain generate-and-verify loop solved all nine, while smaller models and the specialized prover alone solved nothing.

https://benjaminhan.net/posts/20260606-ai-formal-proof-search/?utm_source=mastodon&utm_medium=social

#AI #AIforScience #Mathematics #AgenticSystems

Advancing Mathematics Research with AI-Driven Formal Proof Search – synesis

The first large-scale test of LLM-driven formal proof search on open problems, with an AlphaProof-equipped agent closing 9 of 353 Erdős problems and 44 of 492 OEIS conjectures in Lean.

synesis