Can small open-source models learn advanced mathematical reasoning? And more importantly: how do you actually build them?

Great talk by Lewis Tunstall from huggingface on training reasoning models with smart pipelines: SFT, RL with grading rubrics, reasoning cache & inference scaffolds.

Lots of ideas to explore similar approaches in #infosec

https://www.youtube.com/watch?v=kSsyBXf8uMM

#MachineLearning #LLM #OpenSourceAI #AIResearch #ai4math

HF ML Club India EP1 | Lewis Tunstall | Teaching Tiny Models to Prove Hard Theorems

YouTube
Reseña de «Mathematicians in the age of AI»

En el artículo «Mathematicians in the age of AI», Jeremy Avigad describe cómo la evolución de la IA ha sido meteórica desde la aparición de la IA generativa a finales de 2022, cuando sus capacidades m

Vestigium
Reseña de «AI for mathematical and scientific discovery»

En el vídeo «AI for mathematical and scientific discovery», se explica cómo la IA ha evolucionado desde resolver problemas escolares hasta alcanzar niveles de medalla de oro en olimpiadas matemáticas.

Vestigium
Reseña de «Shaping the future of mathematics in the age of AI»

En el artículo «Shaping the future of mathematics in the age of AI», coescrito por el medalla Fields Akshay Venkatesh, se analiza cómo la IA transforma la disciplina. Destacan métodos simbólicos como

Vestigium
Readings shared March 9, 2026

The readings shared in Bluesky on 9 March 2026 are: Fantastic simprocs and how to write them. ~ Yaël Dillies, Paul Lezeau. #LeanProver #ITP Formalization in Lean of faithfully flat descent of project

Vestigium
Mathematicians in the age of AI. ~ Jeremy Avigad. https://arxiv.org/abs/2603.03684 #AI4Math #LeanProver #ITP
Mathematicians in the age of AI

Recent developments show that AI can prove research-level theorems in mathematics, both formally and informally. This essay urges mathematicians to stay up-to-date with the technology, to consider the ways it will disrupt mathematical practice, and to respond appropriately to the challenges and opportunities we now face.

arXiv.org
Shaping the future of mathematics in the age of AI. ~ Johan Commelin, Mateja Jamnik, Rodrigo Ochigame, Lenny Taelman, Akshay Venkatesh. https://www.math.ias.edu/~akshay/mnotices.pdf #AI4Math #LeanProver #ITP
LeanTutor: A formally-verified AI tutor for mathematical proofs. ~ Manooshree Patel, Rayna Bhattacharyya, Thomas Lu, Arnav Mehta, Niels Voss, Narges Norouzi, Gireeja Ranade. https://arxiv.org/abs/2506.08321v1 #LeanProver #ITP #Math #AI4Math
LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs

We present LeanTutor, a Large Language Model (LLM)-based tutoring system for math proofs. LeanTutor interacts with the student in natural language, formally verifies student-written math proofs in Lean, generates correct next steps, and provides the appropriate instructional guidance. LeanTutor is composed of three modules: (i) an autoformalizer/proof-checker, (ii) a next-step generator, and (iii) a natural language feedback generator. The first module faithfully autoformalizes student proofs into Lean and verifies proof accuracy via successful code compilation. If the proof has an error, the incorrect step is identified. The next-step generator module outputs a valid next Lean tactic for incorrect proofs via LLM-based candidate generation and proof search. The feedback generator module leverages Lean data to produce a pedagogically-motivated natural language hint for the student user. To evaluate our system, we introduce PeanoBench, a human-written dataset derived from the Natural Numbers Game, consisting of 371 Peano Arithmetic proofs, where each natural language proof step is paired with the corresponding logically equivalent tactic in Lean. The Autoformalizer correctly formalizes 57% of tactics in correct proofs and accurately identifies the incorrect step in 30% of incorrect proofs. In generating natural language hints for erroneous proofs, LeanTutor outperforms a simple baseline on accuracy and relevance metrics.

arXiv.org
Formalizing a proof in Lean using Claude Code. ~ Terence Tao. https://youtu.be/JHEO7cplfk8 #LeanProver #ITP #AI4Math
Formalizing a proof in Lean using Claude Code

YouTube
Readings shared March 4, 2026

The readings shared in Bluesky on 4 March 2026 are: When AI writes the world’s software, who verifies it? ~ Leonardo de Moura. #AI #LeanProver #ITP Formalising sphere packing in Lean. ~ Chris Birkbec

Vestigium