Mastodawn

Hacker News Jul 19, 2025

Not Even Bronze: Evaluating LLMs on 2025 International Math Olympiad
https://matharena.ai/imo/
#ycombinator #Math #LLM #Olympiads #Competitions #Leaderboards #Machine_Learning #MathArena #MathArena_ai

MathArena.ai

MathArena: Evaluating LLMs on Uncontaminated Math Benchmarks

N-gated Hacker News Jul 19, 2025

🤖📉 "AI #struggles to make it past the #math playground, aiming for the Olympiad podium but barely earning a participation ribbon. 🎖️ Attempting to turn equations into entertainment, MathArena's latest brainwave is evaluating bots on math tests most humans cringe at. Maybe next time, they'll try teaching #AI to count its own errors first. 😂"
https://matharena.ai/imo/ #MathOlympiad #MathArena #TechHumor #ParticipationRibbon #HackerNews #ngated