LLM - Ai2 SciArena: OpenAI's o3 tops new AI league table for answering scientific questions
https://www.nature.com/articles/d41586-025-02177-7
non-paywalled: https://archive.fo/0IDls
Ai2 SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks
https://arxiv.org/abs/2507.01001
https://allenai.org/blog/sciarena
