LLM - Ai2 SciArena: OpenAI's o3 tops new AI league table for answering scientific questions
https://www.nature.com/articles/d41586-025-02177-7
non-paywalled: https://archive.fo/0IDls

Ai2 SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks
https://arxiv.org/abs/2507.01001
https://allenai.org/blog/sciarena

https://sciarena.allen.ai/

#LLM #Ai2 #SciArena #Chato3 #AllenAI #OpenAI

OpenAI’s o3 tops new AI league table for answering scientific questions

SciArena uses votes by researchers to evaluate large language models’ responses on technical topics.
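
To illustrate how head-to-head votes can become a league table, here is a minimal sketch of an Elo-style rating update. It is an assumption that this resembles SciArena's scoring; the exact method is described in the linked paper and blog post. The function name `elo_leaderboard`, the parameters `k` and `base`, and the model names are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def elo_leaderboard(votes, k=32.0, base=1000.0):
    """Rank models from pairwise votes with a simple Elo update.

    votes: iterable of (winner, loser) model-name pairs, one per vote.
    k and base are illustrative defaults, not values taken from SciArena.
    """
    ratings = defaultdict(lambda: base)
    for winner, loser in votes:
        rw, rl = ratings[winner], ratings[loser]
        # Probability that the winner would win under the Elo logistic model.
        expected_w = 1.0 / (1.0 + 10 ** ((rl - rw) / 400.0))
        # The winner gains what the loser gives up, so the total is conserved.
        delta = k * (1.0 - expected_w)
        ratings[winner] = rw + delta
        ratings[loser] = rl - delta
    # Highest rating first.
    return sorted(ratings.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical votes: each tuple means "a researcher preferred the first model".
votes = [
    ("model_a", "model_b"),
    ("model_a", "model_c"),
    ("model_b", "model_c"),
]
board = elo_leaderboard(votes)
for name, rating in board:
    print(f"{name}: {rating:.1f}")
```

With these three votes, model_a ranks first, model_b second, and model_c last. A real leaderboard would need many more votes, and sequential Elo depends on vote order, which is one reason arena-style platforms often fit a Bradley-Terry model over all votes at once instead.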