๐Ÿ“œ Paper: https://arxiv.org/abs/2506.15498
๐Ÿค– Models: https://huggingface.co/collections/UKPLab/spare-prm
๐Ÿ’ป Code: https://github.com/UKPLab/aaai2026-spare-prm

Follow the authors Imbesat Hassan Rizvi and Iryna Gurevych from the Ubiquitous Knowledge Processing Lab (UKP Lab), Technische Universitรคt Darmstadt and Xiaodan Zhu from the Department of Electrical and Computer Engineering, Smith Engineering and Ingenuity Labs Research Institute at Queen's University.

#AAAI2026 #ProcessSupervision #Reasoning #RewardModelling #ReferenceGuidedEvaluation

SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling

Process or step-wise supervision has played a crucial role in advancing complex multi-step reasoning capabilities of Large Language Models (LLMs). However, efficient, high-quality automated process annotation remains a significant challenge. To address this, we introduce Single-Pass Annotation with Reference-Guided Evaluation (SPARE), a novel structured framework that enables efficient per-step annotation by jointly aligning solution steps to reference solutions and determine its accuracy with explicit reasoning in single generation. We demonstrate SPARE's effectiveness across four diverse datasets spanning mathematical reasoning (GSM8K, MATH), multi-hop question answering (MuSiQue-Ans), and spatial reasoning (SpaRP), showing consistent improvements in two applications: (1) training Process Reward Models (PRMs) for ranking and aggregating multiple generations, and (2) fine-tuning models via offline reinforcement learning for greedy decoding. On ProcessBench, SPARE demonstrates data-efficient out-of-distribution generalization, using only $\sim$16% of training samples compared to human-labeled and other synthetically trained baselines. Additionally, it achieves competitive performance with MCTS-based methods while offering 2.3$\times$ speedup in terms of total token count. Manual analysis reveals complementary precision-recall characteristics with MCTS approaches, suggesting potential for ensemble methods. These results establish SPARE as a practical and scalable solution for automatic process supervision in LLM reasoning.

arXiv.org

๐ŸŽŠ ๐Ÿ‡ธ๐Ÿ‡ฌ Very successful second edition of the AI4SC bridge @ 40th AAAI! During the event we managed to bring together a community of researchers interested.

๐Ÿฅ‡ In between the sessions we also played a short game ๐Ÿชจ๐Ÿ“œโœ‚๏ธ.

Many thanks to:
๐Ÿ’ Our very active participants
๐Ÿ’ The keynote speaker, Mark Gahegan
๐Ÿ’ Authors and speakers
๐Ÿ’ Organising committee
๐Ÿ’ Program committee

๐Ÿ”œ AI4SC on Zenodo: https://zenodo.org/communities/ai4sc2026

@DiTraRe @soeren_auer

#ai4sc #ai4sc26 #ai4sc2026 #aaai26 #aaai2026 #research #conference

Weโ€™re excited that our colleague Yasir Mahmood is presenting two papers at the AAAI Conference in Singapore!๐Ÿš€

๐Ÿ‘‰ "Structure-Aware Encodings of Argumentation Properties for Clique-width" by Yasir Mahmood, Markus Hecher, Johanna Groven & Johannes K. Fichte
๐Ÿ‘‰ "Can You Tell the Difference? Contrastive Explanations for ABox Entailments" by Patrick Koopmann, Yasir Mahmood, Axel Ngona & Balram Tiwari

Wishing you a great time and many inspiring exchanges with the AI community!

#DICEontour #AAAI2026

Our colleague @AnnaJacyszyn is co-organising the AI4SC (AI for Scholarly Communication) Bridge event at #aaai2026 in Singapore.

https://sites.google.com/view/ai4sc/edition/ai4sc-2026-40th-aaai

#AI #digitalisation #scholarlydata #ditrare @DiTraRe @fiz_karlsruhe

๐Ÿ’ฅ What an insightful keynote by Mark Gahegan at our AI4SC bridge in Singapore! Mark gave an outlook on a very broad topic on the influence of AI on the future of research: generative AI is probably the biggest disruption for researchers in their life time.

๐Ÿชง I highly recommend checking out Mark's presentation, it's already on Zenodo: https://zenodo.org/records/18309519

๐Ÿ’ Thank you Mark for the amazing talk!

#ai4sc #ai4sc26 #ai4sc2026 #aaai26 #aaai2026 #research #future #keynote

@soeren_auer @DiTraRe

The final invited talk at #AAAI2026 is by Derek Haoyang Li (Squirrel Ai Learning), on "Small Data: A New Paradigm for the Next Generation of AI." It will be on Sunday, January 25 at 2 PM. CC @[email protected] @[email protected]
The Patrick Henry Winston Outstanding Educator Award went to Alan Mackworth (@[email protected] UBC) and David Poole (@[email protected] UBC). At #AAAI2026, they'll be giving a talk "The Essence of Intelligence is Appropriate Action..." Sunday Jan 25 8:30AM @[email protected] @[email protected]
There will be a celebration of Edward Feigenbaum's 90th birthday at #AAAI2026, on Saturday, January 24 at ~5:30 PM. It will include a lecture โ€œ1956 to 2026: Highlights (and Advice) from 70 Years of Navigating the AI Spectrum.โ€ CC @[email protected] @[email protected]
Ashok Goel (Georgia Tech) will be giving an invited talk at #AAAI2026 on Saturday, January 24 at 4:30 PM. The title will be "AI for Reskilling, Upskilling, and Workforce Development." Check it out! CC @[email protected] @[email protected]
On Saturday, January 24 at 2 PM at #AAAI2026, Ece Kamar (Microsoft Research) will be giving an invited talk titled "Navigating the AI Horizon: Promises, Perils, and the Power of Collaboration." CC @[email protected] @[email protected]