Mastodawn

2025 saw significant advancements in #LLMs, with #ReinforcementLearning from #VerifiableRewards (#RLVR) emerging as a key stage in training, leading to improved #reasoning capabilities. The industry also began to understand the unique “jagged” intelligence of LLMs, excelling in specific domains but lacking generalisation. https://karpathy.bearblog.dev/year-in-review-2025/?eicker.news #tech #media #news

2025 LLM Year in Review

2025 Year in Review of LLM paradigm changes

karpathy

Hacker News Jun 2, 2025

ReasoningGym: Reasoning Environments for RL with Verifiable Rewards

https://arxiv.org/abs/2505.24760

#HackerNews #ReasoningGym #ReinforcementLearning #VerifiableRewards #AIResearch #MachineLearning

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

We introduce Reasoning Gym (RG), a library of reasoning environments for reinforcement learning with verifiable rewards. It provides over 100 data generators and verifiers spanning multiple domains including algebra, arithmetic, computation, cognition, geometry, graph theory, logic, and various common games. Its key innovation is the ability to generate virtually infinite training data with adjustable complexity, unlike most previous reasoning datasets, which are typically fixed. This procedural generation approach allows for continuous evaluation across varying difficulty levels. Our experimental results demonstrate the efficacy of RG in both evaluating and reinforcement learning of reasoning models.

arXiv.org