NEW BIML Bibliography entry
https://arxiv.org/abs/2505.03335
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Zhao, Andrew, et al. (China)
Learning from an environment that is sufficiently formal but still open. This approach will be limited to formal domains. The constraints allow us to ignore fundamental aspects of Turing completeness. The authors need to study more computer science.
#LLM #Representation #AI-Philosophy
