At this point in time the only benchmark that I trust will truly measure how close we are to AGI is this:

https://youtu.be/3T4OwBp6d90?si=VgCnsnbuicFG07lI

Interactive Reasoning Benchmarks | ARC-AGI-3 Preview

YouTube