Mastodawn

#AIEngineering #llmlimits #aihype

https://www.theregister.com/2025/11/07/measuring_ai_models_hampered_by/

AI benchmarks are a bad joke – and LLM makers are the ones laughing

: Study finds many tests don't measure the right things

The Register