That ChatGPT passes exams is much more a reflection on exams than information about ChatGPT.

#WittgensteinsRuler

GPT-4 and professional benchmarks: the wrong answer to the wrong question

OpenAI may have tested on the training data. Besides, human benchmarks are meaningless for bots.

AI Snake Oil