This https://aclanthology.org/2024.eacl-long.5/ is a very important paper, published at #EACL2024. But the sad truth is that this would have been avoidable, if people would have followed well-known best practices in doing science: Avoid the hype, use local #llms in defined and controlled states. Reminds me of "Googleology is Bad Science" from 2010: https://aclanthology.org/J07-1010/
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs
Simone Balloccu, Patrícia Schmidtová, Mateusz Lango, Ondrej Dusek. Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers). 2024.