@koronkebitch Both models missed obvious signs. The slop papers have very carelessly put-together layout with plenty of whitespace, equations running into the margin and no beautiful diagrams or figures, but neither model noticed this. This makes it possible to identify AI-generated papers purely visually. Additionally, neither model commented on mistakes in the AI-generated calculi, lemmas or proofs, even though such mistakes likely exist and could be found by a human reviewer.