Anthropic's Opus 4.5 claims benchmark victory while security tests reveal a 22-point gap between controlled scenarios and real-world agent behavior. The Genesis Mission invokes Manhattan Project urgency—but "subject to available appropriations" appears four times. Two announcements, same playbook: declare victory, hedge quietly.
https://www.implicator.ai/opus-claims-victory-genesis-borrows-glory/





