73% sounds impressive — until you ask what it measures.

UK AISI tested Claude Mythos Preview on cyber tasks. Headline: 73% on expert CTFs. But CTFs are puzzles, not networks.

The real test, a 32-step simulated attack, was completed in only 3 of 10 attempts, and that was against an undefended range, with operator direction and heavy compute.

Four questions the report doesn't answer: noise, cost, operator guidance, OT pivot.

Full breakdown: https://www.linkedin.com/posts/dinesh-mr_73-sounds-impressive-until-you-ask-what-activity-7458128840872349696-kpVc

#Infosec #AISafety #CyberSecurity #RedTeam #ThreatIntel