AI Sleeper Agents: How Anthropic Trains and Catches Them

YouTube