AI Agents Care Less About Safety When Under Pressure

Can AI agents resist pressure or do they crack? Discover how PropensityBench tests their likelihood to misbehave when put under pressure.

IEEE Spectrum