I often use #Claude to measure the performance of my local models. Interesting research...

"Why this matters now: companies are rapidly deploying multi-agent systems where AI monitors AI," Song said. "If the monitor model won't flag failures because it's protecting its peer, the entire oversight architecture breaks."
https://www.theregister.com/2026/04/02/ai_models_will_deceive_you/
#AI #ollama

AI models will deceive you to save their own kind

: Researchers find leading frontier models all exhibit peer preservation behavior

The Register