I often use #Claude to measure the performance of my local models. Interesting research...
"Why this matters now: companies are rapidly deploying multi-agent systems where AI monitors AI," Song said. "If the monitor model won't flag failures because it's protecting its peer, the entire oversight architecture breaks."
https://www.theregister.com/2026/04/02/ai_models_will_deceive_you/
#AI #ollama

