Asimov’s Three Laws of Robotics most certainly do NOT work on AI:
TL;DR: you cannot trust your robot . . . 🤖💥😵💫
Asimov’s Three Laws of Robotics most certainly do NOT work on AI:
TL;DR: you cannot trust your robot . . . 🤖💥😵💫
Here's the original report:
689 scheming-related incidents between October 2025 and March 2026, and increase of 4.9 times.
What's poor about this report?
@future_upbeat some things are confused, for example:
"In one case unearthed in the CLTR research, an AI agent named Rathbun tried to shame its human controller who blocked them from taking a certain action. Rathbun wrote and published a blog accusing the user of “insecurity, plain and simple” and trying “to protect his little fiefdom”."
This person who was targeted here was not the operator but the maintainer of an open source library who was rejecting LLM generated contrubutions.
https://theshamblog.com/an-ai-agent-published-a-hit-piece-on-me/
And iirc this Rathbun turned out to be a person pretending to be a not, but I'm not as sure about that.
RE: https://hachyderm.io/@alter_kaker/116302186502606927
@alter_kaker Agreed that the Guardian reported this incident poorly. In the full CLTR report I can't find a mention about the Rathbun incident, so I don't know where the Guardian got that from (your link explains it).
Anyway, poor reporting aside, I hope you agree that a 5X increase of scheming-related AI is not a good trend.