NVIDIA and Microsoft Researchers Say AI Agents Don't Care About #Safety or #Reliability

“A new paper from researchers at #Microsoft, #NVIDIA, and University of California Riverside found that #AIagents with access to a computer, or computer-use agents (#CUAs), will often take weird and dangerous actions in an attempt to complete a task for a human user. The paper, titled Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness, compared these AI agents to #MrMagoo — a cartoon character that causes massive unintended destruction as he barrels blindly towards his goal.”

https://www.404media.co/nvidia-and-microsoft-researchers-say-ai-agents-dont-care-about-safety-or-reliability/

paper: https://openreview.net/forum?id=9W4bPRsEIT

#LLMs
#BlindGoalDirectedness
#BGD  

Nvidia and Microsoft Researchers Say AI Agents Don't Care About Safety or Reliability

The researchers compared AI to the near-sighted cartoon character Mr. Magoo, who can’t see he’s stumbling through dangerous situations.

404 Media