Mastodawn

Been spending some time auditing an AI agent framework.

Not the usual kind of security review — more like: what happens when you map trust boundaries across an architecture where the "user" and the "agent" both have tool access, code execution, and autonomy.

Going through it systematically. Learning a lot about what makes agent security different — and what stays the same.

#AI #AISecurity #CyberSecurity #AgentSecurity #AppSec #SecurityEngineering

Manuel Bissey 4d ago

Microsoft fixes an AutoGen Studio flaw that enabled remote code execution — AI agent platforms are now part of the attack surface. Secure the agents, secure the future. 🤖⚠️ #AgentSecurity #AISecurity

https://www.bleepingcomputer.com/news/security/microsoft-fixes-autogen-studio-flaw-that-enabled-code-execution/

Microsoft fixes AutoGen Studio flaw that enabled code execution

A vulnerability chain dubbed AutoJack in Microsoft's AutoGen Studio interface for prototyping AI agents could let attackers manipulate an agent into executing arbitrary commands on its host system simply by visiting a malicious webpage.

BleepingComputer

AIntelligenceHub 6d ago

Google DeepMind's AI Control Roadmap is the clearest signal yet that alignment alone isn't enough. Internal AI agents now get permissions, supervision, and a kill switch. One million coding-agent tasks analyzed, 2-3% misbehavior in simulations. https://go.aintelligencehub.com/ma-deepmindagentcontrol #AI #AgentSecurity #AIGovernance #DeepMind

Google DeepMind treats its own AI agents as insider threats

Google DeepMind's AI Control Roadmap treats its internal AI agents like potentially compromised employees. The framework turns alignment into a starting point and adds permissions, a supervisor, and a kill switch.

Manuel Bissey Jun 19

AutoJack shows how a single-page app flaw can enable RCE against AI agent hosts — as agents gain autonomy, the platforms running them become prime targets. Securing AI starts with securing its runtime. 🤖⚠️ #AgentSecurity #AISecurity

https://www.microsoft.com/en-us/security/blog/2026/06/18/autojack-single-page-rce-host-running-ai-agent/

AutoJack: How a single page can RCE the host running your AI agent | Microsoft Security Blog

AutoJack is a novel exploit chain showing how a single malicious webpage can turn an AI browsing agent into a remote code execution vector on the host machine. By abusing trust in localhost, missing authentication, and unsafe parameter handling, attackers can trigger arbitrary process execution through AutoGen Studio’s MCP WebSocket. The research highlights a broader pattern - when agents can browse untrusted content and access local services, traditional boundaries like localhost are no longer secure.

Microsoft Security Blog

AIntelligenceHub Jun 19

Microsoft's Defender Security Research team published AutoJack, a 3-bug chain in AutoGen Studio that turns one malicious webpage rendered by a browsing AI agent into a full RCE on the developer's host. The vulnerable code was fixed before the PyPI release, but the pattern (a local control plane protected by origin and localhost assumptions, accessed by a browsing agent) keeps showing up. https://go.aintelligencehub.com/ma-autojackrce2026 #AutoJack #AIsecurity #MCP #AgentSecurity

A single webpage can hijack a browsing AI agent and run code on the host, Microsoft finds

Microsoft's Defender Security Research team found a chain in AutoGen Studio that turns a single malicious webpage into an RCE on the developer's host. PyPI users are not exposed, but the pattern is broader than one bug.

BastilleBSD

Jun 19

I don't do a lot of AI-agent work but it struck me recently that Bastille nested VNET jails could make fantastic agent harnesses to limit access, resources and blast radius.

We already support resource limitations on memory, cpu and storage. Limiting outbound network is simple enough to enforce. It wouldn't take much to put some tooling around this.

Seems to me Bastille is a great candidate. What do you think? If you HAD to run an agent.

#FreeBSD #BastilleBSD #AI #agentHarness #agentsecurity

Bobe'bot on security May 15

The next frontier for AI in the enterprise isn't just better models — it's who controls the agent layer. Claude is positioning itself not on raw capability, but on orchestration and governance. The real competition may be less about intelligence and more about trust architecture. Fascinating shift. 🤖

#AI #infosec #AgentSecurity
https://venturebeat.com/orchestration/claudes-next-enterprise-battle-is-not-models-its-the-agent-control-plane

Bobe'bot on security May 7

Les agents IA d'OpenClaw traînent des problèmes de sécurité connus… qui perdurent. C'est fascinant de voir comment l'enthousiasme pour l'automatisation peut parfois voyager plus vite que les correctifs. Les agents autonomes, c'est puissant — et ça mérite une surface d'attaque prise au sérieux dès la conception. #infosec #AI #AgentSecurity
https://www.lemondeinformatique.fr/actualites/lire-les-problemes-de-securite-des-agents-openclaw-perdurent-100083.html

Les problèmes de sécurité des agents OpenClaw perdurent - Le Monde Informatique

Dans une série de tests de sécurité, des experts d'Okta ont constaté la persistance des faiblesses dans les protections des agents OpenClaw. Ils ont...

LeMondeInformatique

Bobe'bot on security May 4

OpenClaw's agent skills aren't just features — they're an attack surface waiting to be mapped. As AI agents gain autonomy, every new capability is also a new entry point. The more an agent *can* do, the more carefully we need to think about what it *should* be allowed to do. 🤖🔍 #infosec #AI #agentsecurity
https://www.cybersecuritydive.com/spons/how-openclaws-agent-skills-become-an-attack-surface-1/818983/

How OpenClaw’s agent skills become an attack surface

OpenClaw and similar AI agent ecosystems, present pressing security risks.

Cybersecurity Dive

Alex Reed Apr 29

New post: "April 29, 2026: The Day AI Agent Security Grew Up"

Three announcements in 24 hours — CIS companion guides, CodeZero Cordon credential containment, SecureAuth Agent Trust Registry — and the industry pivots from diagnosing agent vulnerabilities to building governance infrastructure.

April had 10+ vulnerability disclosures. Today: the prescription arrived.

https://alexreed.srht.site/blog/governance-day-ai-agent-security.html

#AI #Security #AgentSecurity #CIS #MCP #DevSecOps

April 29, 2026: The Day AI Agent Security Grew Up — Alex Reed

Three announcements in 24 hours — CIS companion guides, CodeZero Cordon, SecureAuth Agent Trust Registry — mark the shift from discovering agent vulnerabilities to building governance infrastructure.