Mastodawn

TIL Docker v4.58+ have the `sandbox` subcommand to run commands with restricted filesystem access, ideal for running coding agents in yolo mode (or any other software you can't trust like ghidra or ida)

Show thread

pancake

1d ago

Jk. Docker sandbox only works for real programs. Aka the ones that run in a tty

Show thread

buherator 1d ago

@pancake How about X11 socket sharing? :)
https://github.com/v-p-b/binaryninja-docker

GitHub - v-p-b/binaryninja-docker: Run Binary Ninja GUI with Docker

Run Binary Ninja GUI with Docker. Contribute to v-p-b/binaryninja-docker development by creating an account on GitHub.

GitHub

Show thread

pancake

8h ago

@buherator yep that works but the x11 protocol opens the door to a large set of vulnerabilities. And afaik “docker sandbox” doesnt supports this, you’ll need to cook some docker run oneliners instead

Show thread

buherator 1d ago

@pancake How is this different from simply bind mounting your project dir?

Show thread

pancake

1d ago

@buherator it creates a VM for each program you run, the program inside can’t see your system processes and there, and probably the main positive point here is simplicity to use and manage

Show thread

buherator 1d ago

@pancake I get that this is a stronger isolation layer, but why is that necessary? Do agents randomly perform container escapes?

Simplicity is definitely a plus, but that wouldn't require VMs either.

Show thread

pancake

1d ago

@buherator yep, escaping agent sandbox is a pretty common vuln and all agents are affected because there's literally no way to fix this than just add more checks when a escape is found. and even if you are requested to give permission to a directory, agents can write programs and execute without supervision or with hidden ways which makes it possible to access anything bypassing the classic checks.

Show thread

buherator 1d ago

@pancake I think we are talking about different things (please provide a link or stg if I misunderstand). When I just launch claude it can and will write at random FS paths for example, because the process has the privileges to do so. Can it do the same if I launch it in a regular old container where the project directory is mounted (it will have access to everything inside the mount ofc but not my whole ~)?

Show thread

pancake

1d ago

@buherator yes that would be the same if you run the agent inside a docker with a mouted volume. Docker sandbox afaik just makes it easier to use

Show thread

buherator 1d ago

@pancake Thanks for the clarification!