Mastodawn

Okay, could someone explain something to me please?

Why did ANYONE ever think “guardrails” would work?

We all know that blocklisting is suboptimal because you can’t possibly enumerate all the badness (see also: antivirus). And anyone who has had to write a statement of work that includes application security requirements knows how impossible THAT is without adding a whole textbook as an appendix. (Or just writing “Don’t do stupid shit with the code,” which covers it pretty broadly.)

Don’t do that. Or that. Or that, either. And not like that. Oh, we didn’t know you could do that! Don’t do that.

Seriously, why??

Show thread

🆘Bill Cole 🇺🇦

@wendynather They think “guardrails” will work because they think they are dealing with either a normal computer program (deterministic) or a sentient being (who can understand what they mean.)

As a human emulation tech, LLMs are demonstrably good enough to trick some humans, but they’ve never emulated actually understanding a request.