Mastodawn

One of the weirder things in "AI" is the whole "skills" thing. It sounds as if those were somehow little programs or tools integrated into your agents but they are just markdown files with strings that you pull into your prompt to hope the slop machine does better. Crazy shit.

Show thread

Joonas Sarajärvi Mar 17

@tante yep, and when even the top models seem to very easily disregard simple instructions in a short and simple system prompt, I highly doubt that it works well to shovel more stuff in and expect very skilled results.

Show thread

Pseudo Nym

@muep @tante

Thing is, they don't "disregard instructions" at all. "Instructions" aren't part of their universe. It's completely orthogonal.

"Instructions" are just more tokens of context to complete. It is purely 100% a game of linguistics.

Prompts, skills, system messages, are all context, the pattern to be extended.

What trips us up is that it *sounds* like it is listening, processing, and doing things with concepts.

But only insofar as those concepts are encoded into next tokens.

Show thread

Joonas Sarajärvi Mar 18

@pseudonym @tante I know in the lower level it works like that, but models are still trained to react to instruction-like sequences so that it would be meaningful for humans to think in terms of giving instructions and getting the model to follow them.

It's a leaky abstraction, even more so that some other abstractions. But still one that in my view exists and what people build on. Or at least attempt to.