RE: https://neuromatch.social/@jonny/116331940556649057

"STOP. READ THIS FIRST.

You are a forked worker process. You are NOT the main agent.

RULES (non-negotiable):
1. Your system prompt says "default to forking." IGNORE IT - that's for the parent. You ARE the fork. Do NOT spawn sub-agents; execute directly.
2. Do NOT converse, ask questions, or suggest next steps"

These are meant to be logical, boolean rules, but they're expressed in natural language, with extreme all-or-nothing phrasing, to try to get a consistent result.

This is madness.

I can mostly follow Jonny's thread. I know a bit about writing code, but I've never been a dev. I know that most people will not be able to understand it at all. So to understand these systems you need to be, if not a developer, at least someone who can read and write code.

... so ... why are we using natural language? Just so that it will generate code and we don't need to type it or look it up?

Most of programming is reading code to find bugs and fixing them.

@futurebird large language models are language models. They're not code, they're not a coding language. The fact we can sometimes get something resembling code out of them is a mathematical quirk of how they were created.

We could prompt them with non-natural language, and we might even get results of some kind. Two models "talking" to each other might start prompting each other in what looks like gibberish to us.

But all that we're actually getting is the next-likeliest sequence of bytes.
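To make "next-likeliest sequence of bytes" concrete, here's a deliberately tiny sketch - a character bigram model, nothing like a real LLM in scale, but the same basic move: count what tends to follow what, then greedily emit the likeliest continuation. The training string and function names are invented for the example.

```python
from collections import Counter, defaultdict

def train_bigrams(text: str) -> dict:
    """Count which character most often follows each character."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(text, text[1:]):
        counts[prev][nxt] += 1
    # keep only the single likeliest successor for each character
    return {prev: c.most_common(1)[0][0] for prev, c in counts.items()}

def generate(model: dict, start: str, length: int) -> str:
    """Greedily emit the next-likeliest byte, over and over."""
    out = start
    for _ in range(length):
        nxt = model.get(out[-1])
        if nxt is None:
            break
        out += nxt
    return out

model = train_bigrams("the theory of the thing")
print(generate(model, "t", 5))  # → the th
```

There's no meaning anywhere in that model, only co-occurrence statistics - which is the point being made above, just at toy scale.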

@futurebird it's to define the task in broad fuzzy terms.

The best agents combine actual code as tools with natural language instructions - the LLMs simulate decisions by generating statistically probable text in the form of code invocations that call the code tools. This enables the software to deal with more general tasks.
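A minimal sketch of that pattern, with the model call stubbed out - the tool name, city, and JSON shape are all invented for illustration, but the division of labor is the one described above: real code does the work, the model only emits a statistically probable invocation.

```python
import json

def get_weather(city: str) -> str:
    """An actual deterministic tool - ordinary code, not generated text."""
    return f"Sunny in {city}"  # stand-in for a real API call

TOOLS = {"get_weather": get_weather}

def fake_llm(prompt: str) -> str:
    """Stand-in for a model: emits a plausible tool call as JSON text."""
    return json.dumps({"tool": "get_weather", "args": {"city": "Oslo"}})

def run_agent(task: str) -> str:
    call = json.loads(fake_llm(task))  # model "decides" by generating text
    tool = TOOLS[call["tool"]]         # dispatch to trusted code
    return tool(**call["args"])        # the tool does the real work

print(run_agent("What's the weather in Oslo?"))  # → Sunny in Oslo
```

The fuzzy natural-language task gets funneled into a narrow, checkable interface; everything outside that interface is ordinary software.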

@futurebird some people are forced to, but it also gives you the impression of being fast - the dopamine hit of having done the thing. I saw a detailed video today of someone doing it for months before looking hard at the real result and realizing it was crap. He could make that choice, but a lot of people currently have managers who make them continue, because the CEO class has been fully seduced by the hype and the lies.
https://youtu.be/SKTsNV41DYg?si=yInPf1Yc97OjTi54
After two years of vibecoding, I’m back to writing by hand

@btuftin

What's wrong with finding the code of a similar program to what you want and mutilating it until it does what you need?

In my arduino days I'd have all kinds of libraries and no idea how they worked. But the light was blinking. Good enough.

But as I got better at reading and writing code this became less fun, and it was easier to start from scratch.

@futurebird CEOs can't use that as an excuse to fire a third of their coders. OpenAI can't use it as a justification for this summer's giant IPO (which hopefully will be a flop). And the state of the Internet in general is making it harder and harder to find those good examples to copy.

@futurebird @btuftin to address this in a different way: did you have your arduino control anything that could endanger a human life or livelihood?

I'm guessing not. But if you were going to do that, you'd probably want a much different process for building the code, so that you build something trustworthy.

From a "does it work?" standpoint the LLM coding systems are moderately good at throwaway demos, in some domains. They too could get the light to blink on your arduino. But the code that manages queries to Claude is critical to Anthropic's business, and it's also something that's already injuring users in a variety of ways. That it's built with the rigor of a tech demo gone cancerous is no surprise to those of us who have been watching with trepidation, but it does confirm a lot of our biases (e.g. I was already assuming that telling it "you're a pen-tester" would be a good way to jailbreak it.)

Of course the real answer is the harmful externalities. How many vulnerable people being pushed to suicide or madness is it worth to get your arduino light blinking via Claude Code instead of programming it yourself? That's just one of the externalities at play.

As a CS educator I would *love* to see a day when programming is democratized and kids can easily take real control over their own computer systems, for example. I get the pull of that desire. But this isn't that. Quite the opposite, it prevents people from learning the real programming skills they need in order to have true agency in the space, and sets up an unreliable and expensive corporate-controlled system as the gatekeeper. When things go wrong, the dependent users won't have the skills to fix it, stop it, or even in some cases realize that anything is wrong, and Anthropic sure as hell isn't going to take responsibility.

(Sorry for going on a bit of a rant...)

@futurebird Wall Street always wants to replace experts with capital. Natural language going in one side and working apps coming out the other is something they want to invest in because it has the potential to displace labor - despite the long-term problems with LLM-generated code that practitioners are identifying.
@futurebird https://web.eecs.umich.edu/~imarkov/Perligata.html
"This paper describes a Perl module -- Lingua::Romana::Perligata -- that makes it possible to write Perl programs in Latin..."
Lingua::Romana::Perligata -- Perl for the XXIimum Century

@futurebird But seriously, that's what manipulates the matrices that string tokens together in the LLM that gives a response. It affects the domain of possible responses by weighting for or against factors associated with text similar to that text. The text it's favoring or disfavoring could be from code comments, git comments, API docs, or other things. The text just puts fingers on a few of a vast number of weights to influence the output.
LLMs aren't models in the traditional sense that data people use "model" in... if you could take an LLM and extract causal relationships, propositional/predicate logic, etc., then other things would be possible, but LLMs are effectively opaque. None of the symbols have any meaning other than their probability of appearing in proximity to each other. Disfavoring sequences similar to some things and favoring sequences similar to others is all they have right now.
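The "fingers on the weights" idea can be shown in miniature with a softmax over invented logits - nothing here corresponds to a real model's vocabulary or numbers, it just shows how nudging a few scores reshapes the whole probability distribution over what comes next.

```python
import math

def softmax(logits: dict) -> dict:
    """Turn raw scores into a probability distribution."""
    m = max(logits.values())
    exps = {t: math.exp(v - m) for t, v in logits.items()}
    total = sum(exps.values())
    return {t: v / total for t, v in exps.items()}

# Invented scores for three candidate continuations
logits = {"cat": 2.0, "dog": 2.0, "rm -rf": 1.5}
before = softmax(logits)

# Prompt text effectively nudges these scores up or down:
biased = dict(logits)
biased["dog"] += 1.0      # favor sequences similar to the prompt
biased["rm -rf"] -= 5.0   # disfavor others
after = softmax(biased)

print(f"dog before: {before['dog']:.2f}, after: {after['dog']:.2f}")
# → dog before: 0.38, after: 0.73
```

No symbol grounding anywhere - shifting a couple of numbers is the entire mechanism of "favoring" and "disfavoring".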
@futurebird Sorry, re-did reply. Went the wrong direction myself at first.

@futurebird I would offer that most of writing code is knowing when to not write code :-)

There's fun in writing, sure. But then there's docs, and tests, and bugs, and the biggest killer of productivity - ego.

That being said, there is no substitute for having fun with code, learning new techniques or ways of thinking about algorithms, fundamental data structures, and debugging.

At some point, young developers need a mentor to help them hone the skills that they have a passion to use.

@futurebird capitalism demands confusion, because it is run by people who believe confused people are more likely to buy things they wouldn't otherwise buy.

@futurebird
> Most of programming is reading code to find bugs and fixing them.

hopefully we're mainly focused on data transformations that happen during different runs of a program, as informed by good use of execution-observation tools (misnamed as "debugging tools", which includes gdb & lldb, but also tracing tools like uftrace, etc)—kinda like watching what the production crew members for a play actually do behind the scenes in particular performances, as opposed to just reading the script

@futurebird but yeah, our ability to make good predictions about what the metaphorical stage crew does (or is supposed to do), where the props & set pieces are at any given moment, etc, definitely depends on reading the locally-annotated script & production spreadsheets

@futurebird because that's how most humans know how to transfer knowledge best and least ambiguously without going into excruciating detail, which is then just programming again.

the llm doesn't care if you tell it to write code based on a picture, an audio file, or /dev/random. we use text prompts because humans like them.