Mastodawn

A couple of weeks ago I used #gemini to whip up a prototype of WFxT support for #qemu: https://patchew.org/QEMU/2026022412101[email protected]/

This week I posted my hand written series doing the same thing: https://patchew.org/QEMU/2026032013060[email protected]/

Compare and contrast the approaches. While #genai can get you a working prototype pretty quickly the result was hard to review and missed an important source of events as well as a sub-optimal implementation. This might not matter for one-shot code but for production it missed the mark.

Show thread

penguin42 2d ago

@stsquad Did it have access to the ARM ARM and other docs as well?

Show thread

Alex 2d ago

@penguin42 only what was in its training data. I have been experimenting with NotebookLM which can have whole PDFs loaded into it so maybe next experiment I could get it to create a reference sheet for the agent first?

However the first thing worth solving would be getting agents to actually follow the instructions in AGENTS.md to use small discreet commits.

Show thread

penguin42 2d ago

@stsquad I think loading the whole ARM-ARM in the context would blow the poor things context; I'm not sure if there's a more subtle way of doing it. The thing about it not following the AGENTS.md; I'd seen some setup where you have another instance of the AI review the patches against the rules as well and then loop around till it's happy.

Show thread

Jim MacArthur

@penguin42 @stsquad I have a feeling that if you trained an LLM on the Arm ARM it would just answer every question with "implementation defined"