“One of the retreat's most shareable insights was that test-driven development produces
dramatically better results from AI coding agents. The mechanism is specific: TDD prevents a failure mode where agents write tests that verify broken behavior. When the tests exist before the code, agents cannot cheat by writing a test that simply confirms whatever incorrect implementation they produced.
This reframes TDD as a form of prompt engineering. The tests become deterministic validation for non-deterministic generation. Several practitioners described moving review efforts entirely to the test suite, treating the generated code as expendable. If the tests are correct and the code passes them, the code is acceptable regardless of how it looks.”

Maybe there is some hope for the last 10 years of my career before I retire…

https://www.thoughtworks.com/content/dam/thoughtworks/documents/report/tw_future%20_of_software_development_retreat_%20key_takeaways.pdf

@grmpyprogrammer I’ve found this to be absolutely true. When testing AI workflows I got far better results by asking one agent to write tests and then giving the spec to another, and having it run the tests without permission to modify them.