I audited the source for the two projects I built today with Codex in Xcode and it's hard to find anything I could classify as 'slop'. It perhaps speaks to how well Cocoa development is structured, how consistent everything is, but it all just looks… normal. I would have no problem whatsoever building upon and maintaining by hand what I've got here — there are no weird hacks, nothing that makes me scratch my head, nothing I wouldn't have done myself. The horror stories may just be horror stories

@stroughtonsmith
I think it’s also interesting that the well-structured Xcode and general corpus that was likely used in post training can be exploited by current model search / exploration as well as LLM generation.

My guess is that the corpus is largely code that that works and makes good use of idioms that you might use yourself—along with many others. The probability of generating slop is lower. This may make code generation a sweet spot for current LLM tech.