Debunking Devin: "First AI Software Engineer" Upwork lie exposed!

https://lemmy.world/post/14198282

YouTuber Internet of Bugs examines the latest demo from Cognition [https://www.cognitionai.ai/] that showcases their “first AI software engineer” allegedly solving Upwork programming tasks.

Am I one of the few who just doesn’t use AI at all? I don’t have to generate tons of code for work at the moment, and the brand new projects I’ve been given are small, meaning I wouldn’t necessarily use it to generate starter boilerplate. I have coworkers who love Copilot or spend much longer prompting ChatGPT than they would if they wrote the code themselves. A majority of my time is spent modelling the problem, gathering requirements, and researching others’ solutions online (likely this step could be better AI-assisted?), not actually implementing a solution in code.

Anyway, I’m not super anti-AI in software development, and I see where it could be useful. Maybe it just isn’t for me yet. The current hype around it, as well as the attitude of big-tech exceptionalism (“AI can solve all our problems”), feels a bit like a bubble, at least regarding the current generation of LLMs and ML.

One way it can be useful is when you use it as a more verbal variant of rubber duck debugging. You’ll need to state the issue that you’re facing, including the context and edge cases. In doing so, the problem will also become more clear to you yourself.

Contrary to a rubber duck, it can then actually suggest some approach vectors, which you can then dismiss or investigate further.

This is how I use LLMs right now, and there have been a few times it’s been genuinely helpful. Mind you, most of the time it’s been helpful, it’s because it hallucinates some nonsense that gets me in the right direction, but that’s still at least a little better than the duck.

That was my experience as well with GPT 3.5. But the hit ratio is a lot better with GPT 4, and other models like Mixtral and its derivatives.