Mastodawn

I think what a lot of AI critics are missing is that they're judging an LLM by its first draft. This is *not* what terrifies me about these machines.

What terrifies me is that you can ask them "find bugs in this PR." Or "find performance flaws." Or really anything.

Then have 3 agents (with different models ideally) vote on the result. Then have another fix it. Repeat until all bugs are clean.

If you haven't tried this experiment then you haven't reached the dark night of the soul that I have.

Show thread

Eugen Rochko Mar 7

@nolan What you're describing is not a skill or craft, it's a gacha machine. It's gambling. You're hitting spin until you win something. And it relies on similar code and programs being in its training data. It's copying from Stack Overflow with extra steps. It won't solve novel problems.

Show thread

G1itchbit Mar 7

@nolan @Gargron While I agree you, neither users nor businesses care about the skill or the craft. Never have. They get mad when the magic breaks and happy when the magic is fixed.

You may be able to find those that care, maybe some rich eccentrics. Communing with other devs perhaps.

Alternately, all these frontier models are operating at a loss. Maybe it won't be cost efficient after all once it stops being VC subsidized.

Show thread

Torb

@glitch @nolan @Gargron Considering how users have been complaining for years about the broken or user hostile state of the digital state of what they use I absolutely think they care. Not for it’s own sake, but because they want things to reliably work.

Show thread

Matt Campbell Mar 7

@torb @glitch @nolan @Gargron Case in point: I was at my favorite karaoke bar yesterday evening. I hang out with a group of mostly older folks (60s and up; I'm in my 40s). One regular, Kathy (70s), talked about a bug she found in Facebook on her Android phone and tablet: When someone posts text on a colored background, you tap to open the post, and the text *isn't there*. She tried googling the problem, clearing a cache, rebooting, etc. It clearly unsettled her. And that's a fairly benign bug.