Damn those Mythos benchmarks seem very promising
Wild that they don't seem to be making it GA, makes me suspect it's probably actually not as good as they say
Qwen 3.6 is essentially the same as Opus 4.6 now so I guess we'll see how the new generation stacks up?
@pojntfx I really don't get the excitement around tech that destroys the earth more than we as humanity have in our history so far?

@justin The fix isn't to not use useful tools it's to a) deregulate clean energy infrastructure so that we expand them China-style and b) make sure that the models are open so you can run them on clean energy right now

This is the same argument like with EVs "but the grid is dirty" like yes. Fix that. Don't be anti-EV because of it

What's the fix for the people behind it explicitly having the goal of replacing the human mind as a tool of thought?

CC: @[email protected]
We're racing to build hell because some people find the current level of warmth a little bit useful, and others think they'll get rich selling fuel for the furnaces.

CC: @[email protected] @[email protected]

@ori @justin > What's the fix for the people behind it explicitly having the goal of replacing the human mind as a tool of thought?

I don't know tbh. They are not intelligent or sentient. I'm kind of hoping this is kind of self-evident by the quality of things produced w/o intention just being bad? Having the statistically most likely program to fix problem X isn't particularly interesting IMHO ...

@ori @justin I know exactly one example of "mass" adoption of vibe coded software out in the wild atm (mise) except for ofc the tools arounds LLMs themselves. And that's only for dev-focused tooling. Not a single one that regular people use

Adoption of those tools in e.g. MS has been a bit of a disaster as is well known even by regular people at this point

Lots of vibe-coded e.g. Nextcloud clones out there now, and despite Nextcloud's UX being terrible people still prefer it over the clones

@ori @justin Ultimately it's just a question of whether or not you've put the care and attention and actual labour into making something vs. if you haven't 🤷‍♀️ The same people that created Electron boilerplate slop TODO apps in the mid-2010s will continue to do the same thing in the mid-2020s. On the other hand, some Electron apps (esp. VSCode IMHO) are also pretty well liked because they work reasonably well, and I suspect the same will happen again in the future.