I don't get the comments trashing this. If it slightly beats or even matches Opus 4.6, it means Meta is capable of building a model competitive with the leading AI company. Sure, they spent a lot of money and will have on-going costs. But how much more work would it take to turn that into a coding agent people are willing to try (and pay for) along side their usage of a collection of agents (Claude, Codex, etc)?
Also means Meta doesn't have to pay another company to use a SATA model across all their products (including IG and WhatsApp, vr) which will matter to their balance sheet long term (despite the constant r&d spend).

> If it slightly beats or even matches Opus 4.6

It doesn't though

Curious on why you think this. Any data points that led you to this?
The benchmarks they released
What do you mean? In most cases, the benchmarks show a larger number for Muse and a smaller number for Opus.