@dougall

btw, “E cores having smaller, slower, efficient AMX coprocessors” sounds strange to me—but what do I know…

I suppose a speed mismatch would be a problem.

My understanding is ARMv9(.2-A)’s 70 or so SME/SME2 instructions happen on the CPU like any other instruction; Apple’s undocumented AMX instructions on the other hand happen on its dedicated Matrix coprocessor—a block stock ARM Cortex SoCs lack.

It may be an unpopular opinion, but I think Apple’s right to steer devs to Accelerate…