btw, “E cores having smaller, slower, efficient AMX coprocessors” sounds strange to me—but what do I know…
I suppose a speed mismatch would be a problem.
My understanding is ARMv9(.2-A)’s 70 or so SME/SME2 instructions happen on the CPU like any other instruction; Apple’s undocumented AMX instructions on the other hand happen on its dedicated Matrix coprocessor—a block stock ARM Cortex SoCs lack.
It may be an unpopular opinion, but I think Apple’s right to steer devs to Accelerate…