What does this mean? It means that we now have a dedicated Matrix ASIC that can be used via standard opcodes/compilers, available to anyone with a relevant toolchain and compiler.
For the most part, expect all of your #BLAS kernels to gain support over time!
For #HPC in contrast with most matrix tile implementations, we have spec mandated single and double precision support.
That's in contrast with the x86 AMX extensions, most consumer dGPU implementations etc. which are 19 bits and below.