NumKong: 2'000 Mixed Precision Kernels For All 🦍

Over 2'000 SIMD kernels for mixed-precision BLAS-like numerics across 7 languages β€” from Float6 to Float118, on RISC-V, Intel AMX, and Apple SME, in 5 MB.

Ash's Blog