“TurboQuant achieves perfect downstream results across all benchmarks while reducing the key value memory size by a factor of at least 6x. PolarQuant is also nearly loss-less for this task”
https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/
Certainly sounds like another jump in ai capability.
