“TurboQuant achieves perfect downstream results across all benchmarks while reducing the key value memory size by a factor of at least 6x. PolarQuant is also nearly loss-less for this task”

https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/

Certainly sounds like another jump in ai capability.

#jgshare

TurboQuant: Redefining AI efficiency with extreme compression