TurboQuant: Redefining AI efficiency with extreme compression

https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/

I did not understand what polarQuant is.

Is it something like pattern-based compression, where the algorithm finds repeating patterns and creates an index of those common symbols or numbers?

1. Efficient recursive transform of KV embeddings into polar coordinates.
2. Quantize the resulting angles without explicit normalization. This saves memory thanks to a key insight: the angles follow a known distribution with an analytical form.
Reminds me vaguely of the Burrows-Wheeler transform in bzip2.
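The two steps above can be sketched in code. This is a hypothetical reading of "recursive transform into polar coordinates": components are paired into (magnitude, angle), and the magnitudes are paired again until a single norm remains; the angles are then uniformly quantized. The pairing scheme and the uniform quantizer are my own assumptions, not the paper's actual construction.

```python
import numpy as np

def recursive_polar(x):
    """Hypothetical recursive Cartesian-to-polar transform.

    Each pair (a, b) becomes (sqrt(a^2 + b^2), atan2(b, a)); the
    magnitudes are paired up again until one overall norm remains.
    Assumes len(x) is a power of two.
    """
    angles = []
    while len(x) > 1:
        a, b = x[0::2], x[1::2]
        angles.append(np.arctan2(b, a))   # angles to be quantized
        x = np.hypot(a, b)                # magnitudes, recursed on
    return x[0], np.concatenate(angles)   # (overall norm, all angles)

def quantize_angles(angles, bits=4):
    """Uniformly quantize angles in [-pi, pi) to 2**bits levels."""
    levels = 2 ** bits
    step = 2 * np.pi / levels
    codes = np.round((angles + np.pi) / step).astype(int) % levels
    return codes, codes * step - np.pi    # (integer codes, dequantized angles)

v = np.array([1.0, 2.0, 3.0, 4.0])
norm, angles = recursive_polar(v)
codes, approx = quantize_angles(angles)
```

Only the integer `codes` (plus one norm per vector) would need to be stored; a real scheme would use a non-uniform quantizer matched to the angles' analytical distribution rather than the uniform grid shown here.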
TurboQuant Animated: Watch Vector Quantization Happen

Interactive 2D and 3D animations showing every step of TurboQuant: normalize, rotate, quantize, reconstruct. Add your own points and see the compression error in real time.
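The four animated steps (normalize, rotate, quantize, reconstruct) can be sketched for 2-D points as follows. This is a minimal illustration under my own assumptions: a shared random rotation and a uniform per-component grid, which need not match what TurboQuant actually does.

```python
import numpy as np

rng = np.random.default_rng(0)

def compress(points, bits=3):
    """Sketch of the animation's four steps for 2-D points."""
    # 1. normalize: project each point onto the unit circle, keep its norm
    norms = np.linalg.norm(points, axis=1, keepdims=True)
    unit = points / norms
    # 2. rotate: apply one shared random rotation to all points
    theta = rng.uniform(0, 2 * np.pi)
    R = np.array([[np.cos(theta), -np.sin(theta)],
                  [np.sin(theta),  np.cos(theta)]])
    rotated = unit @ R.T
    # 3. quantize: snap each component to a uniform grid over [-1, 1]
    levels = 2 ** bits
    codes = np.clip(np.round((rotated + 1) / 2 * (levels - 1)), 0, levels - 1)
    # 4. reconstruct: dequantize, undo the rotation, restore the norm
    deq = codes / (levels - 1) * 2 - 1
    return (deq @ R) * norms

pts = rng.normal(size=(100, 2))
rec = compress(pts)
err = np.linalg.norm(pts - rec) / np.linalg.norm(pts)  # relative error
```

Raising `bits` shrinks `err`, which is the compression error the animation shows in real time.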

I like the visualization, but I don't understand the grid quantization. If every point is on the unit circle, aren't all the grid cells near the center unused?
I think the grid can be the surface of the unit sphere.
The way I understand it, it's a way of compressing vectors by switching from their per-component representation to a polar-coordinate representation, where nearby vectors are clumped together onto a single direction, allowing them to be described by their different lengths.
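That intuition can be shown in a few lines: two vectors pointing the same way get the same quantized angle code and differ only in their stored length. The uniform angle quantizer here is my own simplification, not the method from the post.

```python
import numpy as np

def to_polar(v):
    """2-D Cartesian -> (length, angle)."""
    return np.linalg.norm(v), np.arctan2(v[1], v[0])

def quantize_angle(theta, bits=6):
    """Snap an angle in [-pi, pi) to one of 2**bits codes."""
    levels = 2 ** bits
    step = 2 * np.pi / levels
    return int(np.round((theta + np.pi) / step)) % levels

a = np.array([3.0, 4.0])
b = np.array([6.0, 8.0])          # same direction, twice the length
(ra, ta), (rb, tb) = to_polar(a), to_polar(b)

# One shared angle code describes the common direction...
assert quantize_angle(ta) == quantize_angle(tb)
# ...while only the lengths (ra = 5.0, rb = 10.0) tell them apart.
```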