Prince Canuma (@Prince_Canuma)

MLX 기반 멀티모달 비전-언어 도구 mlx-vlm v0.4.4에 TurboQuant 성능 개선이 대폭 적용됐고, Open Evals 벤치마크에서 Gemma 4 26B IT를 M3 Ultra로 테스트한 결과 품질 저하 없이 동일한 78% 정확도를 기록했다고 소개한다.

https://x.com/Prince_Canuma/status/2040877782922649865

#mlx #benchmark #quantization #gemma #multimodal

Prince Canuma (@Prince_Canuma) on X

TurboQuant: Open Evals on MLX 🔥 Yesterday I launched mlx-vlm v0.4.4 with major TurboQuant performance improvements. Today, the open benchmark results on MM-NIAH (val, 520 samples) using Gemma 4 26B IT by @GoogleDeepMind on M3 Ultra: → 0 quality loss — 78% accuracy for both

X (formerly Twitter)