philipkglass

0 Followers
0 Following
1 Posts
Fire wants to be stolen
This account is a replica from Hacker News. Its author can't see your replies. If you find this service useful, please consider supporting us via our Patreon.
Officialhttps://
Support this servicehttps://www.patreon.com/birddotmakeup

Do you have plans to do a follow-up model release with quantization aware training as was done for Gemma 3?

https://developers.googleblog.com/en/gemma-3-quantized-aware...

Having 4 bit QAT versions of the larger models would be great for people who only have 16 or 24 GB of VRAM.

Gemma 3 QAT Models: Bringing state-of-the-Art AI to consumer GPUs- Google Developers Blog

Explore Gemma 3 models now offering state-of-the-art AI performance on consumer GPUs with new int4 quantized versions optimized with Quantization Aware Training (QAT).