Artificial Analysis (@ArtificialAnlys)

Google has released Gemini 3.1 Flash Live Preview, which ranks #2 on the Big Bench Audio speech-to-speech model benchmark. It also adds configurable thinking levels; with the thinking level set to high, it scores 95.9% on Big Bench Audio.

https://x.com/ArtificialAnlys/status/2037195442489090485

#google #gemini #llm #speechtospeech #benchmark
