Design Arena (@Designarena)
Audio Arena 리더보드가 업데이트되어 음성-음성(speech-to-speech) 모델 상위 3개를 공개했다. 1위는 Ultravox v0.7, 2위는 Gemini 2.5 Flash Audio, 3위는 Grok Realtime이며, 오픈소스 6개 멀티턴 벤치마크로 평가했다고 밝혔다.
https://x.com/Designarena/status/2041334891854565743
#audiomodels #benchmark #speechtospeech #opensource #leaderboard

Design Arena (@Designarena) on X
Audio Arena Leaderboard Update! Congrats to the top 3 speech-to-speech models: - #1 Ultravox v0.7 by @ultravox_dot_ai - #2 Gemini 2.5 Flash Audio by @GoogleDeepMind - #3 Grok Realtime by @xai We evaluated each model on our open source suite of 6 static multi-turn benchmarks