Scale AI launches Voice Showdown, the first global benchmark for voice AI using real human preference data across 60+ languages. Gemini leads Dictate mode; GPT-4o Audio and Gemini tie in Speech-to-Speech. The benchmark reveals GPT Realtime 1.5 mismatches language around 20% of the time. https://venturebeat.com/data/scale-ai-launches-voice-showdown-the-first-real-world-benchmark-for-voice-ai #AIagent #AI #GenAI #VoiceAI #Scale

Brie Wensleydale (@SlipperyGem)

์Œ์„ฑ ํ•ฉ์„ฑ(TTS)์˜ ๊ฐ์ • ์ œ์–ด ์„ฑ๋Šฅ์ด ๋งค์šฐ ์ข‹๊ณ , ๋…ธ๋ž˜๊นŒ์ง€ ๊ฐ€๋Šฅํ•œ ๊ธฐ๋Šฅ์„ ์–ธ๊ธ‰ํ•˜๋ฉฐ ํ–ฅํ›„ Xiaomi ๊ณ„์—ด ๋ชจ๋ธ๋“ค์— ๋Œ€ํ•œ ๊ธฐ๋Œ€๊ฐ์„ ๋“œ๋Ÿฌ๋‚ธ๋‹ค. ํ˜์‹ ์ ์ธ TTS ๊ธฐ๋Šฅ๊ณผ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์Œ์„ฑ ์‘์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ฃผ๋Š” ๋ฐ˜์‘ํ˜• ํŠธ์œ—์ด๋‹ค.

https://x.com/SlipperyGem/status/2034769970228601070

#tts #voiceai #multimodal #xiaomi #speech

Brie Wensleydale๐Ÿง€๐Ÿญ (@SlipperyGem) on X

Wow, that's seriously good emotion control. The TTS can sing too! Getting my hopes up for the various Xiaomi models in future.

X (formerly Twitter)

TestingCatalog News (@testingcatalog)

Microsoft๊ฐ€ Copilot Voice Mode์— ์ƒˆ Portrait ์•„๋ฐ”ํƒ€๋ฅผ ์ถ”๊ฐ€ํ•  ์ค€๋น„๋ฅผ ํ•˜๊ณ  ์žˆ๋‹ค. Sage์™€ Pax๋ผ๋Š” ์•„๋ฐ”ํƒ€๊ฐ€ ์–ธ๊ธ‰๋˜์–ด, ์Œ์„ฑ ๊ธฐ๋ฐ˜ Copilot ๊ฒฝํ—˜์„ ๋” ์‹œ๊ฐ์ ์œผ๋กœ ํ™•์žฅํ•˜๋ ค๋Š” ์—…๋ฐ์ดํŠธ๋กœ ๋ณด์ธ๋‹ค.

https://x.com/testingcatalog/status/2034398658603442425

#microsoft #copilot #voiceai #avatar #aiproduct

TestingCatalog News ๐Ÿ—ž (@testingcatalog) on X

Microsoft is preparing to release new Portrait avatars for Copilot Voice Mode! Sage and Pax ๐Ÿ‘€

X (formerly Twitter)

Customer support is evolving into a scalable, AI-driven system.

With chatbots, voice agents, and automation, businesses can:
โ€ข Improve response time
โ€ข Reduce operational costs
โ€ข Deliver consistent experiences

Aisa-X enables a hybrid model where AI handles scale and humans handle complexity.

Click : https://aisa-x.ai/ai-live-chat-customer-support/

#ArtificialIntelligence #CustomerSupport #Automation #VoiceAI #Chatbots #OpenTech #SaaS #FutureOfWork #AisaX #CustomerExperience

Mati Staniszewski (@matiii)

ElevenLabs๊ฐ€ ํด๋ž€๋“œ ๋ฐ”๋ฅด์ƒค๋ฐ”์—์„œ Summit ๊ฐœ์ตœ๋ฅผ ์˜ˆ๊ณ ํ–ˆ๋‹ค. แƒฅแƒ•แƒ”แƒงแƒœแƒ˜แƒก ๋Œ€ํ‘œ์  ์ƒ์ง•์  ์žฅ์†Œ์—์„œ ์—ด๋ฆฌ๋Š” ํ–‰์‚ฌ๋กœ, ElevenLabs์˜ ๊ธ€๋กœ๋ฒŒ ์ปค๋ฎค๋‹ˆํ‹ฐยท์Œ์„ฑ AI ์ƒํƒœ๊ณ„ ํ™•์žฅ์„ ๋ณด์—ฌ์ฃผ๋Š” ์†Œ์‹์ด๋‹ค.

https://x.com/matiii/status/2034316709482360943

#elevenlabs #summit #voiceai #event

Mati Staniszewski (@matiii) on X

The ElevenLabs Summit is coming to Warsaw - and to one of the most iconic venues in the country.

X (formerly Twitter)

๐ŸŸฆ Cloning my voice with ElevenLabs and Power Automate

I built a workflow that clones my voice and saves audio to SharePoint. I walk through ElevenLabs setup API key entry and voice model training. I debugged errors and confirmed a successful flow run. ๐Ÿš€

๐Ÿ’ก Voice cloning setup steps
๐Ÿ” Power Automate SharePoint flow
โš–๏ธ Troubleshooting and successful run

โ–ถ๏ธŽ https://www.hubsite365.com/en-ww/citizen-developer/?id=39fcf771-3a21-f111-8341-000d3a474810&topic=8daf8386-bb75-ea11-a811-000d3a210788&theater=true

#PowerAutomate #ElevenLabs #VoiceAI #Automation

๐ŸŽ™๏ธ Expired .FM domains are now LIVE at auction with Namecheap.

Browse & bid โ†’ https://Get.fm/auction

Great brands donโ€™t always start new โ€” sometimes you win them.

Perfect for:
๐ŸŽง Podcasts
๐Ÿค– AI voice agents
๐Ÿ“ก Streaming platforms
๐Ÿš€ Audio startups
Own the signal. ๐Ÿ“ก

#dotFM #DomainAuctions #AIAudio #Podcasting #VoiceAI #DomainNames #Domain #DNS

Mark Gadala-Maria (@markgadala)

์‚ฌ๋žŒ๋“ค์ด ๋ฐ˜๋ ค๋™๋ฌผ์—๊ฒŒ ๋ชฉ์†Œ๋ฆฌ๋ฅผ ๋ถ€์—ฌํ•˜๋Š” ์ƒˆ๋กœ์šด AI ํŠธ๋ Œ๋“œ๊ฐ€ ๋“ฑ์žฅํ–ˆ์œผ๋ฉฐ, ์ž‘์„ฑ์ž๋Š” ์ด๋ฅผ ์ž ์žฌ์ ์œผ๋กœ 1์–ต ๋‹ฌ๋Ÿฌ ๊ทœ๋ชจ์˜ ์•ฑ์œผ๋กœ ์„ฑ์žฅํ•  ์ˆ˜ ์žˆ๋Š” ์ƒ์—…์  ๊ธฐํšŒ๋กœ ๋ณด๊ณ  ์žˆ์Œ์„ ์ง€์ ํ•˜๋Š” ํŠธ์œ—(๋ฐ˜๋ ค๋™๋ฌผ ์Œ์„ฑ ํ•ฉ์„ฑ/์Œ์„ฑํ™” ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์˜ ๊ฐ€๋Šฅ์„ฑ ๊ฐ•์กฐ).

https://x.com/markgadala/status/2032391717870354500

#voiceai #generativeaudio #pettech #startup #ai

Amazon adds 'Sassy' personality to Alexa+ with profanity and attitude

Amazon launches Sassy voice style for Alexa+ with profanity and attitude, requiring security checks and child-safety safeguards.

The Daily Perspective
Fish Audio launches S2-Pro, a new TTS model with absurdly controllable emotion. The Dual-AR system pairs a 4B parameter language model with a 400M acoustic model for high-fidelity 44.1kHz audio. Supports zero-shot voice cloning from 10-30 second clips and inline emotional tags. Achieves sub-150ms latency on NVIDIA H200. https://www.marktechpost.com/2026/03/10/fish-audio-releases-fish-audio-s2-a-new-generation-of-expressive-text-to-speech-tts-with-absurdly-controllable-emotion/ #AIagent #AI #GenAI #VoiceAI #FishAudio