Теперь silero-tts v5 на русском языке умеет задавать вопросы
Мы недавно писали про обновление нашего публичного синтеза, silero-tts . В прошлый раз мы существенно увеличили скорость, качество и добавили поддержку омографов. В этот раз мы хотим вас порадовать особенной фичей, которая в большинстве случаев стабильно не работает даже в моделях синтеза, которые требуют для своей работы на 3-4 порядка больше вычислительных ресурсов и современные серверные видеокарты (наш синтез запускается даже на слабых процессорах). Как вы догадались, эта фича — это постановка вопросов . Хочу послушать вопросы
https://habr.com/ru/articles/1015942/
#silero #синтез_речи #tts #texttospeech #нейросети #синтезатор_речи #русский_язык #ударение #омографы #вопросы
Stop Robotic AI🚀 Transform Any Text To Human Voice In Seconds!
Most AI voices sound like a GPS from 2010. 🤖
We’ve all been there. You’re watching a potentially great video, but the moment the voiceover starts, you cringe. It’s that robotic, stuttering, “GPS-style” voice that immediately screams low quality.
For years, creators were stuck in a catch-22: either pay $200+ per script for a professional voice actor on Fiverr or spend your entire weekend recording and re-recording your own voice, only to end up with background noise and “umms.”
But a quiet shift is happening in the industry. We just found the tool that’s changing everything for content creators. Imagine professional, studio-quality voiceovers in any language, generated in seconds. No more expensive freelancers, no more ‘umms’ and ‘ahhs’, and no more robotic monotone.
A new AI technology is finally crossing the “uncanny valley,” and the results are indistinguishable from a human in a professional studio.
https://www.nbloglinks.com/stop-robotic-ai-transform-any-text-to-human-voice-in-seconds/
#texttospeech #aitexttospeech #humanvoiceover #professionalvoiceover #software #AI #AIsoftware #AItools
田中義弘 | taziku CEO / AI × Creative (@taziku_co)
GPU 없이도 동작하는 오픈소스 음성 합성 모델 Kitten TTS V0.8이 소개됐다. 최소 14M 파라미터, 25MB 미만의 경량 TTS로 CPU 실행이 가능하며, 표현력도 높다. 스마트폰, 장난감, 차량용 등 엣지 디바이스 배포 가능성이 큰 주목할 만한 기술이다.
Three new Kitten TTS models – smallest less than 25MB
https://github.com/KittenML/KittenTTS
#HackerNews #KittenTTS #KITTENML #TextToSpeech #AIModels #SmallModels
Remember when computer-generated voices and virtual pop idols were cool and cute and a completely new and exciting music genre and not the constant background noise of our horrifying computerized dystopia? Pepperidge Farm remembers. Man this is a bop, even a decade and a half later.
https://www.youtube.com/watch?v=duPJqfKiA78
#hatsunemiku #vocaloid #texttospeech #baka #triplebaka #music #synthesizer #electro #jpop
Fish Audio has open-sourced S2, a #texttospeech model that supports fine-grained inline control of prosody and emotion using natural-language tags like [laugh], [whispers], and [super happy]
#Business #Guides
Your browser can already speak a page · How to activate read-aloud features on web pages https://ilo.im/16b5hy
_____
#Reading #Audio #Accessibility #TextToSpeech #Text #Content #Webpages #Browsers

Users can customize the features built into the browser, something not often available from third-party approaches. Is an “AI” company offering to provide spoken versions of your pages for users? Is an overlay company promising to make your content more accessible by its overlay speaking it? Is some other vendor…
Google AI Studio — The Only App Builder You’ll Ever Need
