Mastodawn

Akshay (@akshay_pachaar)

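A new 100% open-source TTS model that allows word-level control of speech has been released. Existing TTS systems were limited in that a tone instruction shifted the whole sentence, but this model lets you assign emotion and intonation to specific words or spans within a sentence, enabling fine-grained voice direction.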

https://x.com/akshay_pachaar/status/2033922460551418268

#tts #speechsynthesis #opensource #controllability

Akshay 🚀 (@akshay_pachaar) on X

Finally, you can control speech word by word. (Using a new 100% open-source TTS model) Every TTS system before this had the same core limitation. You'd say "speak in an angry tone" and the whole sentence shifted. There was no way to say "be calm here, then laugh right at this