Success!! I've completed a first reimplementation of Dr. Sbaitso voice synthesis in Godot!

Try it out at Itchio: https://eibriel.itch.io/scp-079-voice

Ive configure the accessibility settings so the app properly describes the input fields and buttons, but for some reason is not working for me (tested on Linux with Orca). Let me know if it works for you!

#TTS #DrSbaitso #VoiceSynthesis #TextToSpeech #079 #SCP079 #SCP #Godot

Some improvements to the concatenation, prosody is still missing.

Here is a well known phrase by SCP 079.

The audio contains the same phrase first performed by Dr. Sbaitso TTS and the by Godot reimplementation.

#TTS #DrSbaitso #VoiceSynthesis #TextToSpeech #079 #SCP079 #SCP #Godot

Dr. Sbaitso compared to my reimplementation in Godot (Sbaitso first)  

Implemented: basic waveform concatenation
Missing: Interpolation, pitch control, prosody, text to phonemes

Im very happy with the progress, will be great to be able to run the voice without needing emulation.

#TTS #DrSbaitso #VoiceSynthesis #TextToSpeech #079 #SCP079

What I've learned so far while reverse engineering Dr Sbaitso's voice:
- Reverse engineering is hard

Also, the voice was made by very clever people. It's optimized to sound as good as possible, while consuming very few resources.

Progress after 5 days: 10%

#TTS #DrSbaitso #VoiceSynthesis #TextToSpeech

Primera prueba del sintetizador de voz por difonos hecho en Godot.

Tiene un millón de problemas, grabé la voz así nomás.

#Godot #TTS #VoiceSynthesis #SintesisDeVoz #Capusotto

Building a Diphone TTS engine in Godot for no reason at all.

#TTS #Godot #Linguistics #VoiceSynthesis

Hands on with AI audio generation: GAI voice, music, and sound effects

This is the second post in a series exploring the multimodal possibilities of generative AI. This series will take a detailed, hype-free look at text, image, audio, video, and code generation and explore the creative potential as well as the ethical concerns of GAI. Although Generative AI isn't a new technology, it's definitely been having a hype moment since the release of ChatGPT in November 2022. Unfortunately, the focus has been squarely on the text-based chatbot at the exclusion of […]

https://leonfurze.com/2023/09/25/hands-on-with-ai-audio-generation-gai-voice-music-and-sound-effects/

Germany gets a new AI call assistant from Deutsche Telekom that works straight from the cellular network—no app required. Powered by ElevenLabs’ voice synthesis, it can translate languages on the fly. Unveiled at Mobile World Congress, it shows how open‑source‑friendly AI can reshape everyday calls. Curious how it works? #MagentaAI #DeutscheTelekom #ElevenLabs #VoiceSynthesis

🔗 https://aidailypost.com/news/magenta-ai-call-assistant-launches-germany-no-app-needed

Qwen3-TTS ra mắt với độ trễ siêu thấp chỉ 97ms, hỗ trợ nhân bản giọng nói và API tương thích OpenAI. Công nghệ tổng hợp giọng nói tiên tiến, lý tưởng cho ứng dụng thời gian thực. #Qwen3TTS #VoiceSynthesis #AI #TextToSpeech #TríTuệNhânTạo #TTS #OpenAI

https://www.reddit.com/r/ollama/comments/1qlzbwk/release_qwen3tts_ultralow_latency_97ms_voice/

ElevenLabs appoints Karthik Rajaram as India Country Head to accelerate AI voice growth. His leadership will boost multilingual audio, voice synthesis and conversational AI for creators and brands across the Indian market. Discover how this move could reshape digital content creation. #AIvoice #VoiceSynthesis #ElevenLabs #MultilingualAudio

🔗 https://aidailypost.com/news/elevenlabs-names-karthik-rajaram-india-country-head-power-ai-voice