Here's what the raw data of the unit selection db file for SpeakEasy sounds like. It gives you a good idea on how unit selection synthesis works.
@rommix0 If my theory about the engine using MBROLA as a backend is correct, what we are actually hearing are the building blocks for a diphone based voice. When I listened to the clip you posted, it sounded familiar, and after listening to the clip of the synth itself, I remembered listening to the raw data of EN1 a while back.
@datajake1999 in a way, yeah. It's like MBROLA, but not actually MBROLA since speakeasy is proprietary.
@rommix0 @datajake1999 Yeah I definitely recognise that as diphone based synthesis.
@rommix0 is that 8-bit linear PCM?
@BorrisInABox nah, it's 16 bit. I'm sure the data has subheaders.
@rommix0 Wow, really? Sounds very 8-bit or less in that clip.
@rommix0 i think it's diphone, specifically, the embrola en1 database
@spacepup Nah. en1 is english but with a foreign speaker. The speaker used for SpeakEasy is not a foreigner.
@rommix0 sounds like a fucked up vocal warmup