have been exploring retrieval-based voice conversion today. i would like to train a model on my own voice, as while it is fun to transform things into other voices (though i am more interested in the timbral adjustment rather than just using another voice), this would be a great tool to generate vocal pad backing tracks, harmonies, or even lead vocals when i'm unable to provide them (tts -> tune/time in melodyne -> RVC model of my voice).
update: flawless victory. input is some saw waves through formant filters. model is just some quick recordings of me reading the harvard sentences lol
@msx Whoa, this is really cool! I need to look into this myself.