Cross-lingual Voice Conversion
I wish I could speak many languages. Wait. Actually, I do, but only four or five, and with limited proficiency. Instead, can I create a voice model that can copy any voice in any language? Possibly! A while ago, my colleague Dabi and I opened a simple voice conversion project. Building on it, I extended the idea to work across languages. With my limited knowledge, I found this very challenging. Unfortunately, the results I have so far are not good, but hopefully they will be helpful to some people.
(To see what PPGs are, consult this)
Run python train1.py for the phoneme recognition model.
Run python train2.py for the speech synthesis model.
Run python convert.py and check the generated samples in the 50lang-output folder.
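
The three commands above can also be chained from one small driver script. This is only a minimal sketch, not part of the project: the names STEPS, build_commands, and run_pipeline are hypothetical, and it assumes the scripts are run from the repository root, need no command-line arguments, and should stop the chain as soon as one of them fails.

```python
import subprocess
import sys

# Script names come from the steps above; running them in this order
# (recognition model, synthesis model, conversion) is an assumption.
STEPS = ["train1.py", "train2.py", "convert.py"]


def build_commands(python=sys.executable):
    """Return one argv list per step, using the given Python interpreter."""
    return [[python, script] for script in STEPS]


def run_pipeline(runner=subprocess.run):
    """Run each step in order; check=True raises if a step exits non-zero."""
    for cmd in build_commands():
        runner(cmd, check=True)
```

Calling run_pipeline() from the repository root runs all three steps; the generated samples should then appear in the 50lang-output folder, as described above.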