PyTorch Implementation of PortaSpeech: Portable and High-Quality Generat...
A Non-Autoregressive Transformer-based Text-to-Speech, supporting a fami...
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-...
PyTorch Implementation of Non-autoregressive Expressive (emotional, conv...
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallo...
🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregres...
PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text...
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Bas...
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting...
Reparameterized Discrete Diffusion Models for Text Generation
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based...
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non...
A length-controllable and non-autoregressive image captioning model.
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinemen...