VITS: Conditional Variational Autoencoder with Adversarial Learning for ...
A sound cloning tool with a web interface, using your voice or any sound...
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目...
🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhan...
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS...
A fast, local neural text to speech system
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion a...
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art...
High-quality multi-lingual text-to-speech library by MyShell.ai. Support...
An Open Source text-to-speech system built by inverting Whisper.
Foundational model for human-like, expressive TTS
A TensorFlow implementation of Google's Tacotron speech synthesis with p...
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthes...
An unofficial PyTorch implementation of the audio LM VALL-E