OpenAI Whisper ASR Webservice API
Free, easy, portable audio engine for games
DELTA is a deep-learning-based natural language and speech processing pl...
Multilingual Automatic Speech Recognition with word-level timestamps and...
Community list of startups working with AI in audio and music technology
Praat: Doing Phonetics By Computer
Voice Recognition to Text Tool / A local speech-to-text service that runs offline,...
The neural network model is capable of detecting five different male/fem...
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Videos, notes and experiments to understand deep learning
A Python wrapper for Kaldi
Papers in the field of natural language processing (with reading notes), reproduced models, data processing, etc. (code...
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including offic...
General Speech Restoration