A PyTorch-based Speech Toolkit
Reading list for research topics in multimodal machine learning
Neural building blocks for speaker diarization: speech activity detectio...
Foundation Architecture for (M)LLMs
WaveNet vocoder
PyTorch implementation of convolutional neural networks-based text-to-sp...
Multilingual Automatic Speech Recognition with word-level timestamps and...
A curated list of awesome Speaker Diarization papers, libraries, dataset...
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Te...
SincNet is a neural architecture for efficiently processing raw audio sa...
Open source audio annotation tool for humans
General Speech Restoration
A neural network for end-to-end speech denoising
Tensorflow 2.x implementation of the DTLN real time speech denoising mod...