Best 100 Speech Processing Open Source Projects

A PyTorch-based Speech Toolkit

Reading list for research topics in multimodal machine learning

Neural building blocks for speaker diarization: speech activity detectio...

Foundation Architecture for (M)LLMs

WaveNet vocoder

PyTorch implementation of convolutional neural networks-based text-to-sp...

Multilingual Automatic Speech Recognition with word-level timestamps and...

Multilingual Automatic Speech Recognition with word-level timestamps and...

A curated list of awesome Speaker Diarization papers, libraries, dataset...

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Te...

SincNet is a neural architecture for efficiently processing raw audio sa...

Open source audio annotation tool for humans

General Speech Restoration

A neural network for end-to-end speech denoising

Tensorflow 2.x implementation of the DTLN real time speech denoising mod...