PyTorch-based speech enhancement toolkit.
mayavoz is a PyTorch-based open-source toolkit for speech enhancement. It is designed to save time for audio practitioners and researchers. It provides easy-to-use pretrained speech enhancement models and supports highly customisable model training.
| Quick Start | Installation | Tutorials | Available Recipes | Demo
Noisy speech followed by the enhanced version.
from mayavoz.models import Mayamodel
model = Mayamodel.from_pretrained("shahules786/mayavoz-waveunet-valentini-28spk")
model.enhance("noisy_audio.wav")
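To apply the same call to a whole folder of recordings, a small wrapper can loop over the files. The sketch below is illustrative only: `enhance_directory` is a hypothetical helper, not part of mayavoz, and it assumes only what the snippet above shows, namely that `enhance` accepts a path to an audio file. Whatever `enhance` returns is collected unchanged.

```python
from pathlib import Path
from typing import Any, Callable, Dict


def enhance_directory(
    enhance: Callable[[str], Any],
    directory: str,
    pattern: str = "*.wav",
) -> Dict[str, Any]:
    """Call `enhance` on every file in `directory` matching `pattern`.

    Hypothetical helper (not a mayavoz API). `enhance` is any callable
    that takes a file path, such as the `model.enhance` method above.
    Returns a dict mapping each file name to whatever `enhance` returned.
    """
    results: Dict[str, Any] = {}
    # Sort so files are processed in a stable, reproducible order.
    for path in sorted(Path(directory).glob(pattern)):
        results[path.name] = enhance(str(path))
    return results
```

With a loaded model, this would be used as `enhance_directory(model.enhance, "noisy_clips")`, where `noisy_clips` is a placeholder directory name. Passing the method as a callable keeps the helper independent of any particular model class.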
Model | Dataset | STOI | PESQ | URL |
---|---|---|---|---|
WaveUnet | Valentini-28spk | 0.836 | 2.78 | shahules786/mayavoz-waveunet-valentini-28spk |
Demucs | Valentini-28spk | 0.961 | 2.56 | shahules786/mayavoz-demucs-valentini-28spk |
DCCRN | Valentini-28spk | 0.724 | 2.55 | shahules786/mayavoz-dccrn-valentini-28spk |
Demucs | MS-SNSD-20hrs | 0.56 | 1.26 | shahules786/mayavoz-demucs-ms-snsd-20 |
Test scores are computed on the test set associated with each model's training dataset.
See the tutorials to train your own custom model.
Only Python 3.8+ is officially supported (though it might work with Python 3.7).
With pip:
pip install mayavoz
With conda:
conda env create -f environment.yml
conda activate mayavoz
From source:
git clone url
cd mayavoz
pip install -e .
For commercial enquiries and scientific consulting, please contact me.
Sincere gratitude to AMPLYFI for supporting this project.