DiffWave Vocoder Save

Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.

Project README

This is a reimplementaion of the neural vocoder in DIFFWAVE: A VERSATILE DIFFUSION MODEL FOR AUDIO SYNTHESIS.

Usage:

To continue training the model, run python distributed_train.py -c config_${channel}.json, where ${channel} can be either 64 or 128.
To retrain the model, change the parameter ckpt_iter in the corresponding json file to -1 and use the above command.
To generate audio, run python inference.py -c config_${channel}.json -cond ${conditioner_name}. For example, if the name of the mel spectrogram is LJ001-0001.wav.pt, then ${conditioner_name} is LJ001-0001. Provided mel spectrograms include LJ001-0001 through LJ001-0186.
Note, you may need to carefully adjust some parameters in the json file, such as data_path and batch_size_per_gpu.

Pretrained models and generated samples:

Open Source Agenda is not affiliated with "DiffWave Vocoder" Project. README Source: philsyn/DiffWave-Vocoder

Stars

Open Issues

Last Commit

3 years ago

Repository

philsyn/DiffWave-Vocoder

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/diffwave-vocoder"><img src="https://www.opensourceagenda.com/projects/diffwave-vocoder/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022