FFTNet Save

A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

Project README

FFTNet

A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

Quick Start:

Install requirements:

pip install -r requirements.txt

Download dataset:

wget http://festvox.org/cmu_arctic/cmu_arctic/packed/cmu_us_slt_arctic-0.95-release.tar.bz2
tar xf cmu_us_slt_arctic-0.95-release.tar.bz2

Extract features:

python preprocess.py

Training with default hyperparams:

python train.py

Synthesize from model:

python generate.py --checkpoint=/path/to/model --lc_file=/path/to/local_conditon

TODO:

Test and modify conditonal sampling.
Fast generation. (Without fast generation, it generates about 230 samples per second with a free TITAN Xp.)
Post-synthesis denoising.

Notes:

This is not offical implementation, some details are different from the paper.
Now the generated speech from this repo is not so good as wavenet vocoder.
Work in progress.

Open Source Agenda is not affiliated with "FFTNet" Project. README Source: syang1993/FFTNet

Stars

Open Issues

Last Commit

5 years ago

Repository

syang1993/FFTNet

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/fftnet"><img src="https://www.opensourceagenda.com/projects/fftnet/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022