Tf Flowavenet Save

Tensorflow implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Project README

FloWaveNet : A Generative Flow for Raw Audio

Unofficial tensorflow implementation of the paper "FloWaveNet : A Generative Flow for Raw Audio".

Requirements

Python 3.5
tensorflow 1.12
Librosa

How to use

Download the LJ-Speech dataset and unpack it:

>>> tar -xvf LJSpeech-1.1.tar.bz2

Preprocess dataset using the following command:

>>> python3 preprocessing.py --in_dir=LJSpeech-1.1 --out_dir=training_data

Run training:

>>> python3 train.py

Features

Implemented Multig-gpu training
Added Global condition features
Mixed precision training

With mixed precision training (enabled by default) the model can be trained for 7.5 days on a single GPU with 11Gb RAM. To use float32 training set dtype=tf.float32 and scale=1. in hparams.py.

Several examples of synthesis can be found here.

Todo list

Learning rate and batch size tuning for efficient multi-GPU training

Reference

Official pytorch implementation: https://github.com/ksw0306/FloWaveNet

Open Source Agenda is not affiliated with "Tf Flowavenet" Project. README Source: ryhorv/tf-flowavenet

Stars

Open Issues

Last Commit

5 years ago

Repository

ryhorv/tf-flowavenet

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/tf-flowavenet"><img src="https://www.opensourceagenda.com/projects/tf-flowavenet/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022