TPGST Tacotron Save

Google's TPGST reimplementation.

Project README

TPGST reimplementation with pytorch

Paper: PREDICTING EXPRESSIVE SPEAKING STYLE FROM TEXT IN END-TO-END SPEECH SYNTHESIS

Prerequisite

python 3.7
pytorch 1.3
librosa, scipy, tqdm, tensorboardX

Dataset

KSS, Korean female single speaker speech dataset.

Samples

Post

Usage

Download the above dataset and modify the path in config.py. And then run the below command.
```
python prepro.py
```
The model needs to train 100k+ steps
```
python train.py <gpu_id>
```
After training, you can synthesize some speech from text.
```
python synthesize.py <gpu_id> <model_path>
```
To listen your samples, you may need mel2wav vocoder. I didn't include vocoder in this repo.

Notes

I think the difference between baseline Tacotron and TPGST is small on KSS dataset.
I will be doing more experiminets soon.

Open Source Agenda is not affiliated with "TPGST Tacotron" Project. README Source: Yangyangii/TPGST-Tacotron

Stars

Open Issues

Last Commit

4 years ago

Repository

Yangyangii/TPGST-Tacotron

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/tpgst-tacotron"><img src="https://www.opensourceagenda.com/projects/tpgst-tacotron/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022