ISTFT Avocodo Pytorch Save

Ultrafast GAN based Vocoder for Text to Speech

Project README

iSTFT Avocodo Vocoder

iSTFT's generator and Avocodo's discriminators. Use faster branch for ultra fast training and inference compared to original hifi-gan.

Training :

python train.py --config config_v1.json

Notes :

  • Two of iSTFT Generator version can be used with Avocodo discriminaators.
  • Losses should similar to HiFi-GAN.
  • faster branch of this repo is 50 % speed up (0.33 s/b) in training and 60 % improvement in inference speed.
  • Quality is not as good as Avocodo and ISTFTNet.

References :

Citations :

@inproceedings{kaneko2022istftnet,
title={{iSTFTNet}: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform},
author={Takuhiro Kaneko and Kou Tanaka and Hirokazu Kameoka and Shogo Seki},
booktitle={ICASSP},
year={2022},
}
@misc{https://doi.org/10.48550/arxiv.2206.13404,
  doi = {10.48550/ARXIV.2206.13404},
  
  url = {https://arxiv.org/abs/2206.13404},
  
  author = {Bak, Taejun and Lee, Junmo and Bae, Hanbin and Yang, Jinhyeok and Bae, Jae-Sung and Joo, Young-Sun},
  
  keywords = {Audio and Speech Processing (eess.AS), Artificial Intelligence (cs.AI), Sound (cs.SD), FOS: Electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Computer and information sciences, FOS: Computer and information sciences},
  
  title = {Avocodo: Generative Adversarial Network for Artifact-free Vocoder},
  
  publisher = {arXiv},
  
  year = {2022},
  
  copyright = {arXiv.org perpetual, non-exclusive license}
}
Open Source Agenda is not affiliated with "ISTFT Avocodo Pytorch" Project. README Source: rishikksh20/iSTFT-Avocodo-pytorch
Stars
51
Open Issues
2
Last Commit
1 year ago
License
MIT

Open Source Agenda Badge

Open Source Agenda Rating