TensorFlowTTS Versions

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

v1.8

2 years ago
  • Support Tacotron2/MB-MelGAN for French. See pull request and colab. Many thanks to Samuel Delalez.
  • Integrated with Huggingface Gradio web demo. See pull request.
  • Upgrade from TF 2.3.1 to TF 2.6.0, since some users have confirmed that it works fine.

v1.6.1

2 years ago
  • Fix a bug when loading model_weights in TFAutoModel.

v1.6

2 years ago

Release Notes

v1.1

3 years ago

Release Notes

  • Support German TTS with the Thorsten dataset (#405). Note that users should install german_transliterate.
  • Fix Savable bug in Tacotron2 and FastSpeech/FastSpeech2 (#446)

v0.11

3 years ago

Release Notes

  • Released with TensorFlow 2.3.1
  • Support multi-GPU gradient accumulator.
  • Support HiFi-GAN vocoder.
  • Fix some bugs.

v0.9

3 years ago

Release Notes

  • Supported both TensorFlow 2.2/2.
  • Faster Tacotron-2 training.
  • Stable training for FastSpeech/FastSpeech2/Tacotron-2/MB-MelGAN.
  • Supported English/Chinese/Korean.
  • Supported ParallelWaveGAN.
  • Added C++ inference code.

v0.8

3 years ago

Edit later ...

v0.7

3 years ago

Release Notes

  • First release of TensorFlowTTS.
  • Built against TensorFlow 2.2.

Changelog

  • Apply black formatter.
  • Use pytest as default test runner.

TensorFlowTTS Core

tensorflow_tts.bin

  • Multi-preprocess to calculate mel-spectrogram, f0, energy
  • Add code to calculate mean/std of mel-spectrogram, f0, energy
  • Add code to normalize mel-spectrogram, f0, energy based on its mean/std value
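The mean/std normalization described above can be illustrated with a minimal, standard-library-only sketch. It is not the code in tensorflow_tts.bin; the function names `fit_mean_std` and `normalize` and the toy energy values are hypothetical, and it handles a single scalar feature per frame (such as f0 or energy). For mel-spectrograms the same idea would presumably be applied per mel bin.

```python
import statistics


def fit_mean_std(sequences):
    """Pool every frame from every utterance and return (mean, std).

    Hypothetical helper for illustration; uses the population standard deviation.
    """
    pooled = [value for seq in sequences for value in seq]
    return statistics.fmean(pooled), statistics.pstdev(pooled)


def normalize(seq, mean, std):
    """Shift and scale one utterance's feature sequence with dataset-level statistics."""
    if std == 0:
        raise ValueError("std is zero; the feature is constant across the dataset")
    return [(value - mean) / std for value in seq]


# Toy per-frame energy values for two utterances (made-up numbers).
energy = [[1.0, 2.0, 3.0], [4.0, 5.0]]

mean, std = fit_mean_std(energy)
normalized = [normalize(seq, mean, std) for seq in energy]
```

Computing the statistics once over the whole training set, and reusing them at inference time, keeps the feature scale consistent between training and synthesis.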

tensorflow_tts.config

  • Add configuration for FastSpeech
  • Add configuration for FastSpeech2
  • Add configuration for Tacotron-2
  • Add configuration for MelGAN
  • Add configuration for Multiband-MelGAN

tensorflow_tts.datasets

  • Add dataset abstract based on tf.data
  • Add dataloader for mel-spectrogram
  • Add dataloader for audio

tensorflow_tts.losses

  • Add MultiScale STFT Loss
  • Add Mel-spectrogram Loss

tensorflow_tts.models

  • Add FastSpeech modeling
  • Add FastSpeech2 modeling
  • Add MelGAN modeling
  • Add Multiband-MelGAN modeling
  • Add Tacotron-2 modeling

tensorflow_tts.optimizers

  • Add Adam weight-decay optimizer

tensorflow_tts.processor

  • Add LJSpeech processor for character-based English.

tensorflow_tts.trainers

  • Add base trainer including GanBasedTrainer and Seq2SeqTrainer

tensorflow_tts.utils

  • Add seq2seq dynamic decoder
  • Add cleaner for English text
  • Add group convolution for MelGAN
  • Add batch Griffin-Lim version based on librosa and TensorFlow
  • Add number normalization
  • Add function to detect outliers in a 1D array
  • Add weight-norm layer
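The release note does not say which rule the outlier-detection utility uses. As an illustration only, here is a minimal sketch of one common approach for 1D data, the interquartile-range (IQR) fence. The function name `iqr_outlier_mask`, the factor `k=1.5`, and the sample values are assumptions, not the library's API.

```python
import statistics


def iqr_outlier_mask(values, k=1.5):
    """Return a list of booleans, True where a value falls outside the IQR fences.

    Hypothetical helper: a value is flagged if it lies below Q1 - k*IQR
    or above Q3 + k*IQR.
    """
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v < lower or v > upper for v in values]


# Made-up 1D feature values with one obvious spike at the end.
f0 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 100.0]

mask = iqr_outlier_mask(f0)
cleaned = [v for v, is_bad in zip(f0, mask) if not is_bad]
```

Filtering such spikes before computing mean/std statistics keeps a handful of bad frames from skewing the normalization.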

Notebooks

  • Add notebook for Griffin-Lim (GL) inference
  • Add notebook for converting FastSpeech/FastSpeech2/MelGAN/MB-MelGAN/Tacotron-2 to pb and running inference
  • Add notebook for converting FastSpeech/FastSpeech2/Tacotron-2 to tflite and running inference

Examples

  • Add example for training FastSpeech
  • Add example for training FastSpeech2
  • Add example for training Tacotron-2
  • Add example for training MelGAN
  • Add example for training melgan.stft
  • Add example for training Multiband-MelGAN

Thanks to our Contributors

@erogol @azraelkuan @l4zyf9x @myagues @sujeendran @MokkeMeguru @jaeyoo @dathudeptrai