Coqui Ai TTS Versions

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

v0.20.1

7 months ago

What's Changed

Drop diffusion from XTTS by @erogol in https://github.com/coqui-ai/TTS/pull/3150
Bug fixes and add support for multiple speaker references on XTTS inference by @Edresson in https://github.com/coqui-ai/TTS/pull/3149
Fix XTTS v2.0 training recipe by @Edresson in https://github.com/coqui-ai/TTS/pull/3154

Full Changelog: https://github.com/coqui-ai/TTS/compare/v0.20.0...v0.20.1

v0.20.0

7 months ago

What's Changed

Run make style & re-enable it in CI by @akx in https://github.com/coqui-ai/TTS/pull/3127
XTTS v2.0 by @Edresson in https://github.com/coqui-ai/TTS/pull/3137

Full Changelog: https://github.com/coqui-ai/TTS/compare/v0.19.1...v0.20.0

v0.19.1

7 months ago

What's Changed

Second round of issue fixing for XTTS v1.1 by @WeberJulian in https://github.com/coqui-ai/TTS/pull/3103
fix for issue 3067 by @Aya-AlJafari in https://github.com/coqui-ai/TTS/pull/3109
Bug: self.model_name needed to be initialized. by @vltmedia in https://github.com/coqui-ai/TTS/pull/2983

New Contributors

@vltmedia made their first contribution in https://github.com/coqui-ai/TTS/pull/2983

Full Changelog: https://github.com/coqui-ai/TTS/compare/v0.19.0...v0.19.1

v0.19.0

7 months ago

What's Changed

XTTS v1.1 GPT Trainer by @Edresson in https://github.com/coqui-ai/TTS/pull/3086

Full Changelog: https://github.com/coqui-ai/TTS/compare/v0.18.2...v0.19.0

v0.18.2

7 months ago

What's Changed

Fix XTTS v1.1 by @WeberJulian in https://github.com/coqui-ai/TTS/pull/3096

Full Changelog: https://github.com/coqui-ai/TTS/compare/v0.18.1...v0.18.2

v0.18.1

7 months ago

What's Changed

Bug fix on XTTS v1.1 inference by @Edresson and @WeberJulian in https://github.com/coqui-ai/TTS/pull/3093

Full Changelog: https://github.com/coqui-ai/TTS/compare/v0.18.0...v0.18.1

v0.18.0

7 months ago

What's Changed

XTTS v1.1 by @WeberJulian in https://github.com/coqui-ai/TTS/pull/3089

Full Changelog: https://github.com/coqui-ai/TTS/compare/v0.17.10...v0.18.0

XTTS v1.1

This model is trained on top of XTTS v1 using output masking. During training, we mask the part of the output that serves as the audio prompt and do not compute the loss for that segment. This resolves the hallucination issue seen in v1, where the model repeated the audio prompt.
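As a rough illustration of the output masking described above, the sketch below computes a cross-entropy loss that skips the prompt segment. It is not the XTTS training code: the function name masked_cross_entropy, the prompt_len parameter, and the plain-list inputs are assumptions chosen to keep the example self-contained.

```python
import math


def masked_cross_entropy(logits, targets, prompt_len):
    """Mean cross-entropy over the steps that follow the audio prompt.

    logits: list of per-step score lists (one score per token id)
    targets: list of target token ids, same length as logits
    prompt_len: number of leading steps that belong to the audio prompt
                (hypothetical parameter, used here only for illustration)
    """
    total, count = 0.0, 0
    for step, (scores, target) in enumerate(zip(logits, targets)):
        if step < prompt_len:
            continue  # prompt segment: masked out, contributes no loss
        # Numerically stable log-sum-exp over the step's scores.
        peak = max(scores)
        log_norm = peak + math.log(sum(math.exp(s - peak) for s in scores))
        total += log_norm - scores[target]
        count += 1
    # If every step is masked, there is nothing to average.
    return total / count if count else 0.0
```

Because the prompt steps are skipped entirely, the model's predictions over the prompt have no effect on the loss, so training gives it no incentive to reproduce the prompt audio.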

Changes

Added Japanese
Resolved the hallucination issue (repeating the audio prompt)
Increased expressivity
Added a hash check to control the model version
Added ne_hifigan, a HiFi-GAN variant trained without denoising, because denoising introduced an EQ and compression profile that may be unwanted for some use cases
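The release notes do not say how the hash check works. One common way to pin a model version is to compare a checksum of the downloaded file against a known value, sketched below. The choice of SHA-256 and the names file_sha256 and is_expected_version are assumptions for illustration, not the Coqui TTS API.

```python
import hashlib


def file_sha256(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large checkpoints need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_expected_version(path, expected_hash):
    """True if the file's digest matches the expected one (hypothetical helper)."""
    return file_sha256(path) == expected_hash.lower()
```

A caller could re-download the model whenever is_expected_version returns False, so that a stale local copy is not silently reused.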