PaddlePaddle DeepSpeech Versions

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

r0.1.0

2 years ago

Features

CLI : New Feature

Easy installation with pip: pip install paddlespeech
CLI to quickly explore ASR, TTS, audio classification, speech translation, and punctuation restoration.
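
As a rough illustration of the CLI tasks listed above, the sketch below builds one command line per task and only runs it if the paddlespeech executable is on the PATH. The subcommand and flag names (asr, tts, cls, st, text --task punc, --input, --output) follow the project README and are assumptions for this specific release; the file name input.wav, the sample strings, and the helper names build_command and run are hypothetical. Check paddlespeech <task> --help before relying on any of them.

```python
import shutil
import subprocess


def build_command(task, input_value, extra=None):
    """Return the argv list for one PaddleSpeech CLI task (flag names assumed from the README)."""
    cmd = ["paddlespeech", task, "--input", input_value]
    if extra:
        cmd.extend(extra)
    return cmd


# One example per task named in the release notes. Inputs are placeholders.
EXAMPLES = {
    "asr": build_command("asr", "input.wav"),
    "tts": build_command("tts", "你好，欢迎使用语音合成。", ["--output", "output.wav"]),
    "cls": build_command("cls", "input.wav"),
    "st": build_command("st", "input.wav"),
    "punc": build_command("text", "今天的天气真不错啊你下午有空吗", ["--task", "punc"]),
}


def run(name):
    """Run one example command; return its stdout, or None if the CLI is missing."""
    if shutil.which("paddlespeech") is None:
        print("paddlespeech not found; install it with: pip install paddlespeech")
        return None
    result = subprocess.run(EXAMPLES[name], check=True, capture_output=True, text=True)
    return result.stdout


if __name__ == "__main__":
    # Print the commands instead of running them, so the sketch is safe to execute anywhere.
    for name, cmd in EXAMPLES.items():
        print(name, "->", " ".join(cmd))
```

Calling run("asr") would execute the speech recognition command on input.wav, provided the toolkit and its pretrained models are available locally.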

ASR

Join CTC LM decoder
- paper link
Transformer LM model
Improve DeepSpeech2 online model
Refactor some configs
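
For readers unfamiliar with combining a CTC acoustic model with a language model, the sketch below shows one common scheme: rescoring candidate transcripts with a weighted sum of the CTC log-probability, the LM log-probability, and a word-count bonus. This is an illustrative assumption, not necessarily the decoder shipped in this release; the function names, the weights alpha and beta, and the toy LM table are all hypothetical.

```python
def fused_score(ctc_logp, lm_logp, num_words, alpha, beta):
    """Combine acoustic and LM evidence: ctc + alpha * lm + beta * word count."""
    return ctc_logp + alpha * lm_logp + beta * num_words


def rescore(candidates, lm_logp_fn, alpha=0.5, beta=0.0):
    """Pick the best transcript from (text, ctc_logp) pairs under the fused score."""
    best_text, best_score = None, float("-inf")
    for text, ctc_logp in candidates:
        score = fused_score(ctc_logp, lm_logp_fn(text), len(text.split()), alpha, beta)
        if score > best_score:
            best_text, best_score = text, score
    return best_text, best_score


# Toy language model: a lookup table of sentence log-probabilities (made-up values).
TOY_LM = {"a cat": -2.0, "a cut": -6.0}


def toy_lm(text):
    # Unseen sentences get a low default log-probability.
    return TOY_LM.get(text, -10.0)


# Two hypotheses from a CTC model; the acoustically better one is less plausible text.
CANDIDATES = [("a cat", -1.0), ("a cut", -0.9)]

if __name__ == "__main__":
    print(rescore(CANDIDATES, toy_lm, alpha=0.5))  # LM weight on
    print(rescore(CANDIDATES, toy_lm, alpha=0.0))  # acoustic score only
```

With the LM weight switched on, the more plausible sentence wins even though its acoustic score is slightly lower; with alpha set to 0 the choice falls back to the CTC score alone.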

TTS

Merge Parakeet into PaddleSpeech
Add FastSpeech2-Conformer
- paper link: FastSpeech2, Conformer
- example link
Add Multi Band MelGAN
- paper link
- example link
Add HiFiGAN
- paper link
- example link
Add Style MelGAN
- paper link
- example link
Add FastSpeech2 Voice Cloning with GE2E (SV2TTS)
- paper link
- example link

CLS

Add audio classification example on ESC-50 and custom dataset.
Add audio tagging demo based on PANNs and Audioset labels.

ST

ST-MTL
FAT-ST-MTL

Docs

Add quick start
Add Read the Docs documentation
Improve installation documentation
Add README for each example

Demos

Audio_tagging
Automatic_video_subtitiles
Metaverse
Punctuation_restoration
Speech_recognition
Speech_translation
Story_talker
Style_fs2
Text_to_speech

Others

Update released models and results

Acknowledgements

@zh794390558 @KPatr1ck @Jackwaterveg @yt605155624 @Mingxue-Xu @grasswolfs @jerryuhoo

v2.1.1

2 years ago

ctc alignment
refactor data pipeline
autolog for deepspeech test
refactor checkpoint save/load
deepspeech online model
mfa alignment example
add text normalization example
TLG for aishell
more datasets: thchs30, aidatatang, timit, etc.
8k speech example
ted en-zh st example
more utils

v2.1.0

2 years ago

Transformer/Conformer Offline/Online ASR
Unified CTC Loss for DS2 model and Transformer Model

v1.1

3 years ago

paddle 1.8.x with python2

v1.0

3 years ago

master latest code