PaddlePaddle DeepSpeech Versions

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

r0.1.0

2 years ago

Features

CLI: New Feature

  • Easy installation with pip: pip install paddlespeech
  • CLI to quickly explore ASR, TTS, audio classification, speech translation, and punctuation restoration.
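As a rough illustration of the CLI described above, the sketch below assembles command lines for the `paddlespeech` executable from Python. The subcommand names (`asr`, `tts`, `cls`, `st`, `text`) and the `--input` flag are taken from the project README rather than from these release notes, so treat them as assumptions for r0.1.0; some tasks may need extra flags. `build_command` and `run_if_installed` are hypothetical helper names, not part of PaddleSpeech.

```python
import shutil
import subprocess

# Subcommands assumed from the PaddleSpeech README (not confirmed by these notes):
# asr = speech recognition, tts = text to speech, cls = audio classification,
# st = speech translation, text = punctuation restoration.
SUBCOMMANDS = {"asr", "tts", "cls", "st", "text"}


def build_command(task, input_value):
    """Return the argv list for one CLI call (hypothetical helper)."""
    if task not in SUBCOMMANDS:
        raise ValueError(f"unknown task: {task}")
    return ["paddlespeech", task, "--input", input_value]


def run_if_installed(task, input_value):
    """Run the command only when the CLI is on PATH; return stdout or None."""
    if shutil.which("paddlespeech") is None:
        return None  # CLI not installed, nothing is executed
    result = subprocess.run(
        build_command(task, input_value),
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch has no side effects.
    print(" ".join(build_command("asr", "input.wav")))
```

Building the argument list separately from running it keeps the example safe to try without the package installed; `input.wav` is a placeholder file name.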

ASR

  • Join CTC LM decoder
  • Transformer LM model
  • Improve DeepSpeech2 online model
  • Refactor some configs

TTS

CLS

  • Add audio classification example on ESC-50 and custom dataset.
  • Add audio tagging demo based on PANNs and Audioset labels.

ST

  • ST-MTL
  • FAT-ST-MTL

Docs

  • Add quick start
  • Add Read the Docs documentation
  • Improve installation documentation
  • Add README for each example

Demos

  • Audio_tagging
  • Automatic_video_subtitiles
  • Metaverse
  • Punctuation_restoration
  • Speech_recognition
  • Speech_translation
  • Story_talker
  • Style_fs2
  • Text_to_speech

Others

  • Update released models and results

Acknowledgements

@zh794390558 @KPatr1ck @Jackwaterveg @yt605155624 @Mingxue-Xu @grasswolfs @jerryuhoo

v2.1.1

2 years ago
  1. ctc alignment
  2. refactor data pipeline
  3. autolog for deepspeech test
  4. refactor checkpoint save/load
  5. deepspeech online model
  6. mfa alignment example
  7. add text normalization example
  8. TLG for aishell
  9. more datasets: thchs30, aidatatang, timit, etc.
  10. 8k speech example
  11. ted en-zh st example
  12. more utils

v2.1.0

2 years ago
  1. Transformer/Conformer Offline/Online ASR
  2. Unified CTC Loss for DS2 model and Transformer Model

v1.1

3 years ago

paddle 1.8.x with python2

v1.0

3 years ago

master latest code