Espnet Versions Save

End-to-End Speech Processing Toolkit

v.202402

2 months ago

News

We're thrilled to announce that our latest update brings two groundbreaking features to our project: espnetez and ESPnet-SPK!

New Features

  • [New Features][ESPnet2][ESPnet1][Installation][SE] Add diffusion-base SE model to ESPnet-SE #5572 by @LiChenda
  • [New Features][ESPnet2][ESPnet1][CI][ASR] Add Bayes Risk CTC (reworked) #5519 by @jctian98
  • [New Features][ESPnet2][TTS] TTS evaluation script and monitoring functionality using MOS prediction model #5485 by @Takaaki-Saeki
  • [New Features][ESPnet2][SE] Add USES model for speech enhancement in diverse conditions #5482 by @Emrys365
  • [New Features][ESPnet2][CI][SID] ESPnet-SPk: major update #5408 by @Jungjee
  • [New Features][ESPnet2][TTS][ASR] Add espnetez #5372 by @Masao-Someki

Enhancement

  • [Enhancement][ESPnet2][OWSM] Improving OWSM inference interface #5618 by @pyf98
  • [Enhancement][ESPnet2][OWSM] Add OWSM v3.1 #5611 by @pyf98
  • [Enhancement][ESPnet2][CI] ESPnet-SPK: Additional models, supplement readme #5559 by @Jungjee
  • [Enhancement][ESPnet2][CI][SE] Add PyTorch & GPU support for DNSMOS calculation #5548 by @Emrys365
  • [Enhancement][ESPnet2][TTS][SID] Speaker embedding extractor (with ESPnet pre-trained speaker model) #5579 by @ftshijt

Recipe

  • [Recipe][ESPnet2][Music] Fix relative setting of train-dev-test #5623 by @ftshijt
  • [Recipe][ESPnet2][SID] ESPnet-SPK: add Voxblink recipe #5583 by @Jungjee
  • [Recipe][ESPnet2][SID] ESPnet-SPK: Model upload and result generation #5558 by @Jungjee
  • [Recipe][ESPnet2][Music] ACE singer recipe fixing #5551 by @ftshijt
  • [Recipe][ESPnet2][TTS] TTS2 Template #5541 by @ftshijt
  • [Recipe][ESPnet2][ASR] fix kaldi dependency in asr2 #5540 by @ftshijt
  • [Recipe][ESPnet2][CI][S2ST] CI test for s2st #5526 by @ftshijt
  • [Recipe][ESPnet2][ASR] Added data.sh to SPRING-INX IITM Recipe #5522 by @arjun-gangwar
  • [Recipe][ESPnet2][ASR] Add Libriheavy small and medium ASR2 recipes #5512 by @akreal
  • [Recipe][ESPnet2][ASR] SPRING-INX IITM RECIPE #5505 by @arjun-gangwar
  • [Recipe][ESPnet2][ASR][RNNT] Add transducer conformer configuration to commonvoice recipe #5503 by @zuazo
  • [Recipe][ESPnet2][ESPnet1] add centralized data preparation for OWSM #5478 by @jctian98
  • [Recipe][ESPnet1] Added clean speech results #5649 by @linan2
  • [Recipe][ESPnet2][Installation][AV] AVSR recipe for Easycom Dataset #5630 by @ms-dot-k
  • [Recipe][ESPnet2] Update CHiME-7 ASR1 recipe #5555 by @popcornell
  • [Recipe][ESPnet2] Add E-Branchformer model checkpoint in OWSM v2 #5517 by @pyf98
  • [Recipe][ESPnet2][SLU] Slue PR configs #5087 by @siddhu001

Bugfix

  • [Bugfix][ESPnet2] Fix path dependency in ESPnet tutorial #5645 by @siddhu001
  • [Bugfix][ESPnet2] Fix ESPnet tutorial #5644 by @siddhu001
  • [Bugfix] Fix CI #5642 by @siddhu001
  • [Bugfix][ESPnet2] Fixed bug by copying missing Kaldi scripts #5636 by @VicentCano
  • [Bugfix][ESPnet1][ASR] CTC prefix score, fix if blank == eos #5620 by @albertz
  • [Bugfix][ESPnet2] Fix minor OWSM data prep bug #5607 by @juice500ml
  • [Bugfix][ESPnet2][ESPnet1][CI] E721 #5589 by @sw005320
  • [Bugfix][ESPnet2][ESPnet1] Make minlenratio effective #5581 by @jctian98
  • [Bugfix][ESPnet2] Fix except #5567 by @takenori-y
  • [Bugfix][ESPnet1][Installation][CI] Improve error robustness of unit tests #5535 by @Emrys365
  • [Bugfix][ESPnet2][AV] Fix bug in lrs3 data preprocessing #5520 by @ms-dot-k
  • [Bugfix][ESPnet1] replace old mustc links with new instructions #5516 by @brianyan918
  • [Bugfix][ESPnet2][ST] Fix s2st HF model uploading #5504 by @tjysdsg
  • [Bugfix][ESPnet2][ESPnet1] bug fixes for must_c v2 recipe #5640 by @jasonmusespresso

Documentation

  • [Documentation][ESPnet2] Add instructions for finetuning owsm #5539 by @pyf98
  • [Documentation] Updated the reference of the accepted JOSS paper #5515 by @neillu23

Others

  • [Others] Update Discord Invitation Link #5578 by @Fhrozen
  • [Others][ESPnet2][CI] Improve error robustness of unit tests #5523 by @Emrys365

Acknowledgements

Special thanks to @Emrys365, @Fhrozen, @Jungjee, @LiChenda, @Masao-Someki, @Takaaki-Saeki, @VicentCano, @akreal, @albertz, @arjun-gangwar, @brianyan918, @ftshijt, @jasonmusespresso, @jctian98, @juice500ml, @linan2, @ms-dot-k, @neillu23, @popcornell, @pyf98, @siddhu001, @sw005320, @takenori-y, @tjysdsg, @zuazo.

v.202310

6 months ago

What's Changed

New Contributors

Full Changelog: https://github.com/espnet/espnet/compare/v.202308...v.202310

v.202308

8 months ago

What's Changed

New Contributors

Full Changelog: https://github.com/espnet/espnet/compare/v.202304...v.202308

v.202304

11 months ago

What's Changed

New Contributors

Full Changelog: https://github.com/espnet/espnet/compare/v.202301...v.202304

v.202301

1 year ago

What's Changed

New Contributors

Full Changelog: https://github.com/espnet/espnet/compare/v.202211...v.202301

v.202211

1 year ago

What's Changed

New Contributors

Full Changelog: https://github.com/espnet/espnet/compare/v.202209...v.202211

v.202209

1 year ago

What's Changed

New Contributors

Full Changelog: https://github.com/espnet/espnet/compare/v.202207...v.202209

v.202207

1 year ago

New Features

  • [New Features][ESPnet1][ASR] Add DDP support for v1 ASR training. #4430 by @lazykyama
  • [New Features][ESPnet2] Support tensorboard graph #4418 by @kamo-naoyuki
  • [New Features][ESPnet2][ASR] Branchformer Encoder in ESPnet2 #4400 by @pyf98
  • [New Features][ESPnet2][Diarization][SE] enh_diar joint model #4339 by @YushiUeda
  • [New Features][ESPnet2][ESPnet1] Calculate RTF and latency in espnet2 #4382 by @espnetUser
  • [New Features][ESPnet2][ESPnet1][SE] Add EnhPreprocessor for Speech Enhancement #4321 by @Emrys365
  • [New Features][ESPnet2][SE] Add DPTNet and WarmupStepLR scheduler #4449 by @Emrys365
  • [New Features][ESPnet2][SE] Add support for calculating losses on noise and dereverberated signals #4476 by @Emrys365

Recipe

  • [Recipe][ESPnet2] Aishell-2 GPU info #4501 by @jctian98
  • [Recipe][ESPnet2] Fix librispeech default path to signify auto download #4517 by @karthik19967829
  • [Recipe][ESPnet2] Recipe fix for PueblaNahuatl Recipe #4522 by @ftshijt
  • [Recipe][ESPnet2][ASR][README] Add Aishell-2 ASR Recipe for Espnet2 #4451 by @jctian98
  • [Recipe][ESPnet2][ASR][README] Add AmericasNLP 2022 baselines #4428 by @akreal
  • [Recipe][ESPnet2][ESPnet1][ASR][Installation] FLEURS ASR Recipe for ESPnet2 #4455 by @wanchichen
  • [Recipe][ESPnet2][ESPnet1][ASR][README] tedx_spanish_corpus egs2 recipe #4523 by @jessicah25
  • [Recipe][ESPnet2][ESPnet1][ASR][SE] Adding L3DAS22 Task1 model to ESPNet-SE #3994 by @popcornell
  • [Recipe][ESPnet2][ESPnet1][ST] Must_C v1 and v2 in egs2 #4306 by @brianyan918
  • [Recipe][ESPnet2][README] Dcase task1 Baseline #4317 by @siddhu001
  • [Recipe][ESPnet2][README] Report Aishell-2 Transducer results #4489 by @jctian98
  • [Recipe][ESPnet2][README] Update language codes in AmericasNLP 2022 baseline #4441 by @akreal
  • [Recipe][ESPnet2][README] Vox populi baseline #4478 by @siddhu001
  • [Recipe][ESPnet2][SE] L3DAS22 enhancement recipe #4269 by @neillu23
  • [Recipe][ESPnet2][SE] Update notes in the recipes for DNS challenges #4433 by @YoshikiMas
  • [Recipe][ESPnet2][SE][SLU][ST] LT-Spatialized and SLURP-Spatialized combined enhancement recipe #4268 by @neillu23
  • [Recipe][ESPnet2][ST] Add moses check for ST recipes #4417 by @ftshijt
  • [Recipe][ESPnet2][TTS] Add talromur recipe #4379 by @G-Thor
  • [Recipe][ESPnet2][TTS] Fix for issue #4401 #4402 by @G-Thor
  • [Recipe][ESPnet2][TTS] add pre-trained model jets in the recipe of ljspeech, kss #4406 by @imdanboy

Bugfix

  • [Bugfix][ESPnet1] fix the corrupted pretrained model #4490 by @wentaoxandry
  • [Bugfix][ESPnet1][ESPnet2] Fix an4 URL #4427 by @pyf98
  • [Bugfix][ESPnet1][ESPnet2][RNNT] Fix mAES with big vocab size #4312 by @b-flo
  • [Bugfix][ESPnet2] Adding init.py to espnet2/diar/layers and espnet2/diar/separator #4470 by @cycentum
  • [Bugfix][ESPnet2] Fix tensorboard-graph creation for multi gpu mode #4431 by @kamo-naoyuki
  • [Bugfix][ESPnet2] Update char_tokenizer.py #4499 by @xiabingquan
  • [Bugfix][ESPnet2][ESPnet1][ASR][LM][MT][TTS] Fix Transducer LM fusion and add Logging for Transducer inference #4327 by @chintu619
  • [Bugfix][ESPnet2][SE] Fix a bug in enh unit test #4435 by @Emrys365

Enhancement

  • [Enhancement][ESPnet2] Optionize graph creation #4551 by @kan-bayashi
  • [Enhancement][ESPnet2][Installation][TTS] Add icelandic g2p #4384 by @G-Thor
  • [Enhancement][ESPnet2][SE] Add support of test-only criterions after each epoch #4381 by @Emrys365
  • [Enhancement][ESPnet2][SSL] raise more useful error in espnet2/asr/frontend/s3prl.py if s3prl is not installed #4480 by @popcornell
  • [Enhancement][ESPnet2][TTS] Add JETS AlignmentModule in calculate_all_attentions.py #4446 by @seastar105

Refactoring

  • [Refactoring][ESPnet1] Refactoring 'is_prefix' function #4530 by @jhlee9010
  • [Refactoring][ESPnet2][ASR] Zero_infinity option for ctc loss #4415 by @kamo-naoyuki

Others

  • [CI][ESPnet1][ESPnet2][Installation] Remove the version restriction for numpy #4419 by @kamo-naoyuki
  • [CI][ESPnet2] Canged to install espnet from wheel in the test_import CI test #4471 by @kamo-naoyuki
  • [CI][Installation] Temporary fixed numpy version #4464 by @kamo-naoyuki
  • [Documentation] Add notes on batch size and num of GPUs in ESPnet2 documentation #4436 by @pyf98
  • [Documentation][ESPnet1] Update decoder.py #4322 by @sw005320
  • [Documentation][ESPnet2] Add a note to follow the installation instructions #4477 by @akreal

Acknowledgements

Special thanks to @Emrys365, @G-Thor, @YoshikiMas, @YushiUeda, @akreal, @b-flo, @brianyan918, @chintu619, @cycentum, @espnetUser, @ftshijt, @imdanboy, @jctian98, @jessicah25, @jhlee9010, @kamo-naoyuki, @kan-bayashi, @karthik19967829, @lazykyama, @neillu23, @popcornell, @pyf98, @seastar105, @siddhu001, @sw005320, @wanchichen, @wentaoxandry, @xiabingquan.

v.202205

1 year ago

New Features

  • [New Features][ESPnet1][ESPnet2][ASR] Add quantization in ESPnet2 for asr inference #4349 by @pyf98
  • [New Features][ESPnet2][SE] Add svoice recipe for wsj0-2mix speech separation #4257 by @nateanl
  • [New Features][ESPnet2][SE] Merge Deep Clustering and Deep Attractor Network to enh separator #4110 by @earthmanylf
  • [New Features][ESPnet2][SE] Some improvements to current enh functions #4251 by @Emrys365
  • [New Features][ESPnet2][SE][Installation] Import fast_bss_eval and update some time-domain losses for enh task #4256 by @LiChenda
  • [New Features][ESPnet2][TTS] add e2e tts model: JETS #4364 by @imdanboy

Bugfix

  • [Bugfix][ESPnet1] Fix minimum input length for Conv2dSubsampling2 in check_short_utt #4378 by @akreal
  • [Bugfix][ESPnet1][ESPnet2] Minor fixes for the intermediate loss usage and Mask-CTC decoding #4374 by @YosukeHiguchi
  • [Bugfix][ESPnet2] Fix #4396 #4398 by @kamo-naoyuki
  • [Bugfix][ESPnet2] Fix a bug in utterance_mvn #4304 by @Emrys365
  • [Bugfix][ESPnet2] Minor fix for Mask-CTC forward function #4347 by @YosukeHiguchi
  • [Bugfix][ESPnet2] Wandb Minor Fix for Model Resume #4329 by @roshansh-cmu
  • [Bugfix][ESPnet2] fix the enh_s2t_task argument in espnet2/bin/st_inference.py #4323 by @simpleoier
  • [Bugfix][ESPnet2][MT][ST] fix bug in mt/st templates for having separate token lists #4149 by @brianyan918
  • [Bugfix][ESPnet2][Recipe] Fix aishell3 data preparation script #4277 by @LanceaKing
  • [Bugfix][ESPnet2][SE] Fix a bug in stats aggregation when PITSolver is used #4343 by @Emrys365
  • [Bugfix][ESPnet2][SE] fix for enhancement model loading compatibility #4259 by @LiChenda
  • [Bugfix][ESPnet2][ST] bug fixes in ST recipes #4341 by @chintu619
  • [Bugfix][ESPnet2][TTS] Fix optional data names for TTS #4355 by @kan-bayashi
  • [Bugfix][ESPnet2][TTS] fix a bug in Mandarin pypinyin_g2p_phone #4206 by @WeiGodHorse
  • [Bugfix][ESPnet2][TTS] fix loss = NaN in VITS with mixed precision #4356 by @kan-bayashi
  • [Bugfix][ESPnet2][streaming] Add unit test to streaming ASR inference #4352 by @espnetUser
  • [Bugfix][Installation] fix s3prl install by using legacy version. Temporal solution. #4399 by @simpleoier
  • [Bugfix][README] Fix typo #4338 by @ftshijt

Enhancement

  • [Enhancement][ESPnet1][ESPnet2][ASR][SE][SLU][ST] enh_s2t joint model #4226 by @simpleoier
  • [Enhancement][ESPnet2] Add progress bar to phonemization #4320 by @G-Thor
  • [Enhancement][ESPnet2][MT] Update show_translation_result.sh to show all decoding results under the given exp directory #4330 by @pyf98

Recipe

  • [Recipe][ESPnet1][ASR] Accented English Speech Recognition Challenge 2020 recipe (AESRC2020) #3898 by @brianyan918
  • [Recipe][ESPnet1][ESPnet2][ASR][README][Recipe] Add MediaSpeech ASR recipe #4183 by @AshibaWu
  • [Recipe][ESPnet2][ASR][README] recipee for Microsoft speech corpus for Indian Languages #4191 by @navya-yarrabelly
  • [Recipe][ESPnet2][ASR][README] Accented French Openslr57 ASR recipe (ESPnet2) (part of Homework3 MNLP) #4280 by @DanBerrebbi
  • [Recipe][ESPnet2][ASR][README] Add Mask-CTC results #4180 by @YosukeHiguchi
  • [Recipe][ESPnet2][ASR][README] Add ml_openslr63 ASR recipe #4173 by @bharaniuk
  • [Recipe][ESPnet2][ASR][README] Adding new recipe for Burmese (OpenSLR80) #4182 by @JainSameer06
  • [Recipe][ESPnet2][ASR][README] add chime6 recipe #4332 by @simpleoier
  • [Recipe][ESPnet2][ASR][SE][README] add egs2/chime4/enh_asr1 recipe and results #4316 by @simpleoier
  • [Recipe][ESPnet2][README][RNNT] updated librispeech-asr with rnn-t results #4281 by @chintu619
  • [Recipe][ESPnet2][README][SE] 2021 Clarity Challenge recipe #4210 by @popcornell
  • [Recipe][ESPnet2][README][SE] Add AISHELL-4 ENH recipe #4249 by @Emrys365
  • [Recipe][ESPnet2][README][SE] Add ConferencingSpeech 2021 recipe to egs2 #4192 by @Emrys365
  • [Recipe][ESPnet2][README][SE] Add ICASSP2021 DNS Challenge 2 recipe #4253 by @YoshikiMas
  • [Recipe][ESPnet2][README][SE] Add INTERSPEECH 2021 DNS Challenge 3 recipe #4238 by @YoshikiMas
  • [Recipe][ESPnet2][README][SE] Add results of ICASSP2021 DNS Challenge 2 recipe #4309 by @YoshikiMas
  • [Recipe][ESPnet2][README][SE] Rename egs2/clarity21/enh_2021 to egs2/clarity21/enh1 #4328 by @Emrys365
  • [Recipe][ESPnet2][README][SE] add convtasnet recipe for dns_ins20 #4314 by @muqiaoy
  • [Recipe][ESPnet2][README][SLU] Harpervalley recipe #4315 by @YushiUeda
  • [Recipe][ESPnet2][README][SLU] SLUE Voxpopuli base recipe #4262 by @siddhu001
  • [Recipe][ESPnet2][README][ST] CoVOST2 recipes #4300 by @ftshijt
  • [Recipe][ESPnet2][SLU][README] Update SLU results for ICASSP #4283 by @siddhu001

Others

  • [CI][Docker] Github Action Trigger Docker Build #4295 by @Fhrozen
  • [CI][Docker] Github Action for Docker build #4219 by @Fhrozen
  • [CI][ESPnet1][ESPnet2][Installation][README] Add isort checking to the CI tests #4372 by @kamo-naoyuki
  • [CI][ESPnet1][ESPnet2][Installation][README][mergify] Add pytorch=1.10.2 and 1.11.0 to ci configurations #4348 by @kamo-naoyuki
  • [CI][ESPnet2][ASR][SE] add integration test and fix the decoding in enh_asr and enh_st #4310 by @simpleoier
  • [CI][ESPnet2][New Features][SLU][ST][streaming] Add streaming ST/SLU #4243 by @D-Keqi
  • [CI][ESPnet2][ST] Add Test Functions for ST Train and Inference #4324 by @ftshijt
  • [CI][Installation] update install_pesq.sh #4265 by @LiChenda
  • [Documentation][ESPnet2][README][TTS] Minor update for JETS #4369 by @kan-bayashi
  • [Documentation][README] Change the order of README #4289 by @ftshijt
  • [Documentation][README] Update README.md #4284 by @sw005320

Acknowledgements

Special thanks to @AshibaWu, @D-Keqi, @DanBerrebbi, @Emrys365, @Fhrozen, @G-Thor, @JainSameer06, @LanceaKing, @LiChenda, @WeiGodHorse, @YoshikiMas, @YosukeHiguchi, @YushiUeda, @akreal, @bharaniuk, @brianyan918, @chintu619, @earthmanylf, @espnetUser, @ftshijt, @imdanboy, @kamo-naoyuki, @kan-bayashi, @muqiaoy, @nateanl, @navya-yarrabelly, @popcornell, @pyf98, @roshansh-cmu, @siddhu001, @simpleoier, @sw005320.

v.202204

2 years ago

News

From this version, we decided to use date-based versioning, e.g., v.202204.

New Features

  • [New Features][ESPnet1] added learnable fourier features #4029 by @popcornell
  • [New Features][ESPnet1][ESPnet2][ASR] Restricted Self Attention for E2E Speech Summarization #4071 by @roshansh-cmu
  • [New Features][ESPnet1][Installation][README] add lrs avsr recipe #4104 by @wentaoxandry
  • [New Features][ESPnet1][README] add lip reading sentences dataset code #4074 by @wentaoxandry
  • [New Features][ESPnet2][ASR] [ESPnet2] Intermediate/Self-conditioned CTC #4084 by @YosukeHiguchi
  • [New Features][ESPnet2][ASR] [WIP] [ESPnet2] Mask-CTC #4158 by @YosukeHiguchi
  • [New Features][ESPnet2][ASR][README] Add stochastic depth to conformer and share results on LibriSpeech 960h #4142 by @pyf98
  • [New Features][ESPnet2][MT] MT task for espnet2 with IWSLT14 recipe #4111 by @siddalmia
  • [New Features][ESPnet2][README][SE] Add DC-CRN complex masking and spectral mapping approach for speech enhancement #4127 by @Emrys365
  • [New Features][ESPnet2][README][SE] Add DCCRN separator #4097 by @Johnson-Lsx
  • [New Features][ESPnet2][README][SE] Add a new separator for speech enhancement/separation tasks #4062 by @LiChenda
  • [New Features][ESPnet2][README][SE] Add iFaSNet for enhancement/separation tasks. #4130 by @LiChenda
  • [New Features][ESPnet2][SE] Refactor DNN_Beamformer in espnet2 and add new beamformers #4082 by @Emrys365

Enhancement

  • [Enhancement][ESPnet2] Add an optional suffix to the averaged model file name #4067 by @pyf98
  • [Enhancement][ESPnet2] Update perturb_data_dir_speed.sh #4091 by @AmirHussein96
  • [Enhancement][ESPnet2][ASR] Add tests for Intermediate/Self-conditioned CTC #4117 by @YosukeHiguchi
  • [Enhancement][ESPnet2][TTS] Add option to use norm. feats over denorm. #4250 by @G-Thor

Recipe

  • [Recipe][ESPnet1][RNNT] [ESPNET1] Add the results of conformer-transducer for Librispeech #4080 by @eesungkim
  • [Recipe][ESPnet2][ASR] Add ASR recipe for VCTK dataset based on TTS's dataprep. #4088 by @kashikashi
  • [Recipe][ESPnet2][ASR] Add new conformer config with hop length 160 for LibriSpeech 960h #4162 by @pyf98
  • [Recipe][ESPnet2][ASR] Add new zh_openslr38 ASR recipe #4181 by @cuichenx
  • [Recipe][ESPnet2][ASR] Add transformer results for LibriSpeech 100h #4089 by @pyf98
  • [Recipe][ESPnet2][ASR] Added Marathi OpenSLR 64 recipe #4179 by @SujaySKumar
  • [Recipe][ESPnet2][ASR] Added recipe for Microsoft Speech Corpus (Indian languages) #4194 by @chintu619
  • [Recipe][ESPnet2][ASR] Automatic lyric recognition Recipe #4129 by @ftshijt
  • [Recipe][ESPnet2][ASR] ESPNET - LRS3 Recepie #4101 by @gdebayan
  • [Recipe][ESPnet2][ASR] bengali asr model with no finetuning #4047 by @dzeinali
  • [Recipe][ESPnet2][MT] IWSLT'14 Results using ESPnet2-MT #4132 by @pyf98
  • [Recipe][ESPnet2][README] Mandarin ISO id should be CMN instead of ZHO #4125 by @xinjli
  • [Recipe][ESPnet2][README] Update README.md #4037 by @dzeinali
  • [Recipe][ESPnet2][README] Update README.md #4121 by @dzeinali
  • [Recipe][ESPnet2][README] Update README.md for How2 2000h ASR,SUM #4155 by @roshansh-cmu
  • [Recipe][ESPnet2][RNNT] Create decode_rnnt_conformer.yaml #4058 by @sw005320
  • [Recipe][ESPnet2][RNNT] Create train_rnnt_conformer.yaml #4057 by @sw005320
  • [Recipe][ESPnet2][SLU] Add IEMOCAP results and configs #4100 by @YushiUeda
  • [Recipe][ESPnet2][SLU] Add new config and support for computing WER in SLUE-VoxCeleb #4152 by @siddhu001
  • [Recipe][ESPnet2][SLU] Add sentiment data preparation for IEMOCAP #4065 by @YushiUeda
  • [Recipe][ESPnet2][SLU] ESPnet2 swbd_sentiment recipe #4134 by @YushiUeda
  • [Recipe][ESPnet2][ST] egs2/iwslt22_dialect #4013 by @brianyan918

Bugfix

  • [Bugfix][CI][ESPnet2] Fix CI test failures related to torch_complex 0.4.0 #4112 by @Emrys365
  • [Bugfix][CI][Installation] fix doc ci by pinning jinja version #4239 by @xinjli
  • [Bugfix][ESPnet2] Fix n-gram decoding #4168 by @sw005320
  • [Bugfix][ESPnet2] bug fixes and efficient train/dev split in data prep of Microsoft Indian Languages recipe #4196 by @chintu619
  • [Bugfix][ESPnet2] fix errors in configs of librispeech ssl frontends #4098 by @simpleoier
  • [Bugfix][ESPnet2][ASR][ST] [bug patch] egs2/iwslt22_dialect #4049 by @brianyan918
  • [Bugfix][ESPnet2][MT][ST] Fix joint tokenization in st.sh #4143 by @pyf98
  • [Bugfix][ESPnet2][MT][ST] scoring fixes MT and ST #4146 by @siddalmia
  • [Bugfix][ESPnet2][TTS] Fix speaker normalization #4229 by @LanceaKing
  • [Bugfix][Installation] set gtn version #4122 by @brianyan918
  • [Bugfix][ESPnet1][ESPnet2] minor fixes in ST in espnet2 #4056 by @siddalmia

Others

  • [CI] Simplify vocoder compatibility test #4061 by @kan-bayashi
  • [CI][Documentation] Fix notebook in the official doc. #4171 by @ShigekiKarita
  • [Docker] Docker Updates #4064 by @Fhrozen
  • [Documentation] Add a checklist for PRs on recipe #4053 by @ftshijt
  • [Documentation] README Update for E2E Speech Summarization #4071 #4150 by @roshansh-cmu
  • [Documentation] Update the example PyTorch version in Installation doc #4116 by @pyf98
  • [Documentation] [documentation] fix minor typo in installation.md #4164 by @JDongian
  • [Documentation][ESPnet1] fix typo #4044 by @ooyamatakehisa
  • [Documentation][ESPnet1][ESPnet2][ASR] Add Huggingface-cli usage #4027 by @karthik19967829

Acknowledgements

Special thanks to @AmirHussein96, @Emrys365, @Fhrozen, @G-Thor, @JDongian, @Johnson-Lsx, @LanceaKing, @LiChenda, @ShigekiKarita, @SujaySKumar, @YosukeHiguchi, @YushiUeda, @brianyan918, @chintu619, @cuichenx, @dzeinali, @eesungkim, @ftshijt, @gdebayan, @kan-bayashi, @karthik19967829, @kashikashi, @ooyamatakehisa, @popcornell, @pyf98, @roshansh-cmu, @siddalmia, @siddhu001, @simpleoier, @sw005320, @wentaoxandry, @xinjli.