A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo
docker pull nvcr.io/nvidia/nemo:22.12
docker pull nvcr.io/nvidia/nemo:22.11
docker pull nvcr.io/nvidia/nemo:22.09
docker pull nvcr.io/nvidia/nemo:22.08
Fix logger reference by @SeanNaren :: PR: #4786
Fix error with class method reference in msdd by @SeanNaren :: PR: #4865
Add sync for logging calls to ensure aggregation across devices by @SeanNaren :: PR: #4876
Fix saving the last checkpoint when using val check interval by @SeanNaren :: PR: #4905
Add support for skipping validation on resume + extend saving last ckpt test by @SeanNaren :: PR: #4922
Move trainer calls for ssl models to training and validation steps only by @sam1373 :: PR: #4685
Change Num Partitions size expansion fix by @aklife97 :: PR: #4719
upgrade to PTL 1.7 by @nithinraok :: PR: #4672
Fixing outputs of infer() and use of NeMo length regulator helper by @borisfom :: PR: #4724
bug fix: enable async grad reduction when DP > 1 by @erhoo82 :: PR: #4740
Add LayerNorm1P, weight decay for LN and unscaled initialization by @mikolajblaz :: PR: #4743
Data Simulator by @chooper1 :: PR: #4686
jenkins data simulator fix by @nithinraok :: PR: #4751
Multiscale Diarization Decoder (MSDD) model and module files by @tango4j :: PR: #4650
Fix logging in gradient clipping with PTL 1.7.2 by @MaximumEntropy :: PR: #4769
Fix checkpoint restoring by @nithinraok :: PR: #4777
avoid data clipping after convolution with rir samples by @nithinraok :: PR: #4806
Fixed in_features dim if bidirectional is True by @farisalasmary :: PR: #4588
Fix float/integer type error in WER.update() by @fujimotos :: PR: #4816
[Speech Data Explorer] An option to explicitly specify the base dir by @anteju :: PR: #4678
adding instancenorm as an option for conv normalization by @bmwshop :: PR: #4827
Fix small spelling mistakes by @SeanNaren :: PR: #4839
[Tutorials] Fix matplotlib version and directory name in Multispeaker_Simulator by @anteju :: PR: #4804
Update diarization folder structure by @tango4j :: PR: #4823
Missing types in clustering by @SeanNaren :: PR: #4858
add new models by @Jorjeous :: PR: #4852
Fix decoding for T5 models with RPE by @MaximumEntropy :: PR: #4847
Update Speaker Diarization notebooks with unknown oracle_num_speakers by @fayejf :: PR: #4861
Fix mha bug by @yzhang123 :: PR: #4859
Updates to adapter training by @arendu :: PR: #4842
Changes to MSDD code after review, fix test log call by @SeanNaren :: PR: #4881
Fixed output of BERT to be [batch x seq x hidden] by @michalivne :: PR: #4887
Add AMI dataset script by @SeanNaren :: PR: #4864
Update label_models.py by @stevehuang52 :: PR: #4891
Update tutorials.rst for question answering by @Zhilin123 :: PR: #4895
removed unused imports for all domains. by @XuesongYang :: PR: #4901
Fix ptl_load_state not providing cls by @MaximumEntropy :: PR: #4914
Remove unused cv collection by @okuchaiev :: PR: #4907
Add mixed-representation config to PhonemizerTokenizer by @rlangman :: PR: #4904
Fix implicit bug in _AudioLabelDataset by @stevehuang52 :: PR: #4923
Fix and refactor label models by @fayejf :: PR: #4913
Sparrowhawk deployment fix by @ekmb :: PR: #4928
Upgrade to NGC PyTorch 22.08 Container by @ericharper :: PR: #4929
Fixes for Cherry Picked PRs by @titu1994 :: PR: #4962
Fix cherry pick workflow by @ericharper :: PR: #4964
check for active conda environment by @nithinraok :: PR: #4970
fix label models restoring issue from weighted cross entropy by @nithinraok :: PR: #4968
Add simple pre-commit file (#4983) by @SeanNaren :: PR: #4995
Fix bug in Squeezeformer Conv block by @titu1994 :: PR: #5011
Fix bugs by @Zhilin123 :: PR: #5036
Add black to pre-commit (#5027) by @SeanNaren :: PR: #5045
Fix bug in question answering tutorial by @Zhilin123 :: PR: #5049
Missing fixes from r1.11.0 to T5 finetuning eval by @MaximumEntropy :: PR: #5054
P&C docs by @jubick1337 :: PR: #5068
probabilites -> probabilities by @nithinraok :: PR: #5078
Notebook bug fixes by @vadam5 :: PR: #5084
update strategy in notebook from ddp_fork to dp by @Zhilin123 :: PR: #5088
Fix Unhashable type list for Numba Cuda spec augment kernel by @titu1994 :: PR: #5093
Remove numba import by @titu1994 :: PR: #5095
T5 prompt learning fixes missing from r1.11.0 merge by @MaximumEntropy :: PR: #5075
T5 Decoding with PP > 2 fix by @MaximumEntropy :: PR: #5091
Multiprocessing fix by @jubick1337 :: PR: #5106
[Bug fix] PC lexical + audio by @ekmb :: PR: #5109
bugfix: pybtex.database.InvalidNameString: Too many commas in author … by @XuesongYang :: PR: #5112
For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo
docker pull nvcr.io/nvidia/nemo:22.07
docker pull nvcr.io/nvidia/nemo:22.05
docker pull nvcr.io/nvidia/nemo:22.04
docker pull nvcr.io/nvidia/nemo:22.03