DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
distributed_port
for deepspeed.initialize
by @LZHgrla in https://github.com/microsoft/DeepSpeed/pull/5260
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.14.0...v0.14.1
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.13.5...v0.14.0
deepspeed.comm
instead of torch.distributed
by @jinyouzhi in https://github.com/microsoft/DeepSpeed/pull/5225
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.13.4...v0.13.5
--extra-index-url
by @loadams in https://github.com/microsoft/DeepSpeed/pull/5183
--extra-index-url
by @loadams in https://github.com/microsoft/DeepSpeed/pull/5184
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.13.3...v0.13.4
if collate_fn is None
by @bm-synth in https://github.com/microsoft/DeepSpeed/pull/5107
map_reduce
by @bm-synth in https://github.com/microsoft/DeepSpeed/pull/5106
index
key from output of metric_function
in DataAnalysis
map operation by @bm-synth in https://github.com/microsoft/DeepSpeed/pull/5112
run_map_reduce
to fix errors when running run_map
followed by run_reduce
by @bm-synth in https://github.com/microsoft/DeepSpeed/pull/5131
isinstance
check in PR 5112 by @bm-synth in https://github.com/microsoft/DeepSpeed/pull/5142
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.13.2...v0.13.3
exclude_frozen_parameters
for save_16bit_model
by @LZHgrla in https://github.com/microsoft/DeepSpeed/pull/4999
mp_size
to tensor_parallel
for TP by @yundai424 in https://github.com/microsoft/DeepSpeed/pull/5048
tensor.numel() % (2 * global_world_size) != 0
by @ByronHsu in https://github.com/microsoft/DeepSpeed/pull/5056
exclude_frozen_parameters
for zero_to_fp32.py
script by @andstor in https://github.com/microsoft/DeepSpeed/pull/4979
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.13.1...v0.13.2
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.13.0...v0.13.1
ignore_unused_parameters
by @loadams in https://github.com/microsoft/DeepSpeed/pull/4949
load_module_only
by @haileyschoelkopf in https://github.com/microsoft/DeepSpeed/pull/4141
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.12.6...v0.13.0
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.12.5...v0.12.6
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.12.4...v0.12.5