OpenDILab Decision AI Engine
Full Changelog: https://github.com/opendilab/DI-engine/compare/v0.5.0...v0.5.1
Contributors: @PaParaZz1 @zjowowen @nighood @kxzxvbk @puyuan1996 @Cloud-Pku @AltmanD @HarryXuancy
Full Changelog: https://github.com/opendilab/DI-engine/compare/v0.4.9...v0.5.0
Contributors: @PaParaZz1 @zjowowen @AltmanD @puyuan1996 @kxzxvbk @Super1ce @nighood @Cloud-Pku @zhangpaipai @ruoyuGao @eltociear
ding/example/dt.py)
InteractionSerialEvaluator, simplifying redundant results
Full Changelog: https://github.com/opendilab/DI-engine/compare/v0.4.8...v0.4.9
Contributors: @PaParaZz1 @zjowowen @zhangpaipai @AltmanD @puyuan1996 @Cloud-Pku @Super1ce @kxzxvbk @jayyoung0802 @Mossforest @lxl2gf @Privilger
stop value is no longer a required field in config and defaults to math.inf. Users can specify max_env_step or max_train_iter in the training entry to run the program with a fixed termination condition.
Full Changelog: https://github.com/opendilab/DI-engine/compare/v0.4.6...v0.4.7
Contributors: @PaParaZz1 @zjowowen @puyuan1996 @SolenoidWGT @Super1ce @karroyan @zhangpaipai @eltociear
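The termination rule described in this release (stop value defaulting to math.inf, with optional max_env_step and max_train_iter limits) can be sketched roughly as below. This is a minimal illustration, not DI-engine's actual implementation: the function name `should_stop` and its parameters are hypothetical, and only the three field names come from the notes above.

```python
import math
from typing import Optional


def should_stop(
    eval_return: float,
    env_step: int,
    train_iter: int,
    stop_value: float = math.inf,
    max_env_step: Optional[int] = None,
    max_train_iter: Optional[int] = None,
) -> bool:
    """Hypothetical helper: True once any configured termination condition is met."""
    # With the default stop_value of math.inf, no finite return triggers this check.
    if eval_return >= stop_value:
        return True
    # Fixed budget on collected environment steps, if one was given.
    if max_env_step is not None and env_step >= max_env_step:
        return True
    # Fixed budget on training iterations, if one was given.
    if max_train_iter is not None and train_iter >= max_train_iter:
        return True
    return False
```

Under these assumptions, a run with no stop value and no limits never terminates on its own, which is why one of the two budgets is usually supplied when the stop value is omitted.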
wandb as the default logger in task pipeline
value_network config field and implementations in SAC and related algorithms
Full Changelog: https://github.com/opendilab/DI-engine/compare/v0.4.6...v0.4.7
Contributors: @PaParaZz1 @karroyan @zjowowen @ruoyuGao @kxzxvbk @nighood @song2181 @SolenoidWGT @PSHarold @jimmydengpeng @eltociear
CkptSaver(cfg, policy, train_freq=100) -> CkptSaver(policy, cfg.exp_name, train_freq=100)
Contributors: @PaParaZz1 @puyuan1996 @timothijoe @Cloud-Pku @ruoyuGao @Super1ce @karroyan @kxzxvbk @eltociear
BaseEnv to utilize DingEnvWrapper
final_eval_reward to eval_episode_return in all related codes (including envs and evaluators)
eval reward to episode return (#536)
Contributors: @PaParaZz1 @sailxjx @zjowowen @hiha3456 @Weiyuhong-1998 @kxzxvbk @song2181 @zerlinwang
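For code written against older releases, the change from final_eval_reward to eval_episode_return noted in this release could be bridged with a small shim like the one below. This is a sketch under stated assumptions: it assumes the value travels in a plain dict, and the helper name `migrate_info` is hypothetical. Only the two key names come from the notes above.

```python
from typing import Any, Dict

OLD_KEY = "final_eval_reward"    # name used before this release
NEW_KEY = "eval_episode_return"  # name used from this release on


def migrate_info(info: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical shim: return a copy of info with the old key renamed."""
    migrated = dict(info)  # shallow copy so the caller's dict is not mutated
    if OLD_KEY in migrated:
        value = migrated.pop(OLD_KEY)
        # If both keys are present, keep the existing new-style value.
        migrated.setdefault(NEW_KEY, value)
    return migrated
```

A shim like this is only a transition aid; renaming the key at its source keeps custom envs and evaluators consistent with the rest of the codebase.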
dataclass now, rather than dict
wandb now, rather than tensorboard
Contributors: @PaParaZz1 @puyuan1996 @karroyan @hiha3456 @davide97l @Weiyuhong-1998 @zjowowen @norman26625
Contributors: @PaParaZz1 @nighood @norman26625 @ZHZisZZ @cpwan @mahuangxu
config will be deep-copied by default in compile_config function
compile_config function, current code repo git log and git diff information will be saved in exp_name directory
Contributors: @PaParaZz1 @ZHZisZZ @zjowowen @song2181 @zerlinwang @i-am-tc @hiha3456 @nighood @kxzxvbk @Weiyuhong-1998 @RobinC94
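The practical effect of deep-copying the config in compile_config can be illustrated with the standard library alone. The sketch below does not call DI-engine: `compile_like` is a hypothetical stand-in that only mimics the copy-then-modify behaviour described above, and the config keys and default value are illustrative.

```python
import copy
from typing import Any, Dict


def compile_like(cfg: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical stand-in: fill defaults on a deep copy, leaving the input intact."""
    compiled = copy.deepcopy(cfg)  # nested dicts are copied too, not shared
    compiled.setdefault("exp_name", "default_experiment")  # illustrative default
    return compiled


user_cfg = {"policy": {"learn": {"batch_size": 64}}}
compiled = compile_like(user_cfg)

# Changing the compiled config does not leak back into the user's original.
compiled["policy"]["learn"]["batch_size"] = 128
```

With a shallow copy the nested `learn` dict would be shared, so the last assignment would also change `user_cfg`. Deep-copying avoids that kind of aliasing when the same config object is reused across experiments.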