Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Mo...
optimizer & lr scheduler & loss function collections in PyTorch