Padam

The Partially Adaptive Momentum Estimation (Padam) method from the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks" (accepted by IJCAI 2020)

This repository contains our PyTorch implementation of the Partially Adaptive Momentum Estimation (Padam) method from the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks" (accepted by IJCAI 2020).
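
For orientation, the Padam update can be sketched in a few lines of plain Python. This is a minimal sketch based on the algorithm as described in the paper, not the optimizer code shipped in this repository: the function name `padam_step`, the `state` dictionary, the default hyperparameters, and the placement of `eps` inside the power are all assumptions made for illustration.

```python
def padam_step(params, grads, state, lr=0.1, betas=(0.9, 0.999),
               partial=0.125, eps=1e-8):
    """Apply one Padam-style update to a list of floats and return the new list.

    Hypothetical helper for illustration only; `state` holds the moment
    buffers between calls and is filled in on the first call.
    """
    beta1, beta2 = betas
    if not state:
        # Lazily create the buffers: first moment, second moment,
        # and the running maximum of the second moment.
        state["m"] = [0.0] * len(params)
        state["v"] = [0.0] * len(params)
        state["v_hat"] = [0.0] * len(params)

    new_params = []
    for i, (theta, g) in enumerate(zip(params, grads)):
        # Exponential moving averages of the gradient and its square.
        state["m"][i] = beta1 * state["m"][i] + (1 - beta1) * g
        state["v"][i] = beta2 * state["v"][i] + (1 - beta2) * g * g
        # AMSGrad-style maximum keeps the denominator non-decreasing.
        state["v_hat"][i] = max(state["v_hat"][i], state["v"][i])
        # Partially adaptive step: divide by v_hat ** partial.
        # Adding eps inside the power is an assumption of this sketch.
        denom = (state["v_hat"][i] + eps) ** partial
        new_params.append(theta - lr * state["m"][i] / denom)
    return new_params


# Example: two consecutive steps on a single parameter.
state = {}
theta = padam_step([1.0], [2.0], state, lr=0.1, partial=0.125)
theta = padam_step(theta, [1.5], state, lr=0.1, partial=0.125)
print(theta)
```

In this sketch, `partial=0.5` gives an AMSGrad-style fully adaptive step, while `partial=0` reduces the denominator to 1 and leaves a momentum-only step. Intermediate values such as 0.125 sit between the two.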

Prerequisites:

PyTorch
CUDA

Usage:

Run run_cnn_test_cifar10.py with Python for experiments on CIFAR-10, and run_cnn_test_cifar100.py for experiments on CIFAR-100.

Command Line Arguments:

--lr: initial learning rate
--method: optimization method, e.g., "sgdm", "adam", "amsgrad", "padam"
--net: network architecture, e.g., "vggnet", "resnet", "wideresnet"
--partial: partially adaptive parameter for the Padam method
--wd: weight decay
--Nepoch: number of training epochs
--resume: whether to resume from a previous training run
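
The flags listed above could be declared with `argparse` roughly as follows. This is a hypothetical sketch, not the parser used by the repository scripts: the function name `build_parser`, every default value, the argument types, and the treatment of `--resume` as a boolean switch are assumptions.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the documented flags (illustrative only)."""
    parser = argparse.ArgumentParser(
        description="Hypothetical CLI mirroring the documented training flags")
    # All defaults below are placeholders, not values taken from the repository.
    parser.add_argument("--lr", type=float, default=0.1,
                        help="initial learning rate")
    parser.add_argument("--method", type=str, default="padam",
                        help='optimization method, e.g. "sgdm", "adam", "amsgrad", "padam"')
    parser.add_argument("--net", type=str, default="vggnet",
                        help='network architecture, e.g. "vggnet", "resnet", "wideresnet"')
    parser.add_argument("--partial", type=float, default=0.125,
                        help="partially adaptive parameter for Padam")
    parser.add_argument("--wd", type=float, default=5e-4,
                        help="weight decay")
    parser.add_argument("--Nepoch", type=int, default=200,
                        help="number of training epochs")
    # Assumed to be a switch; the real script may expect a value instead.
    parser.add_argument("--resume", action="store_true",
                        help="resume from a previous training run")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(
    ["--lr", "0.1", "--method", "padam", "--net", "vggnet",
     "--partial", "0.125", "--wd", "5e-4"])
print(args)
```

Check the scripts themselves for the actual defaults and types before relying on any value shown here.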

Usage Examples:

Run experiments on CIFAR-10:

  - python run_cnn_test_cifar10.py --lr 0.1 --method "padam" --net "vggnet" --partial 0.125 --wd 5e-4

Run experiments on CIFAR-100:

  - python run_cnn_test_cifar100.py --lr 0.1 --method "padam" --net "resnet" --partial 0.125 --wd 5e-4

Citation

Please check our paper for technical details and full results.

@inproceedings{chen2020closing,
  title={Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks},
  author={Chen, Jinghui and Zhou, Dongruo and Tang, Yiqi and Yang, Ziyan and Cao, Yuan and Gu, Quanquan},
  booktitle={International Joint Conferences on Artificial Intelligence},
  year={2020}
}


Repository: uclaml/Padam
License: Apache-2.0
Homepage: https://arxiv.org/abs/1806.06763
