LSTM and QRNN Language Model Toolkit for PyTorch
NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch. Fi...
A detailed explanation of common optimization methods in deep learning
Efficient, transparent deep learning in hundreds of lines of code.
Ternary Gradients to Reduce Communication in Distributed Deep Learning (...
Machine learning algorithms in the Dart programming language
Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learn...
A Deep Learning and preprocessing framework in Rust with support for CPU...
Lua implementation of Entropy-SGD
A tour of different optimization algorithms in PyTorch.
Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruni...
Java-based sample code for developing on Android. The demos in this repo...
Riemannian stochastic optimization algorithms: Version 1.0.3
Unofficial implementation of Switching from Adam to SGD optimization in ...
Distributed Learning by Pair-Wise Averaging