NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch. Fi...
[CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch N...