A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
Thin, unified, C++-flavored wrappers for the CUDA APIs
Training neural networks in TensorFlow 2.0 with 5x less memory
A Toolkit for Training, Tracking, Saving Models and Syncing Results