Samples for CUDA Developers which demonstrates features in CUDA Toolkit
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A community run, 5-day PyTorch Deep Learning Bootcamp
CUDA Kernel Benchmarking Library
Simple utilities to enable code reuse and portability between CUDA C/C++...
Kernel Tuner
This is an archive of materials produced for an introductory class on CU...
Amplifier allows .NET developers to easily run complex applications with...
Some CUDA design patterns and a bit of template magic for CUDA
Spiking Neural Networks in C++ with strong GPU acceleration through CUDA
CUDA kernel author's tools
Open source cross-platform compiler for compute-intensive loops used in ...
A tool for examining GPU scheduling behavior.
(REOS) Radar and Electro-Optical Simulation Framework written in C++.
Astrophysics program simulating the evolution of star systems based on t...