A high-throughput and memory-efficient inference and serving engine for ...
Open deep learning compiler stack for cpu, gpu and specialized accelerators
NumPy aware dynamic Python compiler using LLVM
NumPy & SciPy for GPU
A deep learning package for many-body potential energy representation an...
stdgpu: Efficient STL-like Data Structures on the GPU
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs fr...
Dockerfiles for the various software layers defined in the ROCm software...
Abstraction Library for Parallel Kernel Acceleration :llama:
Agenium Scale vectorization library for CPUs and GPUs
Next generation BLAS implementation for ROCm platform
GPU Performance API for AMD GPUs
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a co...
HPC solver for nonlinear optimization problems