Performance-portable, length-agnostic SIMD with runtime dispatch
oneAPI Deep Neural Network Library (oneDNN)
Fast inference engine for Transformer models
Implementations of SIMD instruction sets for systems which don't nativel...
📽 Highly Optimized 2D / 3D Graphics Math (glm) for C
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematica...
C++ image processing and machine learning library with using of SIMD: SS...
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biq...
DirectXMath is an all inline SIMD C++ linear algebra library for use in ...
SIMD Vector Classes for C++
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions ...
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Expressive Vector Engine - SIMD in C++ Goes Brrrr
Library for specialized dense and sparse matrix operations, and deep lea...
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT