Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the ...
Performance-portable, length-agnostic SIMD with runtime dispatch
Implementations of SIMD instruction sets for systems which don't nativel...
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Agenium Scale vectorization library for CPUs and GPUs
Boost SIMD
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
A general purpose machine code manipulation library for x86-32 (IA-32) a...