Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, ...
Distributed Keras Engine, Make Keras faster with only one line of code.
Learn applied deep learning from zero to deployment using TensorFlow 1.8+
A Portable C Library for Distributed CNN Inference on IoT Edge Clusters
sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference ...
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray