Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, ...
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray