Adlik: Toolkit for Accelerating Deep Learning Inference
We fork vLLM repository and add some new features to accelerate LLM inference:
Release Date: 2022-12-20 Compatibility: The functional interfaces of Adlik r1.0.0 are compatible with previous release.
Release Date: 2022-6-21 Compatibility: The functional interfaces of Adlik r0.5 are compatible with previous release.
Release Date: 2021-12-02 Compatibility: The functional interfaces of Adlik r0.4 are compatible with previous release.
Release Date: 2021-06-21 Compatibility: The functional interfaces of Adlik r0.3 are compatible with r0.2 and r0.1.
Release Date: 2020-11-20 Compatibility: The functional interfaces of Adlik r0.2 are compatible with r0.1.
Release Date: 2020-06-15 Compatibility: Because r0.1.0 is the first release version of Adlik, there is no consideration on compatibility.
Training framework | Model format | Target runtime | compiled format |
---|---|---|---|
Keras | h5 | Tf Serving | SavedModel |
OpenVINO | IR | ||
TensorRT | Plan | ||
TF-Lite | tflite | ||
TensorFlow | Ckpt/pb | Tf Serving | SavedModel |
OpenVINO | IR | ||
TensorRT | Plan | ||
TF-Lite | tflite | ||
PyTorch | pth | OpenVINO | IR |
TensorRT | Plan |
Training framework | Inference engine | hardware environment |
---|---|---|
Keras | TensorFlow Serving-1.14 | CPU/GPU |
TensorFlow Serving-2.2 | CPU/GPU | |
OpenVINO-2019 | CPU | |
TensorRT-6 | GPU | |
TensorRT-7 | GPU | |
TF Lite-2.1 | CPU(X86/ARM) | |
TensorFlow | TensorFlow Serving-1.14 | CPU/GPU |
TensorFlow Serving-2.2 | CPU/GPU | |
OpenVINO-2019 | CPU | |
TensorRT-6 | GPU | |
TensorRT-7 | GPU | |
TF Lite-2.1 | CPU(X86/ARM) | |
PyTorch | OpenVINO-2019 | CPU |
TensorRT-6 | GPU |
Inference engine | hardware environment |
---|---|
TensorFlow Serving-1.14 | CPU/GPU |
TensorFlow Serving-2.2 | CPU/GPU |
OpenVINO-2019 | CPU |
TensorRT-6 | GPU |
TensorRT-7 | GPU |
TF Lite-2.1 | CPU(X86/ARM) |