Incubator Tvm Versions Save

Open deep learning compiler stack for cpu, gpu and specialized accelerators

v0.2

6 years ago

NOTE: This is a release pre apache incubation

This release comes with a complete set of TOPI support for NNVM compiler, which allows compilation of end to end workloads. We also make major improvements in supporting new backends: ROCm for AMDGPUs and ARM GPU. Check out previous blogs that describes these major improvements in detail!

Backend support
- Support LLVM mainline(4.0, 5.0, 6.0)
- Support ROCM stack for AMD GPUs
- More robust OpenCL support for ARM GPUs
Android RPC runtime
Multi-threading optimization for ARM
- multi-threaded depthwise
- multi-threaded conv2d
New schedule primitives
- storage_align for shared memory alignment
- double_buffer
UnrollLoop : more robust version of unroll loop, count maximum steps that can be unrolled.
Full set of TOPI operators
- Introduce tvm.target to specify target options for compilation better.
- broadcast/ reduction operators
- pooling and global pooling
- Generic target support for topi
- schedule with external libraries
End to end deep learning pipelines for CPU, GPU, ARM GPU
Tutorials
- How to load compiled module in any language runtime
- How to use java runtime
Contrib library: MIOpen, CuDNN
Ongoing items that contains functioning pieces
- WebGL backend
- C++ compiler support
- MPS DNN
- low bit support, introduced popcount

v0.1rc

6 years ago

NOTE: This is a release pre apache incubation

Language runtime
- python
- javascript
- java
- c++
Backend
- arm, x86
- javascript, wasm
- CUDA
- opencl
- Metal
DNN Library integration
RPC runtime
TOPI operator pipeline python
TOPI operator pipeline in C++
Rough perf of the TOPI GPU pipeline
Rough pref of TOPI CPU pipeline
End to end graph executors
Pack libary into shared library.
External function and contrib libraries
DLPack integration support
AOT and module system
Basic code structure ready.