AMD's graph optimization engine.
Improvements include:
Add GatherND operator (#1089)
Add lane reduction (#1180)
Expose get_queue method for context in API (#1161)
ReverseSequence op (#1177)
Refactor Pooling and implement ONNX LpPool and GlobalLpPool (#1152)
Reduce with runtime compilation (#1150)
Half2 overloads (#1157)
Fix file download for resnet50 example (#1164)
Fix problem with incomplete types with older clang versions (#1174)
Fix out-of-bounds access when generate uses nonpacked tensors (#1160)
Parallelize the ref implementation of the gemm operator (#1142)
Refactor scatter operator to include reduction (#1124)
Fix a bug in creating tensor_view with vec data type (#1155)
Fix comparisons in migraphx::value class (#1146)
Python binding for manual graph building (#1143)
Identical to 5.1.1
ABI version bumped to 2.1
Added ONNX operators (HardSigmoid, SoftPlus, SoftSign, GreaterOrEqual, HardSwish, Mean)
Changed MessagePack file extension to mxr
Updated examples to include cppcheck
Pointwise operator improvements, including Clip, auto-vectorization, and type enforcement
Added assign_to method to the C++ API
Support for nonstandard shapes in the Squeeze and Unsqueeze operators
Improved examples and documentation
CI and build improvements
No changes since rocm-5.0.0
No changes since rocm-5.0.0
Support for ROCm Release 5.0.0
Support for ROCm Release 4.5.2
Release for ROCm 4.5.0
Release for ROCm 4.2.0