NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Key Features and Updates:
protobuf-lite.
BatchNormalization nodes.
12.4.0.
Key Features and Updates:
TensorRT OSS release corresponding to TensorRT 9.3.0.1 release.
Updates since TensorRT 9.2.0 release.
Key Features and Updates:
TensorRT OSS release corresponding to TensorRT 9.2.0.5 release.
Updates since TensorRT 9.1.0 release.
Key Features and Updates:
trtexec enhancement: Added --weightless flag to mark the engine as weightless.
bertQKVToContextPlugin.
TensorRT OSS release corresponding to TensorRT 9.1.0.4 GA release.
Updates since TensorRT 8.6.1 GA release.
Key Features and Updates:
Full Changelog: https://github.com/NVIDIA/TensorRT/compare/v8.6.1...23.08
TensorRT OSS release corresponding to TensorRT 8.6.1.6 GA release.
Key Features and Updates:
--use-cuda-graph to demoDiffusion to improve performance.
TensorRT OSS release corresponding to TensorRT 8.6.0.12 EA release.
Key Features and Updates:
We needed to force-push the main and release/8.6 branches and the v8.6.0 release. If you cloned or pulled the repo recently, please rebase the affected branches. Our apologies for the inconvenience.
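One way to bring a local clone back in line after a force-push is sketched below. This is a minimal illustration, not an official procedure from the release notes: the helper name resync_branch, the backup-<branch> naming, and the default remote name origin are all assumptions. It handles the simple case where you have no local commits worth keeping on the branch and resets the branch to the rewritten remote history, keeping the old tip in a backup branch.

```shell
# Hypothetical helper (not part of the TensorRT repo): re-sync a local branch
# after its remote counterpart was force-pushed.
# WARNING: the branch is hard-reset to the remote state. The previous tip is
# kept in "backup-<branch>" so nothing is lost outright.
resync_branch() {
    branch="$1"
    remote="${2:-origin}"   # assumed default remote name

    # Download the rewritten history from the remote.
    git fetch "$remote" || return 1

    # Switch to the affected branch.
    git checkout "$branch" || return 1

    # Keep a pointer to the old local tip before discarding it.
    git branch -f "backup-$branch" "$branch" || return 1

    # Point the local branch at the rewritten remote branch.
    git reset --hard "$remote/$branch"
}
```

For example, `resync_branch main` followed by `resync_branch release/8.6` would cover the two branches named above. If you do have local commits on an affected branch, replaying just those commits onto the new history with `git rebase --onto origin/main <old-upstream-commit>` is likely the better fit, where `<old-upstream-commit>` is the last commit of the old remote history that your work was based on.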
TensorRT OSS release corresponding to TensorRT 8.5.3.1 GA release.
Key Features and Updates:
TensorRT OSS release corresponding to TensorRT 8.5.2.2 GA release.
Updates since TensorRT 8.5.1 GA release. Please refer to the TensorRT 8.5.2 GA release notes for more information.
Key Features and Updates: