This OSS release supports TensorRT-RTX 1.3, and contains sample code to showcase its capabilities and recommended usage.
Details are available in the release notes. Notable features in this release include:
- Enabled thread-safe execution for multiple GPUs with different compute capabilities, up to one network per thread.
- Performance has been improved for LLMs and convolution-based models.
- Supports CUDA contexts created in NVIDIA CUDA graphics mode on NVIDIA Blackwell devices.
- Performance has been improved for many FP8 models on Blackwell.
- Performance has been improved for many 2D convolutions.