Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/onnx/onnx-tensorrt
ONNX-TensorRT: TensorRT backend for ONNX
https://github.com/onnx/onnx-tensorrt
deep-learning nvidia onnx
Last synced: 1 day ago
JSON representation
ONNX-TensorRT: TensorRT backend for ONNX
- Host: GitHub
- URL: https://github.com/onnx/onnx-tensorrt
- Owner: onnx
- License: apache-2.0
- Created: 2018-04-30T18:11:09.000Z (over 6 years ago)
- Default Branch: 10.7-GA
- Last Pushed: 2024-12-03T23:02:05.000Z (about 1 month ago)
- Last Synced: 2024-12-25T20:04:45.427Z (8 days ago)
- Topics: deep-learning, nvidia, onnx
- Language: C++
- Size: 2.29 MB
- Stars: 2,977
- Watchers: 68
- Forks: 547
- Open Issues: 181
-
Metadata Files:
- Readme: README.md
- Contributing: docs/contributing.md
- License: LICENSE
Awesome Lists containing this project
README
# TensorRT Backend For ONNX
Parses ONNX models for execution with [TensorRT](https://developer.nvidia.com/tensorrt).
See also the [TensorRT documentation](https://docs.nvidia.com/deeplearning/tensorrt/).
For the list of recent changes, see the [changelog](docs/Changelog.md).
For a list of commonly seen issues and questions, see the [FAQ](docs/faq.md).
For business inquiries, please contact [email protected]
For press and other inquiries, please contact Hector Marinez at [email protected]
## Supported TensorRT Versions
Development on the this branch is for the latest version of [TensorRT 10.7](https://developer.nvidia.com/nvidia-tensorrt-download) with full-dimensions and dynamic shape support.
For previous versions of TensorRT, refer to their respective branches.
## Supported Operators
Current supported ONNX operators are found in the [operator support matrix](docs/operators.md).
# Installation
### Dependencies
- [Protobuf >= 3.0.x](https://github.com/google/protobuf/releases)
- [TensorRT 10.7](https://developer.nvidia.com/tensorrt)
- [TensorRT 10.7 open source libaries] (https://github.com/NVIDIA/TensorRT/)### Building
For building within docker, we recommend using and setting up the docker containers as instructed in the main [TensorRT repository](https://github.com/NVIDIA/TensorRT#setting-up-the-build-environment) to build the onnx-tensorrt library.
Once you have cloned the repository, you can build the parser libraries and executables by running:
cd onnx-tensorrt
mkdir build && cd build
cmake .. -DTENSORRT_ROOT= && make -j
# Ensure that you update your LD_LIBRARY_PATH to pick up the location of the newly built library:
export LD_LIBRARY_PATH=$PWD:$LD_LIBRARY_PATHNote that this project has a dependency on CUDA. By default the build will look in `/usr/local/cuda` for the CUDA toolkit installation. If your CUDA path is different, overwrite the default path by providing `-DCUDA_TOOLKIT_ROOT_DIR=` in the CMake command.
To build with `protobuf-lite` support, add `-DUSE_ONNX_LITE_PROTO=1` to the end of the `cmake` command.
### InstanceNormalizaiton Performance
There are two implementations of InstanceNormalization that may perform differently depending on various parameters. By default, the parser will use the native TensorRT implementation of InstanceNorm. Users that want to benchmark using the plugin implementation of InstanceNorm can unset the parser flag `kNATIVE_INSTANCENORM` prior to parsing the model. Note that the plugin implementation cannot be used for building version compatible or hardware compatible engines, and attempting to do so will result in an error.
C++ Example:
// Unset the kNATIVE_INSTANCENORM flag to use the plugin implementation.
parser->unsetFlag(nvonnxparser::OnnxParserFlag::kNATIVE_INSTANCENORM);Python Example:
// Unset the NATIVE_INSTANCENORM flag to use the plugin implementation.
parser.clear_flag(trt.OnnxParserFlag.NATIVE_INSTANCENORM)## Executable Usage
There are currently two officially supported tools for users to quickly check if an ONNX model can parse and build into a TensorRT engine from an ONNX file.
For C++ users, there is the [trtexec](https://github.com/NVIDIA/TensorRT/tree/main/samples/opensource/trtexec) binary that is typically found in the `/bin` directory. The basic command of running an ONNX model is:
`trtexec --onnx=model.onnx`
Refer to the link or run `trtexec -h` for more information on CLI options.
For Python users, there is the [polygraphy](https://github.com/NVIDIA/TensorRT/tree/main/tools/Polygraphy) tool. The basic command for running an onnx model is:
`polygraphy run model.onnx --trt`
Refer to the link or run `polygraphy run -h` for more information on CLI options.
### Python Modules
Python bindings for the ONNX-TensorRT parser are packaged in the shipped `.whl` files.
TensorRT 10.7 supports ONNX release 1.17.0. Install it with:
python3 -m pip install onnx==1.17.0
The ONNX-TensorRT backend can be installed by running:
python3 setup.py install
## ONNX-TensorRT Python Backend Usage
The TensorRT backend for ONNX can be used in Python as follows:
```python
import onnx
import onnx_tensorrt.backend as backend
import numpy as npmodel = onnx.load("/path/to/model.onnx")
engine = backend.prepare(model, device='CUDA:1')
input_data = np.random.random(size=(32, 3, 224, 224)).astype(np.float32)
output_data = engine.run(input_data)[0]
print(output_data)
print(output_data.shape)
```## C++ Library Usage
The model parser library, libnvonnxparser.so, has its C++ API declared in this header:
NvOnnxParser.h
### Tests
After installation (or inside the Docker container), ONNX backend tests can be run as follows:
Real model tests only:
python onnx_backend_test.py OnnxBackendRealModelTest
All tests:
python onnx_backend_test.py
You can use `-v` flag to make output more verbose.
## Pre-trained Models
Pre-trained models in ONNX format can be found at the [ONNX Model Zoo](https://github.com/onnx/models)