An open API service indexing awesome lists of open source software.

https://github.com/umitkacar/onnx-tensorrt-optimization

40x faster AI inference: ONNX to TensorRT optimization with FP16/INT8 quantization, multi-GPU support, and deployment
https://github.com/umitkacar/onnx-tensorrt-optimization

cuda deep-learning edge-computing fp16 gpu-acceleration inference-acceleration int8 latency-optimization mlops model-deployment model-optimization nvidia-gpu onnx onnxruntime production-ai pytorch-to-onnx quantization real-time-inference tensorflow-to-onnx tensorrt

Last synced: 22 days ago
JSON representation

40x faster AI inference: ONNX to TensorRT optimization with FP16/INT8 quantization, multi-GPU support, and deployment

Awesome Lists containing this project