Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with triton-inference-server
A curated list of projects in awesome lists tagged with triton-inference-server.
https://github.com/NVIDIA/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
gpu-acceleration large-language-models llm llm-inference microservice nemo rag retrieval-augmented-generation tensorrt triton-inference-server
Last synced: 31 Jul 2024
https://github.com/coincheung/bisenet
Add bisenetv2. My implementation of BiSeNet
ade20k bisenet cityscapes cocostuff ncnn openvino pytorch tensorrt triton-inference-server
Last synced: 30 Sep 2024
https://github.com/CoinCheung/BiSeNet
Add bisenetv2. My implementation of BiSeNet
ade20k bisenet cityscapes cocostuff ncnn openvino pytorch tensorrt triton-inference-server
Last synced: 31 Jul 2024
https://github.com/isarsoft/yolov4-triton-tensorrt
This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server
deep-learning docker object-detection tensorrt triton-inference-server yolov4 yolov4-tiny
Last synced: 02 Aug 2024
https://github.com/torchpipe/torchpipe
Serving Inside Pytorch With Multi-threads
deployment inference llm-serving pipeline-parallelism pytorch ray serve serving tensorrt torch2trt triton-inference-server
Last synced: 31 Jul 2024
https://github.com/NVIDIA-ISAAC-ROS/isaac_ros_dnn_inference
NVIDIA-accelerated DNN model inference ROS 2 packages using NVIDIA Triton/TensorRT for both Jetson and x86_64 with CUDA-capable GPU
ai deep-learning deeplearning dnn gpu jetson nvidia ros ros2 ros2-humble tao tensorrt tensorrt-inference triton triton-inference-server
Last synced: 31 Jul 2024
https://github.com/notai-tech/fastdeploy
Deploy DL/ML inference pipelines with minimal extra code.
deep-learning docker falcon gevent gunicorn http-server inference-server model-deployment model-serving python pytorch serving streaming-audio tensorflow-serving tf-serving torchserve triton triton-inference-server triton-server websocket
Last synced: 27 Sep 2024
https://github.com/chiehpower/Setup-deeplearning-tools
Set up CI in DL/ cuda/ cudnn/ TensorRT/ onnx2trt/ onnxruntime/ onnxsim/ Pytorch/ Triton-Inference-Server/ Bazel/ Tesseract/ PaddleOCR/ NVIDIA-docker/ minIO/ Supervisord on AGX or PC from scratch.
agx ci cuda cudnn deep-learning docker installation minio nvidia onnx-simplifier onnx2trt onnxruntime paddleocr pytorch supervisord tensorrt tensorrt-inference-server tesseract-ocr triton-inference-server triton-server
Last synced: 31 Jul 2024
https://github.com/eilliw/trash-classification-public
Custom Yolov8x-cls edge model deployment and training to classify trash vs recycling.
computer-vision image-classification machine-learning pytorch raspberry-pi-4 roboflow-dataset triton-inference-server yolov8
Last synced: 26 Sep 2024