Projects in Awesome Lists tagged with fp16
A curated list of projects in awesome lists tagged with fp16 .
https://github.com/SthPhoenix/InsightFace-REST
InsightFace REST API for easy deployment of face recognition services with TensorRT in Docker.
adaface arcface centerface docker face-detection face-recognition fastapi fp16 gpu insightface mask-detection onnx retinaface scrfd tensorrt tensorrt-conversion yolov5-face
Last synced: 11 Apr 2025
https://github.com/Maratyszcza/FP16
Conversion to/from half-precision floating point formats
floating-point fp16 half-precision
Last synced: 21 Apr 2025
https://github.com/maratyszcza/fp16
Conversion to/from half-precision floating point formats
floating-point fp16 half-precision
Last synced: 05 Apr 2025
https://github.com/kamalkraj/stable-diffusion-tritonserver
Deploy stable diffusion model with onnx/tenorrt + tritonserver
deploy docker fp16 inference machine-learning nvidia onnx python3 pytorch stablediffusion tensorrt tensorrt-inference transformers triton-inference-server
Last synced: 12 Apr 2025
https://github.com/petamoriken/float16
Stage 3 IEEE 754 half-precision floating-point ponyfill
binary16 float16 float16array fp16 half-precision ieee754 javascript typescript
Last synced: 21 Feb 2026
https://github.com/afterdusk/flop
IEEE 754-style floating-point converter
bfloat16 floating-point floating-point-conversion fp16 ieee-754 tensorfloat
Last synced: 08 May 2025
https://github.com/kentaroy47/pytorch-cifar10-fp16
Let's train CIFAR 10 Pytorch with Half-Precision!
cifar10 cnn fp16 mixed-precision mixed-precision-training pytorch training
Last synced: 02 Jul 2025
https://github.com/zerfoo/zerfoo
Pure Go machine learning framework. Train, run, and serve ML models with go build. Zero CGo.
autodiff deep-learning distributed-training float16 float8 fp16 fp8 go golang graph-ml machine-learning ml-framework neural-network onnx transformer
Last synced: 13 Jun 2026
https://github.com/umitkacar/onnx-tensorrt-optimization
40x faster AI inference: ONNX to TensorRT optimization with FP16/INT8 quantization, multi-GPU support, and deployment
cuda deep-learning edge-computing fp16 gpu-acceleration inference-acceleration int8 latency-optimization mlops model-deployment model-optimization nvidia-gpu onnx onnxruntime production-ai pytorch-to-onnx quantization real-time-inference tensorflow-to-onnx tensorrt
Last synced: 18 Feb 2026
https://github.com/johnclaw/llama-3.2-1b.vb
one-file llama 3.2 1b fp16 cpu inference in pure vb.net
basic-programming cpu-inference fp16 inference inference-engine llama llama3 llama3-2 llm llm-inference llm-serving llms vb-net vbnet visual-basic-dot-net visual-basic-net
Last synced: 01 May 2026
https://github.com/loveboyme/yolov5-tensorrt-accelerator
基于TensorRT加速的YOLOv5高性能推理框架 | High-performance YOLOv5 inference framework accelerated by TensorRT with dynamic optimization
cuda dynamic-shapes-cuda-stream fp16 int8 pycuda tensorrt yolov5
Last synced: 29 Mar 2025
https://github.com/obsidianplusplus/yolov5-tensorrt-accelerator
基于TensorRT加速的YOLOv5高性能推理框架 | High-performance YOLOv5 inference framework accelerated by TensorRT with dynamic optimization
cuda dynamic-shapes-cuda-stream fp16 int8 pycuda tensorrt yolov5
Last synced: 28 Apr 2026