Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with int8
A curated list of projects in awesome lists tagged with int8.
https://github.com/intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
auto-tuning awq fp4 gptq int4 int8 knowledge-distillation large-language-models low-precision mxformat post-training-quantization pruning quantization quantization-aware-training smoothquant sparsegpt sparsity
Last synced: 01 Oct 2024
https://github.com/intel/neural-speed
An innovative library for efficient LLM inference via low-bit quantization
cpu fp4 fp8 gaudi2 gpu int4 int8 llamacpp llm-fine-tuning llm-inference low-bit mxformat nf4 sparsity
Last synced: 27 Sep 2024
https://github.com/Wulingtian/yolov5_tensorrt_int8_tools
TensorRT INT8 quantization of YOLOv5 ONNX models
Last synced: 02 Aug 2024
https://github.com/Wulingtian/yolov5_tensorrt_int8
TensorRT INT8 quantized deployment of the YOLOv5s model, measured at 3.3 ms per frame!
Last synced: 02 Aug 2024