Projects in Awesome Lists tagged with neural-compressor
A curated list of projects in awesome lists tagged with neural-compressor .
https://github.com/intel/auto-round
Advanced Quantization Algorithm for LLMs/VLMs.
awq gptq int4 neural-compressor quantization rounding
Last synced: 19 Apr 2025
https://github.com/huggingface/optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
benchmark neural-compressor onnxruntime openvino pytorch tensorrt-llm text-generation-inference
Last synced: 04 Dec 2024