Projects in Awesome Lists by back2matching
A curated list of projects in awesome lists by back2matching .
https://github.com/back2matching/turboquant
First open-source TurboQuant KV cache compression for LLM inference. Drop-in for HuggingFace. pip install turboquant.
compression gpu huggingface inference kv-cache llm machine-learning pytorch quantization transformers turboquant vram
Last synced: 30 Apr 2026