Projects in Awesome Lists tagged with inference-efficiency
A curated list of projects in awesome lists tagged with inference-efficiency .
https://github.com/microsoft/moonlit
This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.
inference-efficiency model-compression neural-architecture-search token-pruning
Last synced: 07 Apr 2025