Projects in Awesome Lists tagged with token-pruning
A curated list of projects in awesome lists tagged with token-pruning.
https://github.com/microsoft/moonlit
This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.
inference-efficiency model-compression neural-architecture-search token-pruning
Last synced: 07 Apr 2025
https://github.com/adam-mazur/lazy-llama
An implementation of LazyLLM token pruning for the LLaMa 2 model family.
huggingface huggingface-transformers llama llama2 token-pruning transformers
Last synced: 15 Apr 2025