"quantization" Awesome Lists
Awesome-Model-Quantization
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. PRs adding works (papers, repositories) that the repo is missing are welcome.
awesome binarized-neural-networks binary-network deep-learning efficient-deep-learning lightweight-neural-network model-acceleration model-compression model-quantization quantization
2,323 stars
231 forks
530 projects
Last updated: 15 Feb 2026
awesome-AutoML-and-Lightweight-Models
A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hyperparameter Optimization, 5.) Automated Feature Engineering.
architecture-search automated-feature-engineering automl awesome-list hyperparameter-optimization meta-learning model-acceleration model-compression nas neural-architecture-search
857 stars
158 forks
150 projects
Last updated: 06 Feb 2026
awesome-emdl
Embedded and mobile deep learning research resources
deep-learning deep-neural-networks efficient-neural-networks embedded-ai inference mobile-ai mobile-deep-learning mobile-inference neural-network-compression pruning
762 stars
165 forks
147 projects
Last updated: 07 Feb 2026
awesome-ml-model-compression
Awesome machine learning model compression research papers, quantization, tools, and learning material.
awesome-list machine-learning model-compression neural-networks pruning quantization
540 stars
61 forks
116 projects
Last updated: 25 Jan 2026
awesome-ai-infrastructures
Infrastructures™ for Machine Learning Training/Inference in Production.
apache-arrow apache-mesos apache-spark artificial-intelligence awesome-list deep-learning deep-learning-framework federated-learning knowledge-distillation kubernetes
439 stars
78 forks
31 projects
Last updated: 31 Jan 2026
awesome-edge-machine-learning
A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and others.
auto-ml awesome-list edge edge-deep-learning edge-machine-learning efficient-architectures embedded-machine-learning federated-learning iot mobile-machine-learning
272 stars
53 forks
44 projects
Last updated: 08 Feb 2026
awesome-quantization-and-fixed-point-training
Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design
fixed-point network-compression quantization
161 stars
22 forks
114 projects
Last updated: 22 Jan 2026
awesome-local-ai
152 open-source tools to run LLMs 100% locally – no cloud, no API keys, no censorship
awesome-list crewai exllama fine-tuning inference llama-cpp local-ai local-llm machine-learning-ai multi-modal
35 stars
1 fork
133 projects
Last updated: 10 Feb 2026
awesome-tinyml
TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT devices.
arduino cortex-m edge-ai edge-computing embedded-ml embedded-systems esp32 iot knowledge-distillation microcontroller
32 stars
1 fork
227 projects
Last updated: 01 Feb 2026
awesome-approximate-dnn
Curated content for DNN approximation, acceleration ... with a focus on hardware accelerator and deployment
approximate-computing curated-list deep-learning deep-neural-networks edge-computing pruning quantization
27 stars
6 forks
78 projects
Last updated: 04 Jul 2025
awesome-ncnn
NCNN Framework: High-performance neural network inference for mobile, embedded, and edge AI deployment.
android-ndk arm edge-ai embedded-systems gpu-computing mobile-inference model-optimization ncnn neon neural-network-inference
10 stars
1 fork
124 projects
Last updated: 05 Feb 2026
awesome-mobile-ai
Mobile AI: iOS CoreML, Android TFLite, on-device inference, ONNX, TensorRT, and ML deployment for smartphones.
android-ml coreml edge-computing ios-ml mlkit mnn mobile-ai mobile-inference model-optimization ncnn
4 stars
0 forks
95 projects
Last updated: 24 Jan 2026
awesome-deep-model-compression
Awesome Deep Model Compression
awesome-list deep-learning model-compression model-distillation neural-network pruning python quantization
2 stars
0 forks
37 projects
Last updated: 10 Feb 2023
awesome-llm
Large Language Models (LLMs): GPT, Claude, Llama, Gemini, fine-tuning, RAG, prompt engineering, and AI agents for GenAI apps.
ai-agents anthropic chatgpt claude fine-tuning gemini generative-ai gpt instruction-tuning langchain
2 stars
1 fork
65 projects
Last updated: 12 Nov 2025