Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/chadHGY/awesome-deep-model-compression
Awesome Deep Model Compression
https://github.com/chadHGY/awesome-deep-model-compression
List: awesome-deep-model-compression
awesome-list deep-learning model-compression model-distillation neural-network pruning python quantization
Last synced: 3 months ago
JSON representation
Awesome Deep Model Compression
- Host: GitHub
- URL: https://github.com/chadHGY/awesome-deep-model-compression
- Owner: chadHGY
- License: mit
- Created: 2021-04-13T02:17:34.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2021-04-13T13:01:57.000Z (almost 4 years ago)
- Last Synced: 2024-05-23T00:00:25.275Z (9 months ago)
- Topics: awesome-list, deep-learning, model-compression, model-distillation, neural-network, pruning, python, quantization
- Homepage:
- Size: 47.9 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- ultimate-awesome - awesome-deep-model-compression - Awesome Deep Model Compression. (Other Lists / Julia Lists)
README
# Awesome Deep Model Compression
[data:image/s3,"s3://crabby-images/b792e/b792e6bb3df0596d7093e10314eb14d650d66367" alt="Awesome"](https://awesome.re) [data:image/s3,"s3://crabby-images/c2bc5/c2bc5b56c55508717878ecffdecd0394577bb2c5" alt="hackmd-github-sync-badge"](https://hackmd.io/uDS93NOpStaNuKB2Y1KXLQ)A useful list of Deep Model Compression related research papers, articles, tutorials, libraries, tools and more.
Currently the Repos are additional given tags either [Pytorch/TF]. To quickly find hands-on Repos in your commonly used framework, please Ctrl+F to get start :smiley:## Contents
- [Papers](#papers)
- [General](#general)
- [Architecture](#architecture)
- [Quantization](#quantization)
- [Binarization](#binarization)
- [Pruning](#pruning)
- [Distillation](#distillation)
- [Low Rank Approximation](#low-rank-approximation)
- [Articles](#articles)
- [Blogs](#blogs)
- [Tools](#tools)
- [Libraries](#libraries)
- [Cross Platform](#cross-platform)
- [Hard-ware Integration](#hard-ware-integration)
---## Papers
### General### Architecture
### Quantization
### Binarization
### Pruning
- The Lottery Ticket Hypothesis | ICLR, 2019, Google | [Paper](https://openreview.net/pdf?id=rJl-b3RcF7) | [Code](https://github.com/google-research/lottery-ticket-hypothesis)
### Distillation### Low Rank Approximation
---
## Articles
### Blogs
- [Pruning deep neural networks to make them fast and small](https://jacobgil.github.io/deeplearning/pruning-deep-learning) [Pytorch] By using pruning a VGG-16 based Dogs-vs-Cats classifier is made x3 faster and x4 smaller.
- [All The Ways You Can Compress BERT](http://mitchgordon.me/machine/learning/2019/11/18/all-the-ways-to-compress-BERT.html) - An overview of different compression methods for large NLP models (BERT) based on different characteristics and compares their results.
- [Deep Learning Model Compression](https://rachitsingh.com/deep-learning-model-compression/) methods.
- [Do We Really Need Model Compression](http://mitchgordon.me/machine/learning/2020/01/13/do-we-really-need-model-compression.html) in the future?---
## Tools
### Libraries
- [torch.nn.utils.prune](https://pytorch.org/tutorials/intermediate/pruning_tutorial.html) [Pytorch]
Pytorch official supported sparsify neural networks and custom pruning technique.
- [Neural Network Intelligence](https://nni.readthedocs.io/en/v1.6/model_compression.html)[Pytorch/TF]
[data:image/s3,"s3://crabby-images/5292b/5292b4cb50a481f0d31c8fb0a741253eafcf4f0f" alt="Star on GitHub"](https://github.com/microsoft/nni)
There are some popular model compression algorithms built-in in NNI. Users could further use NNI’s auto tuning power to find the best compressed model, which is detailed in Auto Model Compression.
- [Condensa](https://github.com/NVlabs/condensa) [Pytorch]
[data:image/s3,"s3://crabby-images/eee80/eee80bcbf8d4ee61bf05ae05bf7b9d02396e3f1d" alt="Star on GitHub"](https://github.com/NVlabs/condensa)
A Programming System for Neural Network Compression. | [paper](https://arxiv.org/pdf/1911.02497.pdf)
- [IntelLabs distiller](https://github.com/IntelLabs/distiller) [Pytorch] [data:image/s3,"s3://crabby-images/66f92/66f926aff36d30161f0bb0ec03fe306f88418780" alt="Star on GitHub"](https://github.com/IntelLabs/distiller)
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. | [Documentation](https://intellabs.github.io/distiller/)
- [Torch-Pruning](https://github.com/VainF/Torch-Pruning)[Pytorch]
[data:image/s3,"s3://crabby-images/c3379/c337934920fc62462c188abe1bd95bb24fd3372d" alt="Star on GitHub"](https://github.com/VainF/Torch-Pruning)
A pytorch toolkit for structured neural network pruning and layer dependency.
- [CompressAI](https://github.com/InterDigitalInc/CompressAI) [Pytorch]
[data:image/s3,"s3://crabby-images/6a077/6a077c117992a86c2dffa840ace362e93dc684b2" alt="Star on GitHub"](https://github.com/InterDigitalInc/CompressAI)
A PyTorch library and evaluation platform for end-to-end compression research.
- [Model Compression](https://github.com/j-marple-dev/model_compression)[Pytorch]
[data:image/s3,"s3://crabby-images/592af/592afda10590a686b7c5620fab66d050918af291" alt="Star on GitHub"](https://github.com/j-marple-dev/model_compression)
A onestop pytorch model compression repo. | [Reposhub](https://reposhub.com/python/deep-learning/j-marple-dev-model_compression.html)
- [TensorFlow Model Optimization Toolkit](https://github.com/tensorflow/model-optimization) [TF]
[data:image/s3,"s3://crabby-images/7818f/7818f110b6b9ad75ff3e23070d2457f1a6efc4df" alt="Star on GitHub"](https://github.com/tensorflow/model-optimization)
Accompanied blog post, [TensorFlow Model Optimization Toolkit — Pruning API](https://medium.com/tensorflow/tensorflow-model-optimization-toolkit-pruning-api-42cac9157a6a?linkId=67380711)
- [XNNPACK](https://github.com/google/xnnpack)
[data:image/s3,"s3://crabby-images/ed2c8/ed2c8de0e154f5f844ad25af5480015a2b5bce9b" alt="Star on GitHub"](https://github.com/google/xnnpack)
XNNPACK is a highly optimized library of floating-point neural network inference operators for ARM, WebAssembly, and x86 (SSE2 level) platforms. It's a based on QNNPACK library. However, unlike QNNPACK, XNNPACK focuses entirely on floating-point operators.### Cross Platform
- [Loading a TorchScript Model in C++](https://pytorch.org/tutorials/advanced/cpp_export.html) [Pytorch x C++]
From an existing Python model to a serialized representation that can be loaded and executed purely from C++, with no dependency on Python.
- [Open Neural Network Exchange (ONNX)](https://github.com/onnx/tutorials)[Pytorch, TF, Keras...etc]
[data:image/s3,"s3://crabby-images/a93ff/a93ff4db5013ae2624ddfc206b7ae34567709fad" alt="Star on GitHub"](https://github.com/onnx/onnx)
An open standard format for representing machine learning models.### Hard-ware Integration
* [TensorRT (NVIDIA)](https://developer.nvidia.com/tensorrt) [Pytorch, TF,Keras ...etc]
![]()
* [torch2trt](https://github.com/NVIDIA-AI-IOT/torch2trt) [Pytorch] [data:image/s3,"s3://crabby-images/f13a5/f13a59cee401a2851d7adc539a42d130a2aedf4f" alt="Star on GitHub"](https://github.com/NVIDIA-AI-IOT/torch2trt)
An easy to use PyTorch to TensorRT converter
* [How to Convert a Model from PyTorch to TensorRT and Speed Up Inference](https://learnopencv.com/how-to-convert-a-model-from-pytorch-to-tensorrt-and-speed-up-inference/) [Pytorch]* [TVM (Apache)](https://github.com/apache/tvm) [[Pytorch](https://tvm.apache.org/docs/tutorials/frontend/from_pytorch.html), [TF](https://tvm.apache.org/docs/tutorials/frontend/from_tensorflow.html#sphx-glr-tutorials-frontend-from-tensorflow-py)] [data:image/s3,"s3://crabby-images/ceb19/ceb19caee96da7b85659a2132b209e117bbad7dc" alt="Star on GitHub"](https://github.com/apache/tvm)
Open deep learning compiler stack for cpu, gpu and specialized accelerators* [Pytorch Glow](https://github.com/pytorch/glow) [data:image/s3,"s3://crabby-images/dcdfa/dcdfaee421adba48054b5af0e12bd449e23495fc" alt="Star on GitHub"](https://github.com/pytorch/glow)
Glow is a machine learning compiler and execution engine for hardware accelerators. It is designed to be used as a backend for high-level machine learning frameworks. The compiler is designed to allow state of the art compiler optimizations and code generation of neural network graphs.* [CoreML (Apple)](https://github.com/apple/coremltools) [Pytorch,TF,Keras,SKLearn ...etc] [data:image/s3,"s3://crabby-images/ea06e/ea06e4d3fa8edd061ddcfeb2c6bda956d31c4f5f" alt="Star on GitHub"](https://github.com/apple/coremltools)
Core ML provides a unified representation for all models. Your app uses Core ML APIs and user data to make predictions, and to train or fine-tune models, all on the user’s device.
[Introduction](https://developer.apple.com/documentation/coreml)![]()
* [Tensorflow Lite (Google)](https://www.tensorflow.org/lite) [TF]
An open source deep learning framework for on-device inference.![]()
* [~~Intel Nervana Neon~~](https://github.com/NervanaSystems/neon) [Deprecated] [data:image/s3,"s3://crabby-images/c6e7d/c6e7dd0b04d3ae1c15e13bb4771c576e68b70c4e" alt="Star on GitHub"](https://github.com/NervanaSystems/neon)