Projects in Awesome Lists tagged with efficient-model
A curated list of projects in awesome lists tagged with efficient-model .
https://github.com/mit-han-lab/temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
acceleration efficient-model low-latency nvidia-jetson-nano temporal-modeling tsm video-understanding
Last synced: 15 May 2025
https://github.com/mit-han-lab/once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
acceleration automl edge-ai efficient-model nas tinyml
Last synced: 13 May 2025
https://github.com/mit-han-lab/proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
acceleration automl efficient-model hardware-aware on-device-ai specialization
Last synced: 15 May 2025
https://github.com/MIT-HAN-LAB/ProxylessNAS
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
acceleration automl efficient-model hardware-aware on-device-ai specialization
Last synced: 23 Mar 2025
https://github.com/mit-han-lab/ProxylessNAS
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
acceleration automl efficient-model hardware-aware on-device-ai specialization
Last synced: 29 Mar 2025
https://github.com/mit-han-lab/amc
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
automl automl-for-compression channel-pruning efficient-model model-compression on-device-ai
Last synced: 13 May 2025
https://github.com/mit-han-lab/haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
automl efficient-model mixed-precision quantization
Last synced: 13 May 2025
https://github.com/microsoft/nn-Meter
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
deep-learning deep-neural-networks edge-ai edge-computing efficient-model inference latency machine-learning neural-architecture-search onnx-models python pytorch tensorflow-models
Last synced: 20 Nov 2025
https://github.com/microsoft/nn-meter
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
deep-learning deep-neural-networks edge-ai edge-computing efficient-model inference latency machine-learning neural-architecture-search onnx-models python pytorch tensorflow-models
Last synced: 12 Apr 2025
https://github.com/SqueezeAILab/KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
compression efficient-inference efficient-model large-language-models llama llm localllama localllm mistral model-compression natural-language-processing quantization small-models text-generation transformer
Last synced: 08 May 2025
https://github.com/mit-han-lab/hardware-aware-transformers
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
efficient-model hardware-aware machine-translation natural-language-processing specialization transformer
Last synced: 13 May 2025
https://github.com/squeezeailab/kvquant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
compression efficient-inference efficient-model large-language-models llama llm localllama localllm mistral model-compression natural-language-processing quantization small-models text-generation transformer
Last synced: 07 Apr 2025
https://github.com/kssteven418/i-bert
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
bert efficient-model efficient-neural-networks model-compression natural-language-processing quantization transformer
Last synced: 06 Apr 2025
https://github.com/mit-han-lab/amc-models
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
automl efficient-model model-compression on-device-ai
Last synced: 13 May 2025
https://github.com/d-li14/hbonet
[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2
efficient-model iccv2019 imagenet mobilenetv2 pretrained-models pytorch
Last synced: 01 Sep 2025
https://github.com/kssteven418/ltp
[KDD'22] Learned Token Pruning for Transformers
bert efficient-model efficient-neural-networks model-compression natural-language-processing pruning transformer
Last synced: 03 Sep 2025
https://github.com/shi-labs/any-precision-dnns
Any-Precision Deep Neural Networks (AAAI 2021)
any-precision efficient-model on-demand
Last synced: 13 Apr 2025
https://github.com/mit-han-lab/neurips-micronet
[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion
efficient-model knowledge-distillation language-modeling natural-language-processing pruning quantization
Last synced: 07 Jul 2025
https://github.com/kssteven418/q-asr
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition
automatic-speech-recognition deep-learning efficient-model efficient-neural-networks jasper model-compression quantization quartznet speech speech-recognition
Last synced: 31 Jul 2025
https://github.com/hvision-nku/offseg
[ICCV 2025] Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
efficient-model feature-alignment offset-learning semantic-segmentation
Last synced: 07 Nov 2025
https://github.com/lironui/ABCNet
The semantic segmentation of remote sensing images
efficient-model isprs potsdam real-time remote-sensing segmentation semantic-segmentation uav uavid vaihingen
Last synced: 22 Apr 2025
https://github.com/snigdho8869/deepdream-styletransfer
Explore image transformations with DeepDream Algorithm and Neural Style Transfer in creative image processing.
artificial-intelligence deep-learning deep-neural-networks deepdream deepdream-model deepdreamgenerator deeplearning efficient-model efficient-neural-networks efficientnet flask image-manipulation image-processing neural-networks-visualization neural-style neural-style-transfer neural-style-transfer-tensorflow style-transfer style-transfer-algorithms web-development
Last synced: 15 Apr 2025