An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with sparsity

A curated list of projects in awesome lists tagged with sparsity.

https://intel.github.io/neural-compressor/

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

auto-tuning awq fp4 gptq int4 int8 knowledge-distillation large-language-models low-precision mxformat post-training-quantization pruning quantization quantization-aware-training smoothquant sparsegpt sparsity

Last synced: 09 Dec 2025
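The low-bit quantization these entries describe boils down to mapping float weights onto a small integer grid. A minimal pure-Python sketch of symmetric per-tensor INT8 quantization, for illustration only (this is not Neural Compressor's API):

```python
def quantize_int8(weights):
    """Symmetric per-tensor INT8 quantization: pick one scale so the
    largest-magnitude weight maps to +/-127, then round each weight
    onto the integer grid."""
    scale = max(abs(w) for w in weights) / 127.0
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Map the INT8 values back to floats; the rounding error per
    element is bounded by scale / 2."""
    return [x * scale for x in q]

w = [0.5, -1.27, 0.02, 1.0]
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
print(q)      # [50, -127, 2, 100]
```

Real toolkits add per-channel scales, zero points for asymmetric ranges, and calibration over activation statistics, but the scale-and-round core is the same.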

https://github.com/intel/neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

auto-tuning awq fp4 gptq int4 int8 knowledge-distillation large-language-models low-precision mxformat post-training-quantization pruning quantization quantization-aware-training smoothquant sparsegpt sparsity

Last synced: 12 May 2025

https://github.com/pytorch/ao

PyTorch native quantization and sparsity for training and inference

brrr cuda dtypes float8 inference llama mx offloading optimizer pytorch quantization sparsity training transformer

Last synced: 12 May 2025
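The sparsity supported by PyTorch-native tooling is often the 2:4 semi-structured pattern, where exactly two of every four consecutive weights are zeroed so GPU sparse kernels can exploit the fixed layout. A pure-Python sketch of the pattern itself (not torchao's API):

```python
def prune_2_4(weights):
    """Apply a 2:4 semi-structured sparsity pattern: in every
    contiguous group of four weights, keep only the two with the
    largest magnitude and zero the rest."""
    assert len(weights) % 4 == 0
    pruned = []
    for i in range(0, len(weights), 4):
        group = weights[i:i + 4]
        # indices of the two largest-magnitude entries in this group
        keep = sorted(range(4), key=lambda j: abs(group[j]), reverse=True)[:2]
        pruned.extend(g if j in keep else 0.0 for j, g in enumerate(group))
    return pruned

w = [0.9, -0.1, 0.3, -1.2, 0.05, 0.6, -0.7, 0.02]
print(prune_2_4(w))  # [0.9, 0.0, 0.0, -1.2, 0.0, 0.6, -0.7, 0.0]
```

Hardware 2:4 kernels store only the nonzeros plus 2-bit position metadata per group; the sketch above only zeroes values to show which entries survive.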

https://github.com/paddlepaddle/paddleslim

PaddleSlim is an open-source library for deep model compression and architecture search.

bert compression detection distillation ernie nas pruning quantization segmentation sparsity tensorrt transformer yolov5 yolov6 yolov7

Last synced: 14 May 2025

https://github.com/tensorflow/model-optimization

A toolkit to optimize ML models for deployment with Keras and TensorFlow, including quantization and pruning.

compression deep-learning keras machine-learning ml model-compression optimization pruning quantization quantized-networks quantized-neural-networks quantized-training sparsity tensorflow

Last synced: 12 May 2025

https://github.com/vllm-project/llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

compression quantization sparsity

Last synced: 14 May 2025

https://github.com/Bobo-y/flexible-yolov5

A more readable and flexible YOLOv5 with additional backbones (GCN, ResNet, ShuffleNet, MobileNet, EfficientNet, HRNet, Swin Transformer, etc.), extra modules (CBAM, DCN, and so on), and TensorRT support.

backbone cbam dcnv2 gcn hrnet moblienet neck object-detection ptq pytorch qat resnet shufflenet sparsity swin-transformer tensorrt triton-server yolov3 yolov5

Last synced: 20 Apr 2025

https://github.com/fminference/h2o

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

gpt-3 heavy-hitters high-throughput kv-cache large-language-models sparsity

Last synced: 05 Apr 2025
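The heavy-hitter observation is that a small set of tokens accumulates most of the attention mass, so the KV cache can evict the rest and stay within a fixed budget. A simplified pure-Python sketch of score-based eviction (the `budget` and `recent` parameters are illustrative; this is not the paper's exact algorithm):

```python
def evict_kv_cache(acc_scores, budget, recent=2):
    """Heavy-hitter style eviction sketch: given accumulated attention
    mass per cached token, keep the `recent` most recent tokens plus
    the highest-scoring "heavy hitters", up to `budget` entries.
    Returns the indices of kept tokens, in order."""
    n = len(acc_scores)
    recent_idx = set(range(max(0, n - recent), n))
    candidates = [i for i in range(n) if i not in recent_idx]
    # rank the remaining tokens by accumulated attention score
    heavy = sorted(candidates, key=lambda i: acc_scores[i], reverse=True)
    keep = recent_idx | set(heavy[:max(0, budget - len(recent_idx))])
    return sorted(keep)

scores = [5.0, 0.2, 3.1, 0.1, 0.4, 0.9]   # accumulated attention per token
print(evict_kv_cache(scores, budget=4))    # [0, 2, 4, 5]
```

Tokens 0 and 2 survive as heavy hitters despite being old, which is the point: eviction follows attention mass, not recency alone.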

https://github.com/intel/neural-speed

An innovative library for efficient LLM inference via low-bit quantization

cpu fp4 fp8 gaudi2 gpu int1 int2 int3 int4 int5 int6 int7 int8 llamacpp llm-fine-tuning llm-inference low-bit mxformat nf4 sparsity

Last synced: 25 Oct 2025

https://github.com/jack-willturner/deep-compression

"Learning both Weights and Connections for Efficient Neural Networks" (https://arxiv.org/abs/1506.02626)

deep-learning pruning pytorch sparsity

Last synced: 03 Apr 2025

https://github.com/nvidia-ai-iot/clip-distillation

Zero-label image classification via OpenCLIP knowledge distillation

clip distillation inference jetson knowledge nvidia qat sparsity tensorrt

Last synced: 13 Oct 2025

https://github.com/opensparsellms/llama-moe-v2

🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

attention fine-tuning instruction-tuning llama llama3 mixture-of-experts moe sft sparsity

Last synced: 11 Aug 2025

https://github.com/satabios/sconce

E2E AutoML Model Compression Package

deployment pruning pytorch quantization sparsity torch

Last synced: 18 Feb 2026

https://github.com/adrhill/sparseconnectivitytracer.jl

Fast operator-overloading Jacobian & Hessian sparsity detection.

autodiff hessian jacobian julia sparsity

Last synced: 21 Apr 2025

https://github.com/vita-group/sparsity-win-robust-generalization

[ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun Wang*, Santosh Balachandra*, Haoyu Ma*, Zehao Wang, Zhangyang Wang

dynamic-sparse-training generalization lottery-ticket-hypothesis pruning robust-generalization robust-overfitting sparsity

Last synced: 29 Oct 2025

https://github.com/sebastianament/compressedsensing.jl

Contains a wide-ranging collection of compressed sensing and feature selection algorithms. Examples include matching pursuit algorithms, forward and backward stepwise regression, sparse Bayesian learning, and basis pursuit.

basis-pursuit compressed-sensing feature-selection julia matching-pursuit sparse-bayesian-learning sparse-linear-systems sparse-regression sparsity stepwise-regression subset-selection

Last synced: 30 Jul 2025
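Matching pursuit, one of the algorithm families named above, greedily expresses a signal with few dictionary atoms: pick the atom most correlated with the residual, subtract its contribution, repeat. A minimal pure-Python sketch assuming unit-norm atoms (illustrative only, not the package's implementation):

```python
def matching_pursuit(D, y, iters=10):
    """Plain matching pursuit: D is an m x n dictionary whose columns
    are unit-norm atoms, y is the m-vector to approximate. Returns a
    sparse coefficient vector built greedily from residual correlations."""
    m, n = len(D), len(D[0])
    coeffs = [0.0] * n
    r = list(y)  # residual starts as the signal itself
    for _ in range(iters):
        # correlation of each atom with the current residual
        corr = [sum(D[i][j] * r[i] for i in range(m)) for j in range(n)]
        j = max(range(n), key=lambda k: abs(corr[k]))
        if abs(corr[j]) < 1e-12:
            break  # residual is (numerically) orthogonal to every atom
        coeffs[j] += corr[j]
        for i in range(m):
            r[i] -= corr[j] * D[i][j]
    return coeffs

# three unit-norm atoms in R^2; y is exactly 2 * atom 0
D = [[1.0, 0.0, 0.6],
     [0.0, 1.0, 0.8]]
y = [2.0, 0.0]
print(matching_pursuit(D, y))  # [2.0, 0.0, 0.0]
```

Orthogonal matching pursuit, stepwise regression, and basis pursuit refine this greedy scheme in different ways; the residual-driven atom selection is the common core.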

https://github.com/astorfi/attention-guided-sparsity

Attention-Based Guided Structured Sparsity of Deep Neural Networks

attention-mechanism convolutional-neural-networks deep-learning sparsity

Last synced: 30 Apr 2025

https://github.com/vita-group/smc-bench

[ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen, Tianjin Huang, Ajay Kumar Jaiswal, Zhangyang Wang

benchmark deep-learning dynamic-sparse-training pruning sparse-neural-networks sparsity

Last synced: 19 Apr 2025

https://github.com/vita-group/tost

[ICML 2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, Ying Ding, and Zhangyang Wang

lottery-tickets sparse-training sparsity

Last synced: 19 Apr 2025

https://github.com/vita-group/backdoor-lth

[CVPR 2022] "Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free" by Tianlong Chen*, Zhenyu Zhang*, Yihua Zhang*, Shiyu Chang, Sijia Liu, and Zhangyang Wang

backdoor-attacks linear-mode-connectivity lottery-ticket-hypothesis reverse-engineering sparsity trojan

Last synced: 19 Apr 2025

https://github.com/wenbihan/strollr2d_icassp2017

Image denoising code using STROLLR learning; the MATLAB implementation of the ICASSP 2017 paper.

image-denoising joint-models lowrankdenoising self-similarity sparsity state-of-the-art transform-learning unsupervised-learning

Last synced: 25 Oct 2025

https://github.com/ryantd/veloce

WIP. Veloce is a low-code, Ray-based parallelization library for efficient, heterogeneous machine-learning computation.

data-parallelism deep-learning distributed distributed-computing heterogeneity model-parallelism parameter-server pytorch ray sparsity

Last synced: 10 Apr 2025

https://github.com/huangcongqing/model-compression-optimization

Model compression and optimization for deployment with PyTorch, including knowledge distillation, quantization, and pruning.

knowledge-distillation model-compression nas pruning pytorch quantization quantized-networks sparsity sparsity-optimization

Last synced: 05 May 2025

https://github.com/vita-group/linearity-grafting

[ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu Chang, Sijia Liu, Pin-Yu Chen, Zhangyang Wang

certifiable-robustness certification linearity linearity-grafting neuron-pruning sparsity

Last synced: 11 Sep 2025

https://github.com/vita-group/double-win-lth

[ICML 2022] "Data-Efficient Double-Win Lottery Tickets from Robust Pre-training" by Tianlong Chen, Zhenyu Zhang, Sijia Liu, Yang Zhang, Shiyu Chang, Zhangyang Wang

adversarial-robustness data-efficient generalization lottery-ticket-hypothesis pretraining robust-pretraining sparsity transfer-learning

Last synced: 19 Apr 2025

https://github.com/zib-iol/sms

Code to reproduce the experiments of the ICLR 2024 paper "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"

averaging deep-learning neural-network optimization pruning pytorch sparsity

Last synced: 15 Apr 2025
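Model soups average the weights of several fine-tuned models; the sparse variant works because models that share one pruning mask stay sparse after averaging (zeros average to zero). A toy sketch of just the averaging step (not the paper's full prune-retrain-average pipeline):

```python
def soup_sparse_models(models):
    """Elementwise average of several weight vectors. When all models
    were pruned with the same mask, positions that are zero in every
    model remain zero in the soup, so sparsity is preserved."""
    n = len(models)
    return [sum(ws) / n for ws in zip(*models)]

# two models retrained from the same pruned checkpoint (same mask)
a = [0.0, 1.0, 0.0, 2.0]
b = [0.0, 3.0, 0.0, 4.0]
print(soup_sparse_models([a, b]))  # [0.0, 2.0, 0.0, 3.0]
```

Averaging models with different masks would densify the result, which is why the recipe averages models branched from a common pruned checkpoint.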

https://github.com/zib-iol/bimp

Code to reproduce the experiments of the ICLR 2023 paper "How I Learned to Stop Worrying and Love Retraining"

deep-learning learning-rate-scheduling neural-network optimization pruning pytorch sparsity

Last synced: 07 May 2025

https://github.com/vita-group/dataefficientlth

[NeurIPS 2022] "Sparse Winning Tickets are Data-Efficient Image Recognizers" by Mukund Varma T, Xuxi Chen, Zhenyu Zhang, Tianlong Chen, Subhashini Venugopalan, Zhangyang Wang

data-efficient-learning sparsity

Last synced: 29 Jul 2025

https://github.com/cypriengille/supervised-autoencoder

A supervised autoencoder with structured sparsity for efficient and informed clinical prognosis.

autoencoder feature-selection interpretability metabolomics metabolomics-database prognostic-score sparsity supervised-learning

Last synced: 12 Apr 2025

https://github.com/deeplite/activ-sparse

Official PyTorch training code for "Accelerating Deep Neural Networks via Semi-Structured Activation Sparsity" (ICCV 2023 RCV).

deep-neural-networks efficient-deep-learning efficient-inference low-latency raspberry-pi sparsity tinyml

Last synced: 09 Jul 2025

https://github.com/mmxgn/smooth-convex-kl-nmf

Repository holding various implementation of specific NMF methods for speaker diarization

nmf nonnegative-matrix-factorization smoothness sparsity speaker-diarization

Last synced: 06 Apr 2025

https://github.com/vita-group/quantumsea

[QCE 2023]"QuantumSEA: In-Time Sparse Exploration for Noise Adaptive Quantum Circuits" Tianlong Chen, Zhenyu Zhang, Hanrui Wang, Jiaqi Gu, Zirui Li, David Z Pan, Frederic T Chong, Song Han, Zhangyang Wang

quantum-chemistry quantum-computing sparsity vqe

Last synced: 29 Jul 2025

https://github.com/owensgroup/sparsify.me

A simple C++14 and CUDA-based header-only library with tools for sparse machine learning.

deep-neural-networks deeplearning sparse-matrix sparsification sparsifying-transform sparsity

Last synced: 15 Jun 2025

https://github.com/sparsity-xyz/nova-app-template

A Nova app template for developers to start from.

aws enclave enclaver nitro nova sparsity tee

Last synced: 08 Apr 2026

https://github.com/sparsity-xyz/sparsity-nova-examples

Reference apps for building verifiable AWS Nitro Enclave applications on Sparsity Nova

aws computation computing enclave enclaver nitro nova sparsity tee trust trustless verifiable

Last synced: 08 Apr 2026

https://github.com/paigejo/lk-inla

This repository is out of date. See instead: https://github.com/paigejo/ELK

computational-statistics gaussian-processes inla non-gaussian sparsity spatial-data-analysis spatial-statistics

Last synced: 17 Apr 2026

https://github.com/niteshchawla/movie-recommender-system

A recommender system that provides personalized movie recommendations based on ratings from a user and other similar users, improving the user experience.

collaborative-filtering correlation-matrix cosine-similarity exploratory-data-analysis feature-engineering knearest-neighbor-algorithm mape matrix-factorization pca-analysis pearson-correlation recommender-system rmse sparsity tsne-visualization visualization

Last synced: 08 Apr 2025

https://github.com/zib-iol/perp

Code to reproduce the experiments of the paper "PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs"

deep-learning efficiency finetuning llms neural-network pruning pytorch sparsity

Last synced: 29 Apr 2026

https://github.com/lcwllmr/poptools

Toolbox for polynomial optimization research

moment-sos semidefinite-programming sparsity symmetry-reduction

Last synced: 25 Jun 2025

https://github.com/sid3503/sparse-attention

PyTorch-style strided sparse attention with configurable strides, local+global token support, and memory-efficient masking.

llm sparsity text-processing

Last synced: 31 Jan 2026
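Strided sparse attention replaces the dense causal mask with a local window plus every stride-th earlier token, so each query attends to far fewer positions than full attention. A pure-Python sketch of the mask pattern (the `window` and `stride` parameters are illustrative, not this repo's API):

```python
def strided_sparse_mask(n, window=2, stride=3):
    """Build an n x n causal sparse-attention mask: query i may attend
    to position j <= i if j is within the last `window` positions
    (local attention) or j is a multiple of `stride` (strided/global
    attention). True means 'may attend'."""
    mask = [[False] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            local = i - j < window
            strided = j % stride == 0
            mask[i][j] = local or strided
    return mask

# render the pattern: '#' = attended, '.' = masked out
for row in strided_sparse_mask(6):
    print(''.join('#' if v else '.' for v in row))
```

Each row stays O(window + n/stride) dense instead of O(n), which is what makes the memory-efficient masking worthwhile at long sequence lengths.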