Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with sparsity
A curated list of projects in awesome lists tagged with sparsity .
https://github.com/intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
auto-tuning awq fp4 gptq int4 int8 knowledge-distillation large-language-models low-precision mxformat post-training-quantization pruning quantization quantization-aware-training smoothquant sparsegpt sparsity
Last synced: 17 Dec 2024
https://github.com/neuralmagic/sparseml
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
automl computer-vision-algorithms deep-learning-algorithms deep-learning-library deep-learning-models image-classification keras nlp object-detection onnx pruning pruning-algorithms pytorch smaller-models sparsification sparsification-recipes sparsity tensorflow transfer-learning
Last synced: 17 Dec 2024
https://github.com/paddlepaddle/paddleslim
PaddleSlim is an open-source library for deep model compression and architecture search.
bert compression detection distillation ernie nas pruning quantization segmentation sparsity tensorrt transformer yolov5 yolov6 yolov7
Last synced: 19 Dec 2024
https://github.com/PaddlePaddle/PaddleSlim
PaddleSlim is an open-source library for deep model compression and architecture search.
bert compression detection distillation ernie nas pruning quantization segmentation sparsity tensorrt transformer yolov5 yolov6 yolov7
Last synced: 28 Oct 2024
https://github.com/pytorch/ao
PyTorch native quantization and sparsity for training and inference
brrr cuda dtypes float8 inference llama mx offloading optimizer pytorch quantization sparsity training transformer
Last synced: 21 Dec 2024
https://github.com/tensorflow/model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
compression deep-learning keras machine-learning ml model-compression optimization pruning quantization quantized-networks quantized-neural-networks quantized-training sparsity tensorflow
Last synced: 17 Dec 2024
https://github.com/eric-mingjie/network-slimming
Network Slimming (Pytorch) (ICCV 2017)
channel-pruning convolutional-neural-networks deep-learning pytorch sparsity
Last synced: 20 Dec 2024
https://github.com/Bobo-y/flexible-yolov5
More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam,dcn and so on), and tensorrt
backbone cbam dcnv2 gcn hrnet moblienet neck object-detection ptq pytorch qat resnet shufflenet sparsity swin-transformer tensorrt triton-server yolov3 yolov5
Last synced: 09 Nov 2024
https://github.com/fminference/h2o
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
gpt-3 heavy-hitters high-throughput kv-cache large-language-models sparsity
Last synced: 15 Dec 2024
https://github.com/FMInference/H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
gpt-3 heavy-hitters high-throughput kv-cache large-language-models sparsity
Last synced: 16 Nov 2024
https://github.com/wenwei202/caffe
Caffe for Sparse and Low-rank Deep Neural Networks
acceleration caffe compression deep-neural-networks low-rank-approximation sparse-convolution sparsity
Last synced: 26 Oct 2024
https://github.com/bwohlberg/sporco
Sparse Optimisation Research Code
admm convolutional-dictionary-learning convolutional-sparse-coding cuda dictionary-learning fista optimization optimization-algorithms plug-and-play-priors python robust-pca sparse-coding sparse-representations sparsity total-variation total-variation-minimization
Last synced: 16 Dec 2024
https://github.com/intel/neural-speed
An innovative library for efficient LLM inference via low-bit quantization
cpu fp4 fp8 gaudi2 gpu int4 int8 llamacpp llm-fine-tuning llm-inference low-bit mxformat nf4 sparsity
Last synced: 10 Oct 2024
https://github.com/opensparsellms/llama-moe-v2
🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
attention fine-tuning instruction-tuning llama llama3 mixture-of-experts moe sft sparsity
Last synced: 22 Dec 2024
https://github.com/vita-group/sparsity-win-robust-generalization
[ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun Wang*, Santosh Balachandra*, Haoyu Ma*, Zehao Wang, Zhangyang Wang
dynamic-sparse-training generalization lottery-ticket-hypothesis pruning robust-generalization robust-overfitting sparsity
Last synced: 16 Nov 2024
https://github.com/sebastianament/compressedsensing.jl
Contains a wide-ranging collection of compressed sensing and feature selection algorithms. Examples include matching pursuit algorithms, forward and backward stepwise regression, sparse Bayesian learning, and basis pursuit.
basis-pursuit compressed-sensing feature-selection julia matching-pursuit sparse-bayesian-learning sparse-linear-systems sparse-regression sparsity stepwise-regression subset-selection
Last synced: 12 Oct 2024
https://github.com/astorfi/attention-guided-sparsity
Attention-Based Guided Structured Sparsity of Deep Neural Networks
attention-mechanism convolutional-neural-networks deep-learning sparsity
Last synced: 22 Oct 2024
https://github.com/wenbihan/strollr2d_icassp2017
Image Denoising Codes using STROLLR learning, the Matlab implementation of the paper in ICASSP2017
image-denoising joint-models lowrankdenoising self-similarity sparsity state-of-the-art transform-learning unsupervised-learning
Last synced: 10 Nov 2024
https://github.com/vita-group/backdoor-lth
[CVPR 2022] "Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free" by Tianlong Chen*, Zhenyu Zhang*, Yihua Zhang*, Shiyu Chang, Sijia Liu, and Zhangyang Wang
backdoor-attacks linear-mode-connectivity lottery-ticket-hypothesis reverse-engineering sparsity trojan
Last synced: 16 Nov 2024
https://github.com/openmendel/mendeliht.jl
Iterative hard thresholding for l0 penalized regression
genetic-algorithms glm gwas julia model-selection multivariate multivariate-regression regression sparse-linear-solver sparsity
Last synced: 25 Nov 2024
https://github.com/vita-group/smc-bench
[ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen, Tianjin Huang, AJAY KUMAR JAISWAL, Zhangyang Wang
benchmark deep-learning dynamic-sparse-training pruning sparse-neural-networks sparsity
Last synced: 16 Nov 2024
https://github.com/vita-group/tost
[ICML2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, ying Ding, and Zhangyang Wang
lottery-tickets sparse-training sparsity
Last synced: 16 Nov 2024
https://github.com/ryantd/veloce
WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.
data-parallelism deep-learning distributed distributed-computing heterogeneity model-parallelism parameter-server pytorch ray sparsity
Last synced: 14 Nov 2024
https://github.com/huangcongqing/model-compression-optimization
model compression and optimization for deployment for Pytorch, including knowledge distillation, quantization and pruning.(知识蒸馏,量化,剪枝)
knowledge-distillation model-compression nas pruning pytorch quantization quantized-networks sparsity sparsity-optimization
Last synced: 02 Nov 2024
https://github.com/vita-group/linearity-grafting
[ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu Chang, Sijia Liu, Pin-Yu Chen, Zhangyang Wang
certifiable-robustness certification linearity linearity-grafting neuron-pruning sparsity
Last synced: 16 Nov 2024
https://github.com/vita-group/double-win-lth
[ICML 2022] "Data-Efficient Double-Win Lottery Tickets from Robust Pre-training" by Tianlong Chen, Zhenyu Zhang, Sijia Liu, Yang Zhang, Shiyu Chang, Zhangyang Wang
adversarial-robustness data-efficient generalization lottery-ticket-hypothesis pretraining robust-pretraining sparsity transfer-learning
Last synced: 16 Nov 2024
https://github.com/zib-iol/bimp
Code to reproduce the experiments of ICLR2023-paper: How I Learned to Stop Worrying and Love Retraining
deep-learning learning-rate-scheduling neural-network optimization pruning pytorch sparsity
Last synced: 13 Dec 2024
https://github.com/mmxgn/smooth-convex-kl-nmf
Repository holding various implementation of specific NMF methods for speaker diarization
nmf nonnegative-matrix-factorization smoothness sparsity speaker-diarization
Last synced: 05 Nov 2024
https://github.com/vita-group/dataefficientlth
[NeurIPS 2022] "Sparse Winning Tickets are Data-Efficient Image Recognizers" by Mukund Varma T, Xuxi Chen, Zhenyu Zhang, Tianlong Chen, Subhashini Venugopalan, Zhangyang Wang
data-efficient-learning sparsity
Last synced: 16 Nov 2024
https://github.com/cypriengille/supervised-autoencoder
A supervised autoencoder with structured sparsity for efficient and informed clinical prognosis.
autoencoder feature-selection interpretability metabolomics metabolomics-database prognostic-score sparsity supervised-learning
Last synced: 14 Oct 2024
https://github.com/zib-iol/sms
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
averaging deep-learning neural-network optimization pruning pytorch sparsity
Last synced: 13 Dec 2024
https://github.com/sanjaradylov/sparse-cheml
Molecular-property prediction with sparsity
computational-chemistry elastic-net group-lasso molecular-property-prediction sparse-group-lasso sparsity
Last synced: 09 Nov 2024
https://github.com/vita-group/quantumsea
[QCE 2023]"QuantumSEA: In-Time Sparse Exploration for Noise Adaptive Quantum Circuits" Tianlong Chen, Zhenyu Zhang, Hanrui Wang, Jiaqi Gu, Zirui Li, David Z Pan, Frederic T Chong, Song Han, Zhangyang Wang
quantum-chemistry quantum-computing sparsity vqe
Last synced: 16 Nov 2024