Projects in Awesome Lists tagged with model-acceleration

A curated list of projects in awesome lists tagged with model-acceleration .

- Recently synced
- Stars

https://github.com/Alpha-Innovator/AdaptiveDiffusion

[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

adaptive-inference diffusion-models efficient-inference model-acceleration stable-diffusion training-free

Last synced: 13 Mar 2025

https://github.com/musco-ai/musco-pytorch

MUSCO: MUlti-Stage COmpression of neural networks

cp-decomposition deep-neural-networks low-rank model-acceleration model-compression network-acceleration network-compression pytorch tensor-decomposition truncated-svd tucker vbmf

Last synced: 15 Apr 2025

https://github.com/alpha-innovator/adaptivediffusion

[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

adaptive-inference diffusion-models efficient-inference model-acceleration stable-diffusion training-free

Last synced: 09 Apr 2025

https://github.com/unimodal4reasoning/adaptivediffusion

[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

adaptive-inference diffusion-models efficient-inference model-acceleration stable-diffusion training-free

Last synced: 09 Dec 2024

https://github.com/ksm26/efficiently-serving-llms

Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Predibase’s LoRAX framework inference server.

batch-processing deep-learning-techniques inference-optimization large-scale-deployment machine-learning-operations model-acceleration model-inference-service model-serving optimization-techniques performance-enhancement scalability-strategies server-optimization serving-infrastructure text-generation

Last synced: 28 Mar 2025

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome