Projects in Awesome Lists tagged with model-acceleration
A curated list of projects in awesome lists tagged with model-acceleration .
https://github.com/Alpha-Innovator/AdaptiveDiffusion
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
adaptive-inference diffusion-models efficient-inference model-acceleration stable-diffusion training-free
Last synced: 13 Mar 2025
https://github.com/musco-ai/musco-pytorch
MUSCO: MUlti-Stage COmpression of neural networks
cp-decomposition deep-neural-networks low-rank model-acceleration model-compression network-acceleration network-compression pytorch tensor-decomposition truncated-svd tucker vbmf
Last synced: 15 Apr 2025
https://github.com/alpha-innovator/adaptivediffusion
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
adaptive-inference diffusion-models efficient-inference model-acceleration stable-diffusion training-free
Last synced: 09 Apr 2025
https://github.com/unimodal4reasoning/adaptivediffusion
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
adaptive-inference diffusion-models efficient-inference model-acceleration stable-diffusion training-free
Last synced: 09 Dec 2024
https://github.com/ksm26/efficiently-serving-llms
Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Predibase’s LoRAX framework inference server.
batch-processing deep-learning-techniques inference-optimization large-scale-deployment machine-learning-operations model-acceleration model-inference-service model-serving optimization-techniques performance-enhancement scalability-strategies server-optimization serving-infrastructure text-generation
Last synced: 28 Mar 2025