An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with model-acceleration

A curated list of projects in awesome lists tagged with model-acceleration .

https://github.com/Alpha-Innovator/AdaptiveDiffusion

[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

adaptive-inference diffusion-models efficient-inference model-acceleration stable-diffusion training-free

Last synced: 13 Mar 2025

https://github.com/alpha-innovator/adaptivediffusion

[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

adaptive-inference diffusion-models efficient-inference model-acceleration stable-diffusion training-free

Last synced: 09 Apr 2025

https://github.com/unimodal4reasoning/adaptivediffusion

[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

adaptive-inference diffusion-models efficient-inference model-acceleration stable-diffusion training-free

Last synced: 09 Dec 2024

https://github.com/ksm26/efficiently-serving-llms

Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Predibase’s LoRAX framework inference server.

batch-processing deep-learning-techniques inference-optimization large-scale-deployment machine-learning-operations model-acceleration model-inference-service model-serving optimization-techniques performance-enhancement scalability-strategies server-optimization serving-infrastructure text-generation

Last synced: 28 Mar 2025