Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by FMInference
A curated list of projects in awesome lists by FMInference .
https://github.com/fminference/flexgen
Running large language models on a single GPU for throughput-oriented scenarios.
deep-learning gpt-3 high-throughput large-language-models machine-learning offloading opt
Last synced: 02 Aug 2024
https://github.com/FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
deep-learning gpt-3 high-throughput large-language-models machine-learning offloading opt
Last synced: 31 Jul 2024
https://github.com/FMInference/H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
gpt-3 heavy-hitters high-throughput kv-cache large-language-models sparsity
Last synced: 03 Aug 2024