An open API service indexing awesome lists of open source software.

https://github.com/FMInference/FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.
https://github.com/FMInference/FlexLLMGen

deep-learning gpt-3 high-throughput large-language-models machine-learning offloading opt

Last synced: about 1 month ago
JSON representation

Running large language models on a single GPU for throughput-oriented scenarios.

Awesome Lists containing this project