An open API service indexing awesome lists of open source software.

https://github.com/fminference/flexgen

Running large language models on a single GPU for throughput-oriented scenarios.

Topics: deep-learning, gpt-3, high-throughput, large-language-models, machine-learning, offloading, opt

Last synced: 4 months ago


Awesome Lists containing this project