An open API service indexing awesome lists of open source software.

https://github.com/opokiu/modal-vllm-server

Run LLMs quickly and efficiently with the modal-vllm-server, which optimizes GPU settings for each model. This project simplifies deployment on Modal's serverless infrastructure, making it easy to manage and serve your models. 🐙✨
https://github.com/opokiu/modal-vllm-server

api fastapi first-repository llama llm modal modal-labs openai python pytorch transformer vllm

Last synced: 4 months ago
JSON representation

Run LLMs quickly and efficiently with the modal-vllm-server, which optimizes GPU settings for each model. This project simplifies deployment on Modal's serverless infrastructure, making it easy to manage and serve your models. 🐙✨

Awesome Lists containing this project