An open API service indexing awesome lists of open source software.

https://github.com/slinusc/deepspeed-mii-container

Launch your own high-performance DeepSpeed-MII server for seamless local LLM deployment. This repository provides a Dockerized solution to serve Hugging Face models (e.g., Mistral-7B) with an OpenAI-compatible API, enabling GPU-accelerated, low-latency inference out of the box.
https://github.com/slinusc/deepspeed-mii-container

container deepspeed docker engine inference llm mii

Last synced: 3 months ago
JSON representation

Launch your own high-performance DeepSpeed-MII server for seamless local LLM deployment. This repository provides a Dockerized solution to serve Hugging Face models (e.g., Mistral-7B) with an OpenAI-compatible API, enabling GPU-accelerated, low-latency inference out of the box.

Awesome Lists containing this project