Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/datvodinh/serve-llm

Serve high throughput and scalable LLM using Ray and vLLM
https://github.com/datvodinh/serve-llm

kubernetes llm ray torch transformers vllm

Last synced: about 17 hours ago
JSON representation

Serve high throughput and scalable LLM using Ray and vLLM

Awesome Lists containing this project

README

        

# Serve LLM with high throughput and scalable using Ray and vLLM

## Install

```bash
cd serve
make
```

## Serve

- Locally

```bash
make serve
```

- Docker

```bash
docker run --gpus all -v .:/code ghcr.io/datvodinh/serve:latest
```

- Kubernetes

```bash
kubectl apply -f k8s/ray-service.yaml
```