https://github.com/yinqiwen/lmsf



Rust LLM Serving Framework

## Features

- Paged Attention
- Continuous Batching
- Quantization
  - AWQ
  - SqueezeLLM
- Models
  - Llama
  - Gemma
  - ChatGLM

## Getting Started

**Examples**
```sh
$ cargo run --release --example llm_engine_example -- --model <model_path> --gpu-memory-utilization 0.95 --block-size 8 --max-model-len 1024
```
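
The README does not define the flags above. As a rough sketch, assuming `--block-size` is the number of tokens stored per paged-attention KV-cache block and `--max-model-len` is the maximum sequence length in tokens (the meaning these names have in vLLM, which may not match this project exactly), the number of blocks a sequence occupies can be estimated as follows. The helper `blocks_needed` is hypothetical and not part of lmsf.

```rust
/// Hypothetical helper (not part of lmsf): number of fixed-size KV-cache
/// blocks needed to hold `num_tokens` tokens, rounding up to a whole block.
fn blocks_needed(num_tokens: usize, block_size: usize) -> usize {
    assert!(block_size > 0, "block size must be positive");
    (num_tokens + block_size - 1) / block_size
}

fn main() {
    // Values taken from the example command; their semantics are assumed.
    let max_model_len = 1024;
    let block_size = 8;

    // A sequence at the maximum length fills 1024 / 8 = 128 blocks.
    println!(
        "blocks for a full-length sequence: {}",
        blocks_needed(max_model_len, block_size)
    );
    // A 100-token sequence needs 13 blocks; the last one is only half full.
    println!(
        "blocks for a 100-token sequence: {}",
        blocks_needed(100, block_size)
    );
}
```

Under these assumptions, a smaller block size wastes fewer slots in the last block of each sequence, at the cost of more blocks to track.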

**API Server**
```sh
$ cargo build --release
$ ./target/release/entrypoints --model <model_path> --gpu-memory-utilization 0.95 --block-size 8 --max-model-len 1024 --host 0.0.0.0 --port 8000
```
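
The README does not document the server's HTTP routes, so the sketch below only checks that something is accepting TCP connections on the configured port. It assumes the server was started with `--port 8000` as in the command above and is reachable on the local machine; `is_listening` is a hypothetical helper, not part of lmsf.

```rust
use std::net::{SocketAddr, TcpStream};
use std::time::Duration;

/// Hypothetical helper (not part of lmsf): returns true if a TCP connection
/// to `addr` succeeds within `timeout`. An unparsable address yields false.
fn is_listening(addr: &str, timeout: Duration) -> bool {
    match addr.parse::<SocketAddr>() {
        Ok(sock) => TcpStream::connect_timeout(&sock, timeout).is_ok(),
        Err(_) => false,
    }
}

fn main() {
    // The server binds to 0.0.0.0, so it is reachable via loopback.
    let addr = "127.0.0.1:8000";
    if is_listening(addr, Duration::from_secs(2)) {
        println!("a server is accepting connections on {addr}");
    } else {
        println!("nothing is listening on {addr}");
    }
}
```

A successful connection only shows that the port is open; it says nothing about whether the model has finished loading.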