https://github.com/oneflow-inc/serving
OneFlow Serving
https://github.com/oneflow-inc/serving
Last synced: about 1 year ago
JSON representation
OneFlow Serving
- Host: GitHub
- URL: https://github.com/oneflow-inc/serving
- Owner: Oneflow-Inc
- License: apache-2.0
- Created: 2021-11-01T02:34:16.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2025-04-10T02:36:10.000Z (about 1 year ago)
- Last Synced: 2025-04-10T03:33:03.409Z (about 1 year ago)
- Language: C++
- Homepage:
- Size: 2.13 MB
- Stars: 20
- Watchers: 38
- Forks: 2
- Open Issues: 6
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# OneFlow Serving
[](https://hub.docker.com/r/oneflowinc/oneflow-serving)
[](https://hub.docker.com/r/oneflowinc/oneflow-serving)
[](https://github.com/Oneflow-Inc/serving/blob/main/LICENSE)
[](https://github.com/Oneflow-Inc/serving/pulls)
Currently, we have implemented an oneflow-backend for the [Triton Inference Server](https://github.com/triton-inference-server/server) that enables model serving.
# Triton Inference Server OneFlow Backend
OneFlow Backend For Triton Inference Server
## Get Started
Here is a [tutorial](./doc/tutorial.md) about how to export the model and how to deploy it. You can also follow the instructions below to get started.
1. Download and save model
```
cd examples/resnet50/
python3 export_model.py
```
2. Launch triton server
```
cd ../../ # back to root of the serving
docker run --rm --runtime=nvidia --network=host -v$(pwd)/examples:/models \
oneflowinc/oneflow-serving
curl -v localhost:8000/v2/health/ready # ready check
```
3. Send images and predict
```
pip3 install tritonclient[all]
cd examples/resnet50/
curl -o cat.jpg https://images.pexels.com/photos/156934/pexels-photo-156934.jpeg
python3 client.py --image cat.jpg
```
## Documentation
- [Tutorial (Chinese)](./doc/tutorial.md)
- [Build](./doc/build.md)
- [Model Configuration](./doc/model_config.md)
- [OneFlow Cookies: Serving (Chinese)](https://docs.oneflow.org/master/cookies/serving.html)
- [OneFlow Cookies: Serving (English)](https://docs.oneflow.org/en/master/cookies/serving.html)
- [Command Line Tool: oneflow-serving](./doc/command_line_tool.md)
## Known Issues
### Multiple model instance execution
The current version of oneflow does not support concurrent execution of multiple model instances. You can launch multiple containers (which is easy with Kubernetes) to bypass this limitation.