https://github.com/substratusai/batchelor

Last synced: 5 months ago
JSON representation

Host: GitHub
URL: https://github.com/substratusai/batchelor
Owner: substratusai
Created: 2023-11-24T06:06:14.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2023-12-16T01:39:32.000Z (over 1 year ago)
Last Synced: 2025-01-05T09:25:17.832Z (6 months ago)
Language: Python
Size: 48.8 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Batch inference
This directory contains a simple python script that can be used to perform batch inference.

The input is expected to be a newline delimited JSON file with the following format.
The JSON object can contain any fields. The JSON object will be passed as-is to the inference server.

The script will send a POST request to the inference server with the JSON object as the request body.

The concurrent requests are limited by the `--concurrency` flag.

## Examples
Batch inference to OpenAI compatible API. The request bodies are stored in a GCS bucket and the output is saved to a GCS bucket:
```sh
python main.py --requests-path gs://my-bucket/requests.jsonl \
--output-path gs://my-bucket/output.jsonl \
--flush-every 1000 \
--concurrency 100 \
--url http://localhost:8080/v1/completions
```

### Running a K8s Job
You can run the batch inference script as a K8s job. The following example assumes that you have a GCS bucket with the requests file and a GCS bucket to store the output file.

Example Job file:

[embedmd]:# (tests/k8s-job-example.yaml)
```yaml
apiVersion: batch/v1
kind: Job
metadata:
name: batchelor
spec:
backoffLimit: 2
template:
spec:
containers:
- name: batchelor
image: ghcr.io/substratusai/batchelor:main
imagePullPolicy: Always
env:
- name: URL
value: http://lingo/v1/completions
- name: REQUESTS_PATH
value: gs://mybucket/requests.jsonl
- name: OUTPUT_PATH
value: gs://mybucket/responses.jsonl
- name: IGNORE_FIELDS
value: "id"
- name: CONCURRENCY
value: "1000"
- name: FLUSH_EVERY
value: "2000"
restartPolicy: Never
```

## Flags
```sh
usage: main.py [-h] [--url URL] [--requests-path REQUESTS_PATH] [--output-path OUTPUT_PATH]
[--flush-every FLUSH_EVERY] [--concurrency CONCURRENCY] [--ignore-fields IGNORE_FIELDS]

Test Lingo using Python OpenAI API

options:
-h, --help show this help message and exit
--url URL
--requests-path REQUESTS_PATH
Path to read requests file
--output-path OUTPUT_PATH
Path to write output files
--flush-every FLUSH_EVERY
Number of requests to flush to disk
--concurrency CONCURRENCY
Number of concurrent requests. Defaults to 100
--ignore-fields IGNORE_FIELDS
Fields to ignore in the request. Example: --ignore-fields 'id,bar'
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/substratusai/batchelor

Awesome Lists containing this project

README