https://github.com/outerbounds/triton-metaflow-starter-pack

Last synced: about 1 year ago
JSON representation

Host: GitHub
URL: https://github.com/outerbounds/triton-metaflow-starter-pack
Owner: outerbounds
Created: 2023-11-15T22:29:42.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2023-11-18T15:24:49.000Z (over 2 years ago)
Last Synced: 2025-04-12T18:29:41.733Z (over 1 year ago)
Language: Python
Size: 4.15 MB
Stars: 5
Watchers: 6
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-metaflow - triton-metaflow-starter-pack - NVIDIA Triton Inference Server starter pack. (Examples & Tutorials)

README

# What is this repo

This code will help you get started using NVIDIA's [Triton Inference Server](https://developer.nvidia.com/triton-inference-server) and [Outerbounds](https://outerbounds.com/platform) (built on open-source [Metaflow](https://docs.metaflow.org/)).

# How to use this repo

There are subdirectories each containing different getting started toolkits for using Triton with Outerbounds. You can find detailed instructions in the README files in each subdirectory.
- The [trees](./trees) repository provides a template for how to use Metaflow to orchestrate training and tuning of Scikit-learn, XGBoost, or LightGBM models, pushing the resulting model to cloud storage so it is ready to be used on a Triton Inference Server.
- The [llm](./llm) repository provides a template for how to use Metaflow to orchestrate fine-tuning for transformer models, pushing the resulting model and tokenizer state to cloud storage so it is ready to be used on a Triton Inference Server.

# What you need to get the most out of it
- Set up an inference server where you want to host models.
- Best to have a GPU for `/llm`
- Access to a [Metaflow deployment](https://outerbounds.com/engineering/welcome/)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/outerbounds/triton-metaflow-starter-pack

Awesome Lists containing this project

README