https://github.com/aastroza/modal-outlines-examples

Examples of code that use Outlines to enable structured text generation for LLMs running on Modal
https://github.com/aastroza/modal-outlines-examples

cloud generative-ai gpu json llms modal outlines serverless structured-generation

Last synced: about 2 months ago
JSON representation

Examples of code that use Outlines to enable structured text generation for LLMs running on Modal

Host: GitHub
URL: https://github.com/aastroza/modal-outlines-examples
Owner: aastroza
License: apache-2.0
Created: 2024-04-18T17:57:28.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-04-25T20:12:53.000Z (over 1 year ago)
Last Synced: 2025-01-13T23:35:16.339Z (9 months ago)
Topics: cloud, generative-ai, gpu, json, llms, modal, outlines, serverless, structured-generation
Language: Jupyter Notebook
Homepage:
Size: 37.1 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # modal-outlines-examples

Examples of code that use [Outlines](https://github.com/outlines-dev/outlines) to enable structured text generation for LLMs running on [Modal](https://modal.com).

Is your computer only capable of running small, quantized LLMs? Would you love to have an **Nvidia H100** at home? Keep reading, because we're about to make your dream come true.

## Level 1: The Basics

**The basic idea:** We deploy a function to Modal, an easy-to-use cloud platform, then we call that function locally from a Jupyter notebook.

**The unbelievable part:** The function will run on a super powerful GPU in the cloud. Plus, we get to use free processing credits ($30 monthly).

Gated repo: https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2

## Level 2: Different LLMs

Yeah, I know Mistral is good, but what if I want to test different models?

## Level 3: vLLM

Coming [soon](https://github.com/vllm-project/vllm/pull/4109).

## Level 4: FastAPI

How to serve an inference endpoint for structured text generation. GPU-powered. Serverless. Automatically scales. Pay as you go.

## Installation

```shell

conda create --name modal-outlines-examples -c conda-forge python=3.11

conda activate modal-outlines-examples

pip install -r requirements.txt

python -m modal setup

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/aastroza/modal-outlines-examples

Awesome Lists containing this project

README