https://github.com/thomasnormal/fewshot

Host: GitHub
URL: https://github.com/thomasnormal/fewshot
Owner: thomasnormal
Created: 2024-08-06T02:28:25.000Z (11 months ago)
Default Branch: main
Last Pushed: 2024-08-21T23:06:50.000Z (11 months ago)
Last Synced: 2024-08-22T11:02:15.959Z (11 months ago)
Language: Python
Size: 474 KB
Stars: 16
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

README

# Simple Few-Shot Learning with LLMs

A small [DSPy](https://github.com/stanfordnlp/dspy) clone built on [Instructor](https://python.useinstructor.com/)

## Key Features

- **Pydantic Models**: Robust data validation and serialization using Pydantic.

- **Optimizers**: Includes an Optuna-based few-shot optimizer for hyperparameter tuning.

- **Vision Models**: Easy to tune few-shot prompts, even with image examples.

- **Chat Model Templates**: Uses prompt prefilling and custom templates to make the most of modern LLM APIs.

- **Asynchronous Processing**: Utilizes `asyncio` for efficient concurrent task handling.
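
To illustrate the concurrency pattern behind the last feature, here is a minimal sketch that uses only the standard library. It is not this project's implementation: `fake_llm_call`, `answer_all`, and `max_concurrency` are hypothetical names chosen for the example, and the "LLM call" is simulated with a short sleep.

```python
import asyncio
import random


async def fake_llm_call(question: str) -> str:
    """Hypothetical stand-in for an LLM request; sleeps instead of calling an API."""
    await asyncio.sleep(random.uniform(0.01, 0.05))
    return f"answer to {question!r}"


async def answer_all(questions: list[str], max_concurrency: int = 4) -> dict[str, str]:
    """Run all requests concurrently, with at most `max_concurrency` in flight."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def bounded(question: str) -> tuple[str, str]:
        # The semaphore caps how many simulated requests run at once.
        async with semaphore:
            return question, await fake_llm_call(question)

    results: dict[str, str] = {}
    # as_completed yields awaitables in finishing order, not submission order.
    for finished in asyncio.as_completed([bounded(q) for q in questions]):
        question, answer = await finished
        results[question] = answer
    return results


if __name__ == "__main__":
    print(asyncio.run(answer_all(["q1", "q2", "q3"])))
```

Handling results as they finish, rather than waiting for the whole batch, is what lets a training loop score each prediction and give feedback as soon as it arrives.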

## Usage

```bash

git clone [email protected]:thomasnormal/fewshot.git

cd fewshot

pip install -e .

python examples/simple.py

```

The framework supports various AI tasks. Here's a basic example for question answering:

```python

import instructor

import openai

from datasets import load_dataset

from pydantic import Field, BaseModel

from tqdm.asyncio import tqdm

from fewshot import Predictor

from fewshot.optimizers import OptunaFewShot

# DSPy inspired Pydantic classes for inputs.

class Question(BaseModel):

    """Answer questions with short factoid answers."""

    question: str

class Answer(BaseModel):

    reasoning: str = Field(description="reasoning for the answer")

    answer: str = Field(description="often between 1 and 5 words")

async def main():

    dataset = load_dataset("hotpot_qa", "fullwiki")

    trainset = [(Question(question=x["question"]), x["answer"]) for x in dataset["train"]]

    client = instructor.from_openai(openai.AsyncOpenAI())  # Use any Instructor supported LLM

    pred = Predictor(client, "gpt-4o-mini", output_type=Answer, optimizer=OptunaFewShot(3))

    async for t, (input, expected), answer in pred.as_completed(trainset):

        score = int(answer.answer == expected)

        t.backwards(score=score)  # Update the model, just like PyTorch

    pred.inspect_history()  # Inspect the messages sent to the LLM

```
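
The example above scores predictions with strict string equality, so `"The Eiffel Tower."` and `"eiffel tower"` would count as a mismatch. A more forgiving alternative is sketched below, assuming a normalized exact match is acceptable for the task. The helpers `normalize_answer` and `exact_match` are hypothetical and not part of `fewshot`. They follow the normalization commonly used for extractive QA benchmarks: lowercase, strip punctuation, drop English articles, collapse whitespace.

```python
import re
import string


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(predicted: str, expected: str) -> int:
    """Return 1 if the normalized strings are equal, else 0."""
    return int(normalize_answer(predicted) == normalize_answer(expected))
```

In the loop above, `score = exact_match(answer.answer, expected)` could then stand in for the strict comparison.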

## Example of Few-Shot Tuning on Images

Code: [examples/circles.py](https://github.com/thomasnormal/fewshot/blob/main/examples/circles.py)

![circles](https://raw.githubusercontent.com/thomasnormal/fewshot/main/static/circles.png)
