https://github.com/ax-llm/ax

The "official" unofficial DSPy framework. Build LLM powered agents and other workflows, based on the Stanford DSP paper.
https://github.com/ax-llm/ax
ai anthropic claude cohere gemini google google-gemini gpt-4 javascript large-language-models llm nodejs ollama openai opensource rag together-compute typescript vectordb
Last synced: 6 months ago
JSON representation
The "official" unofficial DSPy framework. Build LLM powered agents and other workflows, based on the Stanford DSP paper.
Host: GitHub
URL: https://github.com/ax-llm/ax
Owner: ax-llm
License: apache-2.0
Created: 2023-02-23T19:21:40.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2025-04-03T16:27:00.000Z (8 months ago)
Last Synced: 2025-04-04T22:02:28.833Z (7 months ago)
Topics: ai, anthropic, claude, cohere, gemini, google, google-gemini, gpt-4, javascript, large-language-models, llm, nodejs, ollama, openai, opensource, rag, together-compute, typescript, vectordb
Language: TypeScript
Homepage: http://axllm.dev
Size: 20.6 MB
Stars: 1,370
Watchers: 17
Forks: 101
Open Issues: 19
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: .github/CONTRIBUTING.md
- License: LICENSE
- Security: SECURITY.md
Awesome Lists containing this project

awesome-ChatGPT-repositories - ax - The unofficial DSPy framework. Build LLM powered Agents and "Agentic workflows" based on the Stanford DSP paper. (NLP)
StarryDivineSky - ax-llm/ax - > "outputField:type"`的提示签名，自动生成类型安全的提示，并支持多种数据类型输出。 (A01_文本生成_文本对话 / 大语言对话模型及数据)
README

          # Ax - Build LLM-Powered Agents (Typescript)

Use Ax and get an end-to-end streaming, multi-modal DSPy framework with agents and typed signatures. Works with all LLMs. Ax is always streaming and handles parsing, validating, error-correcting and function calling all while streaming. Ax is easy, fast and lowers your token usage.

[![NPM Package](https://img.shields.io/npm/v/@ax-llm/ax?style=for-the-badge&color=green)](https://www.npmjs.com/package/@ax-llm/ax)

[![Discord Chat](https://dcbadge.vercel.app/api/server/DSHg3dU7dW?style=for-the-badge)](https://discord.gg/DSHg3dU7dW)

[![Twitter](https://img.shields.io/twitter/follow/dosco?style=for-the-badge&color=red)](https://twitter.com/dosco)

## Why use Ax?

- Support for all top LLMs

- Prompts auto-generated from simple signatures

- Full native end-to-end streaming

- Build Agents that can call other agents

- Built in MCP, Model Context Protocol support

- Convert docs of any format to text

- RAG, smart chunking, embedding, querying

- Works with Vercel AI SDK

- Output validation while streaming

- Multi-modal DSPy supported

- Automatic prompt tuning using optimizers

- OpenTelemetry tracing / observability

- Production ready Typescript code

- Lite weight, zero-dependencies

## What's a prompt signature?



Efficient type-safe prompts are auto-generated from a simple signature. A prompt signature is made up of a `"task description" inputField:type "field description" -> "outputField:type`. The idea behind prompt signatures is based on work done in the "Demonstrate-Search-Predict" paper.

You can have multiple input and output fields, and each field can be of the types `string`, `number`, `boolean`, `date`, `datetime`, `class "class1, class2"`, `JSON`, or an array of any of these, e.g., `string[]`. When a type is not defined, it defaults to `string`. The suffix `?` makes the field optional (required by default) and `!` makes the field internal which is good for things like reasoning.

## Output Field Types

| Type                      | Description                       | Usage                      | Example Output                                     |

|---------------------------|-----------------------------------|----------------------------|----------------------------------------------------|

| `string`                  | A sequence of characters.         | `fullName:string`          | `"example"`                                        |

| `number`                  | A numerical value.                | `price:number`             | `42`                                               |

| `boolean`                 | A true or false value.            | `isEvent:boolean`          | `true`, `false`                                    |

| `date`                    | A date value.                     | `startDate:date`           | `"2023-10-01"`                                     |

| `datetime`                | A date and time value.            | `createdAt:datetime`       | `"2023-10-01T12:00:00Z"`                           |

| `class "class1,class2"`   | A classification of items.        | `category:class`           | `["class1", "class2", "class3"]`                   |

| `string[]`                | An array of strings.              | `tags:string[]`            | `["example1", "example2"]`                         |

| `number[]`                | An array of numbers.              | `scores:number[]`          | `[1, 2, 3]`                                        |

| `boolean[]`               | An array of boolean values.       | `permissions:boolean[]`    | `[true, false, true]`                              |

| `date[]`                  | An array of dates.                | `holidayDates:date[]`      | `["2023-10-01", "2023-10-02"]`                     |

| `datetime[]`              | An array of date and time values. | `logTimestamps:datetime[]` | `["2023-10-01T12:00:00Z", "2023-10-02T12:00:00Z"]` |

| `class[] "class1,class2"` | Multiple classes                  | `categories:class[]`       | `["class1", "class2", "class3"]`                   |

| `code "language"`         | A code block in a specific language | `code:code "python"`     | `print('Hello, world!')`                          |

## LLMs Supported

`Google Gemini`, `Google Vertex`, `OpenAI`, `Azure OpenAI`, `TogetherAI`, `Anthropic`, `Cohere`, `Mistral`, `Groq`, `DeepSeek`, `Ollama`, `Reka`, `Hugging Face`

## Install

```bash

npm install @ax-llm/ax

# or

yarn add @ax-llm/ax

```

## Example: Using chain-of-thought to summarize text

```typescript

import { AxAI, AxChainOfThought } from '@ax-llm/ax';

const textToSummarize = `

The technological singularity—or simply the singularity[1]—is a hypothetical future point in time at which technological growth becomes uncontrollable and irreversible, resulting in unforeseeable changes to human civilization.[2][3] ...`;

const ai = new AxAI({

  name: 'openai',

  apiKey: process.env.OPENAI_APIKEY as string

});

const gen = new AxChainOfThought(

  `textToSummarize -> textType:class "note, email, reminder", shortSummary "summarize in 5 to 10 words"`

);

const res = await gen.forward(ai, { textToSummarize });

console.log('>', res);

```

## Example: Building an agent

Use the agent prompt (framework) to build agents that work with other agents to complete tasks. Agents are easy to make with prompt signatures. Try out the agent example.

```typescript

# npm run tsx ./src/examples/agent.ts

const researcher = new AxAgent({

  name: 'researcher',

  description: 'Researcher agent',

  signature: `physicsQuestion "physics questions" -> answer "reply in bullet points"`

});

const summarizer = new AxAgent({

  name: 'summarizer',

  description: 'Summarizer agent',

  signature: `text "text so summarize" -> shortSummary "summarize in 5 to 10 words"`

});

const agent = new AxAgent({

  name: 'agent',

  description: 'A an agent to research complex topics',

  signature: `question -> answer`,

  agents: [researcher, summarizer]

});

agent.forward(ai, { questions: "How many atoms are there in the universe" })

```

## Vector DBs Supported

Vector databases are critical to building LLM workflows. We have clean abstractions over popular vector databases and our own quick in-memory vector database.

| Provider   | Tested  |

| ---------- | ------- |

| In Memory  | 🟢 100% |

| Weaviate   | 🟢 100% |

| Cloudflare | 🟡 50%  |

| Pinecone   | 🟡 50%  |

```typescript

// Create embeddings from text using an LLM

const ret = await this.ai.embed({ texts: 'hello world' });

// Create an in memory vector db

const db = new axDB('memory');

// Insert into vector db

await this.db.upsert({

  id: 'abc',

  table: 'products',

  values: ret.embeddings[0]

});

// Query for similar entries using embeddings

const matches = await this.db.query({

  table: 'products',

  values: embeddings[0]

});

```

Alternatively you can use the `AxDBManager` which handles smart chunking, embedding and querying everything

for you, it makes things almost too easy.

```typescript

const manager = new AxDBManager({ ai, db });

await manager.insert(text);

const matches = await manager.query(

  'John von Neumann on human intelligence and singularity.'

);

console.log(matches);

```

## RAG Documents

Using documents like PDF, DOCX, PPT, XLS, etc., with LLMs is a huge pain. We make it easy with Apache Tika, an open-source document processing engine.

Launch Apache Tika

```shell

docker run -p 9998:9998 apache/tika

```

Convert documents to text and embed them for retrieval using the `AxDBManager`, which also supports a reranker and query rewriter. Two default implementations, `AxDefaultResultReranker` and `AxDefaultQueryRewriter`, are available.

```typescript

const tika = new AxApacheTika();

const text = await tika.convert('/path/to/document.pdf');

const manager = new AxDBManager({ ai, db });

await manager.insert(text);

const matches = await manager.query('Find some text');

console.log(matches);

```

## Multi-modal DSPy

When using models like `GPT-4o` and `Gemini` that support multi-modal prompts, we support using image fields, and this works with the whole DSP pipeline.

```typescript

const image = fs

  .readFileSync('./src/examples/assets/kitten.jpeg')

  .toString('base64');

const gen = new AxChainOfThought(`question, animalImage:image -> answer`);

const res = await gen.forward(ai, {

  question: 'What family does this animal belong to?',

  animalImage: { mimeType: 'image/jpeg', data: image }

});

```

When using models like `gpt-4o-audio-preview` that support multi-modal prompts with audio support, we support using audio fields, and this works with the whole DSP pipeline.

```typescript

const audio = fs

  .readFileSync('./src/examples/assets/comment.wav')

  .toString('base64');

const gen = new AxGen(`question, commentAudio:audio -> answer`);

const res = await gen.forward(ai, {

  question: 'What family does this animal belong to?',

  commentAudio: { format: 'wav', data: audio }

});

```

## Streaming

### Assertions

We support parsing output fields and function execution while streaming. This allows for fail-fast and error correction without waiting for the whole output, saving tokens and costs and reducing latency. Assertions are a powerful way to ensure the output matches your requirements; they also work with streaming.

```typescript

// setup the prompt program

const gen = new AxChainOfThought(

  ai,

  `startNumber:number -> next10Numbers:number[]`

);

// add a assertion to ensure that the number 5 is not in an output field

gen.addAssert(({ next10Numbers }: Readonly<{ next10Numbers: number[] }>) => {

  return next10Numbers ? !next10Numbers.includes(5) : undefined;

}, 'Numbers 5 is not allowed');

// run the program with streaming enabled

const res = await gen.forward({ startNumber: 1 }, { stream: true });

// or run the program with end-to-end streaming

const generator = await gen.streamingForward({ startNumber: 1 }, { stream: true });

for await (const res of generator) {}

```

The above example allows you to validate entire output fields as they are streamed in. This validation works with streaming and when not streaming and is triggered when the whole field value is available. For true validation while streaming, check out the example below. This will massively improve performance and save tokens at scale in production.

```typescript

// add a assertion to ensure all lines start with a number and a dot.

gen.addStreamingAssert(

  'answerInPoints',

  (value: string) => {

    const re = /^\d+\./;

    // split the value by lines, trim each line,

    // filter out empty lines and check if all lines match the regex

    return value

      .split('\n')

      .map((x) => x.trim())

      .filter((x) => x.length > 0)

      .every((x) => re.test(x));

  },

  'Lines must start with a number and a dot. Eg: 1. This is a line.'

);

// run the program with streaming enabled

const res = await gen.forward(

  {

    question: 'Provide a list of optimizations to speedup LLM inference.'

  },

  { stream: true, debug: true }

);

```

### Field Processors

Field processors are a powerful way to process fields in a prompt. They are used to process fields in a prompt before the prompt is sent to the LLM.

```typescript

const gen = new AxChainOfThought(

  ai,

  `startNumber:number -> next10Numbers:number[]`

);  

const streamValue = false

const processorFunction = (value) => {

  return value.map((x) => x + 1);

}

// Add a field processor to the program     

const processor = new AxFieldProcessor(gen, 'next10Numbers', processorFunction, streamValue);

const res = await gen.forward({ startNumber: 1 });

```

## AI Routing and Load Balancing

Ax provides two powerful ways to work with multiple AI services: a load balancer for high availability and a router for model-specific routing.

### Load Balancer

The load balancer automatically distributes requests across multiple AI services based on performance and availability. If one service fails, it automatically fails over to the next available service.

```typescript

import { AxAI, AxBalancer } from '@ax-llm/ax'

// Setup multiple AI services

const openai = new AxAI({ 

  name: 'openai', 

  apiKey: process.env.OPENAI_APIKEY,

})

const ollama = new AxAI({ 

  name: 'ollama', 

  config: { model: "nous-hermes2" }

})

const gemini = new AxAI({ 

  name: 'google-gemini', 

  apiKey: process.env.GOOGLE_APIKEY 

})

// Create a load balancer with all services

const balancer = new AxBalancer([openai, ollama, gemini])

// Use like a regular AI service - automatically uses the best available service

const response = await balancer.chat({

  chatPrompt: [{ role: 'user', content: 'Hello!' }],

})

// Or use the balance with AxGen

const gen = new AxGen(`question -> answer`)

const res = await gen.forward(balancer,{ question: 'Hello!' })

```

### Multi-Service Router 

The router lets you use multiple AI services through a single interface, automatically routing requests to the right service based on the model specified.

```typescript

import { AxAI, AxMultiServiceRouter, AxAIOpenAIModel } from '@ax-llm/ax'

// Setup OpenAI with model list

const openai = new AxAI({ 

  name: 'openai', 

  apiKey: process.env.OPENAI_APIKEY,

  models: [

    {

      key: 'basic',

      model: AxAIOpenAIModel.GPT4OMini,

      description: 'Model for very simple tasks such as answering quick short questions',

    },

    {

      key: 'medium',

      model: AxAIOpenAIModel.GPT4O,

      description: 'Model for semi-complex tasks such as summarizing text, writing code, and more',

    }

  ]

})

// Setup Gemini with model list

const gemini = new AxAI({ 

  name: 'google-gemini', 

  apiKey: process.env.GOOGLE_APIKEY,

  models: [

    {

      key: 'deep-thinker',

      model: 'gemini-2.0-flash-thinking',

      description: 'Model that can think deeply about a task, best for tasks that require planning',

    },

    {

      key: 'expert',

      model: 'gemini-2.0-pro',

      description: 'Model that is the best for very complex tasks such as writing large essays, complex coding, and more',

    }

  ]

})

const ollama = new AxAI({ 

  name: 'ollama', 

  config: { model: "nous-hermes2" }

})

const secretService = {

    key: 'sensitive-secret',

    service: ollama,

    description: 'Model for sensitive secrets tasks'

}

// Create a router with all services

const router = new AxMultiServiceRouter([openai, gemini, secretService])

// Route to OpenAI's expert model

const openaiResponse = await router.chat({

  chatPrompt: [{ role: 'user', content: 'Hello!' }],

  model: 'expert'

})

// Or use the router with AxGen

const gen = new AxGen(`question -> answer`)

const res = await gen.forward(router, { question: 'Hello!' })

```

The load balancer is ideal for high availability while the router is perfect when you need specific models for specific tasks Both can be used with any of Ax's features like streaming, function calling, and chain-of-thought prompting.

You can also use the balancer and the router together either the multiple balancers can be used with the router or the router can be used with the balancer.

## Model Context Protocol (MCP)

Ax provides seamless integration with the Model Context Protocol (MCP), allowing your agents to access external tools, and resources through a standardized interface.

### Using AxMCPClient

The `AxMCPClient` allows you to connect to any MCP-compatible server and use its capabilities within your Ax agents:

```typescript

import { AxMCPClient, AxMCPStdioTransport } from '@ax-llm/ax'

// Initialize an MCP client with a transport

const transport = new AxMCPStdioTransport({

  command: 'npx',

  args: ['-y', '@modelcontextprotocol/server-memory'],

})

// Create the client with optional debug mode

const client = new AxMCPClient(transport, { debug: true })

// Initialize the connection

await client.init()

// Use the client's functions in an agent

const memoryAgent = new AxAgent({

  name: 'MemoryAssistant',

  description: 'An assistant with persistent memory',

  signature: 'input, userId -> response',

  functions: [client], // Pass the client as a function provider

})

// Or use the client with AxGen

const memoryGen = new AxGen('input, userId -> response', {

    functions: [client]

})

```

## Vercel AI SDK Integration

Install the ax provider package

```shell

npm i @ax-llm/ax-ai-sdk-provider

```

Then use it with the AI SDK, you can either use the AI provider or the Agent Provider

```typescript

const ai = new AxAI({

    name: 'openai',

    apiKey: process.env['OPENAI_APIKEY'] ?? "",

});

// Create a model using the provider

const model = new AxAIProvider(ai);

export const foodAgent = new AxAgent({

  name: 'food-search',

  description:

    'Use this agent to find restaurants based on what the customer wants',

  signature,

  functions

})

// Get vercel ai sdk state

const aiState = getMutableAIState()

// Create an agent for a specific task

const foodAgent = new AxAgentProvider(ai, {

    agent: foodAgent,

    updateState: (state) => {

         aiState.done({ ...aiState.get(), state })

    },

    generate: async ({ restaurant, priceRange }) => {

        return (

            

                
{restaurant as string} {priceRange as string}

            

        )

    }

})

// Use with streamUI a critical part of building chat UIs in the AI SDK

const result = await streamUI({

    model,

    initial: ,

    messages: [

        // ...

    ],

    text: ({ content, done, delta }) => {

        // ...

    },

    tools: {

        // @ts-ignore

        'find-food': foodAgent,

    }

})

```

## OpenTelemetry support

The ability to trace and observe your llm workflow is critical to building production workflows. OpenTelemetry is an industry-standard, and we support the new `gen_ai` attribute namespace.

```typescript

import { trace } from '@opentelemetry/api';

import {

  BasicTracerProvider,

  ConsoleSpanExporter,

  SimpleSpanProcessor

} from '@opentelemetry/sdk-trace-base';

const provider = new BasicTracerProvider();

provider.addSpanProcessor(new SimpleSpanProcessor(new ConsoleSpanExporter()));

trace.setGlobalTracerProvider(provider);

const tracer = trace.getTracer('test');

const ai = new AxAI({

  name: 'ollama',

  config: { model: 'nous-hermes2' },

  options: { tracer }

});

const gen = new AxChainOfThought(

  ai,

  `text -> shortSummary "summarize in 5 to 10 words"`

);

const res = await gen.forward({ text });

```

```json

{

  "traceId": "ddc7405e9848c8c884e53b823e120845",

  "name": "Chat Request",

  "id": "d376daad21da7a3c",

  "kind": "SERVER",

  "timestamp": 1716622997025000,

  "duration": 14190456.542,

  "attributes": {

    "gen_ai.system": "Ollama",

    "gen_ai.request.model": "nous-hermes2",

    "gen_ai.request.max_tokens": 500,

    "gen_ai.request.temperature": 0.1,

    "gen_ai.request.top_p": 0.9,

    "gen_ai.request.frequency_penalty": 0.5,

    "gen_ai.request.llm_is_streaming": false,

    "http.request.method": "POST",

    "url.full": "http://localhost:11434/v1/chat/completions",

    "gen_ai.usage.completion_tokens": 160,

    "gen_ai.usage.prompt_tokens": 290

  }

}

```

## Tuning the prompts (Basic)

You can tune your prompts using a larger model to help them run more efficiently and give you better results. This is done by using an optimizer like `AxBootstrapFewShot` with and examples from the popular `HotPotQA` dataset. The optimizer generates demonstrations `demos` which when used with the prompt help improve its efficiency.

```typescript

// Download the HotPotQA dataset from huggingface

const hf = new AxHFDataLoader({

  dataset: 'hotpot_qa',

  split: 'train'

});

const examples = await hf.getData<{ question: string; answer: string }>({

  count: 100,

  fields: ['question', 'answer']

});

const ai = new AxAI({

  name: 'openai',

  apiKey: process.env.OPENAI_APIKEY as string

});

// Setup the program to tune

const program = new AxChainOfThought<{ question: string }, { answer: string }>(

  ai,

  `question -> answer "in short 2 or 3 words"`

);

// Setup a Bootstrap Few Shot optimizer to tune the above program

const optimize = new AxBootstrapFewShot<

  { question: string },

  { answer: string }

>({

  program,

  examples

});

// Setup a evaluation metric em, f1 scores are a popular way measure retrieval performance.

const metricFn: AxMetricFn = ({ prediction, example }) =>

  emScore(prediction.answer as string, example.answer as string);

// Run the optimizer and remember to save the result to use later

const result = await optimize.compile(metricFn);

```



And to use the generated demos with the above `ChainOfThought` program

```typescript

const ai = new AxAI({

  name: 'openai',

  apiKey: process.env.OPENAI_APIKEY as string

});

// Setup the program to use the tuned data

const program = new AxChainOfThought<{ question: string }, { answer: string }>(

  ai,

  `question -> answer "in short 2 or 3 words"`

);

// load tuning data

program.loadDemos('demos.json');

const res = await program.forward({

  question: 'What castle did David Gregory inherit?'

});

console.log(res);

```

## Tuning the prompts (Advanced, Mipro v2)

MiPRO v2 is an advanced prompt optimization framework that uses Bayesian optimization to automatically find the best instructions, demonstrations, and examples for your LLM programs. By systematically exploring different prompt configurations, MiPRO v2 helps maximize model performance without manual tuning.

### Key Features

- **Instruction optimization**: Automatically generates and tests multiple instruction candidates

- **Few-shot example selection**: Finds optimal demonstrations from your dataset

- **Smart Bayesian optimization**: Uses UCB (Upper Confidence Bound) strategy to efficiently explore configurations

- **Early stopping**: Stops optimization when improvements plateau to save compute

- **Program and data-aware**: Considers program structure and dataset characteristics

### Basic Usage

```typescript

import { AxAI, AxChainOfThought, AxMiPRO } from '@ax-llm/ax'

// 1. Setup your AI service

const ai = new AxAI({

  name: 'google-gemini',

  apiKey: process.env.GOOGLE_APIKEY

})

// 2. Create your program

const program = new AxChainOfThought(`input -> output`)

// 3. Configure the optimizer

const optimizer = new AxMiPRO({

  ai,

  program,

  examples: trainingData, // Your training examples

  options: {

    numTrials: 20,  // Number of configurations to try

    auto: 'medium'  // Optimization level

  }

})

// 4. Define your evaluation metric

const metricFn = ({ prediction, example }) => {

  return prediction.output === example.output

}

// 5. Run the optimization

const optimizedProgram = await optimizer.compile(metricFn, {

  valset: validationData  // Optional validation set

})

// 6. Use the optimized program

const result = await optimizedProgram.forward(ai, { input: "test input" })

```

### Configuration Options

MiPRO v2 provides extensive configuration options:

| Option | Description | Default |

|--------|-------------|---------|

| `numCandidates` | Number of instruction candidates to generate | 5 |

| `numTrials` | Number of optimization trials | 30 |

| `maxBootstrappedDemos` | Maximum number of bootstrapped demonstrations | 3 |

| `maxLabeledDemos` | Maximum number of labeled examples | 4 |

| `minibatch` | Use minibatching for faster evaluation | true |

| `minibatchSize` | Size of evaluation minibatches | 25 |

| `earlyStoppingTrials` | Stop if no improvement after N trials | 5 |

| `minImprovementThreshold` | Minimum score improvement threshold | 0.01 |

| `programAwareProposer` | Use program structure for better proposals | true |

| `dataAwareProposer` | Consider dataset characteristics | true |

| `verbose` | Show detailed optimization progress | false |

### Optimization Levels

You can quickly configure optimization intensity with the `auto` parameter:

```typescript

// Light optimization (faster, less thorough)

const optimizedProgram = await optimizer.compile(metricFn, { auto: 'light' })

// Medium optimization (balanced)

const optimizedProgram = await optimizer.compile(metricFn, { auto: 'medium' })

// Heavy optimization (slower, more thorough)

const optimizedProgram = await optimizer.compile(metricFn, { auto: 'heavy' })

```

### Advanced Example: Sentiment Analysis

```typescript

// Create sentiment analysis program

const classifyProgram = new AxChainOfThought<

  { productReview: string },

  { label: string }

>(`productReview -> label:string "positive" or "negative"`)

// Configure optimizer with advanced settings

const optimizer = new AxMiPRO({

  ai,

  program: classifyProgram,

  examples: trainingData,

  options: {

    numCandidates: 3,

    numTrials: 10,

    maxBootstrappedDemos: 2,

    maxLabeledDemos: 3,

    earlyStoppingTrials: 3,

    programAwareProposer: true,

    dataAwareProposer: true,

    verbose: true

  }

})

// Run optimization and save the result

const optimizedProgram = await optimizer.compile(metricFn, {

  valset: validationData

})

// Save configuration for future use

const programConfig = JSON.stringify(optimizedProgram, null, 2)

await fs.promises.writeFile('./optimized-config.json', programConfig)

```

### How It Works

MiPRO v2 works through these steps:

1. Generates various instruction candidates

2. Bootstraps few-shot examples from your data

3. Selects labeled examples directly from your dataset

4. Uses Bayesian optimization to find the optimal combination

5. Applies the best configuration to your program

By exploring the space of possible prompt configurations and systematically measuring performance, MiPRO v2 delivers optimized prompts that maximize your model's effectiveness.

## Built-in Functions

| Function           | Name               | Description                                  |

| ------------------ | ------------------ | -------------------------------------------- |

| JS Interpreter     | AxJSInterpreter    | Execute JS code in a sandboxed env           |

| Docker Sandbox     | AxDockerSession    | Execute commands within a docker environment |

| Embeddings Adapter | AxEmbeddingAdapter | Fetch and pass embedding to your function    |

## Check out all the examples

Use the `tsx` command to run the examples. It makes the node run typescript code. It also supports using an `.env` file to pass the AI API Keys instead of putting them in the command line.

```shell

OPENAI_APIKEY=openai_key npm run tsx ./src/examples/marketing.ts

```

| Example 
| ------------------- 
| customer-support.ts 
| function.ts 
| food-search.ts 
| marketing.ts 
| vectordb.ts 
| fibonacci.ts 
| summarize.ts 
| chain-of-thought.ts 
| rag.ts 
| rag-docs.ts 
| react.ts 
| agent.ts 
| streaming1.ts 
| streaming2.ts 
| streaming3.ts 
| smart-hone.ts 
| multi-modal.ts 
| balancer.ts 
| docker.ts 
| prime.ts 
| simple-classify.ts 
| mcp-client-memory.ts 
| mcp-client-blender.ts 
| tune-bootstrap.ts 
| tune-mipro.ts 
| tune-usage.ts

| Description                                             | | ------------------------------------------------------- | | Extract valuable details from customer communications   | | Simple single function calling example                  | | Multi-step, multi-function calling example              | | Generate short effective marketing sms messages         | | Chunk, embed and search text                            | | Use the JS code interpreter to compute fibonacci        | | Generate a short summary of a large block of text       | | Use chain-of-thought prompting to answer questions      | | Use multi-hop retrieval to answer questions             | | Convert PDF to text and embed for rag search            | | Use function calling and reasoning to answer questions  | | Agent framework, agents can use other agents, tools etc | | Output fields validation while streaming                | | Per output field validation while streaming             | | End-to-end streaming example `streamingForward()`       | | Agent looks for dog in smart home                       | | Use an image input along with other text inputs         | | Balance between various llm's based on cost, etc        | | Use the docker sandbox to find files by description     | | Using field processors to process fields in a prompt    | | Use a simple classifier to classify stuff               | | Example of using an MCP server for memory with Ax | | Example of using an MCP server for Blender with Ax | | Use bootstrap optimizer to improve prompt efficiency           | | Use mipro v2 optimizer to improve prompt efficiency           | | Use the optimized tuned prompts                         |

## Our Goal

Large language models (LLMs) are becoming really powerful and have reached a point where they can work as the backend for your entire product. However, there's still a lot of complexity to manage from using the correct prompts, models, streaming, function calls, error correction, and much more. We aim to package all this complexity into a well-maintained, easy-to-use library that can work with all state-of-the-art LLMs. Additionally, we are using the latest research to add new capabilities like DSPy to the library.

## How to use this library?

### 1. Pick an AI to work with

```ts

// Pick a LLM

const ai = new AxOpenAI({ apiKey: process.env.OPENAI_APIKEY } as AxOpenAIArgs);

```

### 2. Create a prompt signature based on your usecase

```ts

// Signature defines the inputs and outputs of your prompt program

const cot = new ChainOfThought(ai, `question:string -> answer:string`, { mem });

```

### 3. Execute this new prompt program

```ts

// Pass in the input fields defined in the above signature

const res = await cot.forward({ question: 'Are we in a simulation?' });

```

### 4. Or if you just want to directly use the LLM

```ts

const res = await ai.chat([

  { role: "system", content: "Help the customer with his questions" }

  { role: "user", content: "I'm looking for a Macbook Pro M2 With 96GB RAM?" }

]);

```

## How do you use function calling

### 1. Define the functions

```ts

// define one or more functions and a function handler

const functions = [

  {

    name: 'getCurrentWeather',

    description: 'get the current weather for a location',

    parameters: {

      type: 'object',

      properties: {

        location: {

          type: 'string',

          description: 'location to get weather for'

        },

        units: {

          type: 'string',

          enum: ['imperial', 'metric'],

          default: 'imperial',

          description: 'units to use'

        }

      },

      required: ['location']

    },

    func: async (args: Readonly<{ location: string; units: string }>) => {

      return `The weather in ${args.location} is 72 degrees`;

    }

  }

];

```

### 2. Pass the functions to a prompt

```ts

const cot = new AxGen(ai, `question:string -> answer:string`, { functions });

```

## Enable debug logs

```ts

const ai = new AxAI({ name: "openai", apiKey: process.env.OPENAI_APIKEY } as AxOpenAIArgs);

ai.setOptions({ debug: true });

```

## Reach out

We're happy to help reach out if you have questions or join the Discord

[twitter/dosco](https://twitter.com/dosco)

## FAQ

### 1. The LLM can't find the correct function to use

Improve the function naming and description. Be very clear about what the function does. Also, ensure the function parameters have good descriptions. The descriptions can be a little short but need to be precise.

### 2. How do I change the configuration of the LLM I'm using?

You can pass a configuration object as the second parameter when creating a new LLM object.

```ts

const apiKey = process.env.OPENAI_APIKEY;

const conf = AxOpenAIBestConfig();

const ai = new AxOpenAI({ apiKey, conf } as AxOpenAIArgs);

```

## 3. My prompt is too long / can I change the max tokens?

```ts

const conf = axOpenAIDefaultConfig(); // or OpenAIBestOptions()

conf.maxTokens = 2000;

```

## 4. How do I change the model? (e.g., I want to use GPT4)

```ts

const conf = axOpenAIDefaultConfig(); // or OpenAIBestOptions()

conf.model = OpenAIModel.GPT4Turbo;

```

## Monorepo tips & tricks

It is essential to remember that we should only run `npm install` from the root directory. This prevents the creation of nested `package-lock.json` files and avoids non-deduplicated `node_modules`.

Adding new dependencies in packages should be done with e.g. `npm install lodash --workspace=ax` (or just modify the appropriate `package.json` and run `npm install` from root).
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ax-llm/ax

Awesome Lists containing this project

README

{restaurant as string} {priceRange as string}