https://github.com/wrtnlabs/agent

Super-easy AI Agent Library specialized for function calling
https://github.com/wrtnlabs/agent
agent ai chatbot llm-function-calling multi-agent-system openai openapi swagger
Last synced: 8 months ago
JSON representation
Super-easy AI Agent Library specialized for function calling
Host: GitHub
URL: https://github.com/wrtnlabs/agent
Owner: wrtnlabs
License: mit
Created: 2025-02-13T02:25:56.000Z (8 months ago)
Default Branch: main
Last Pushed: 2025-02-13T08:14:35.000Z (8 months ago)
Last Synced: 2025-02-13T08:36:03.103Z (8 months ago)
Topics: agent, ai, chatbot, llm-function-calling, multi-agent-system, openai, openapi, swagger
Language: TypeScript
Homepage:
Size: 1000 Bytes
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project

README

          # `@wrtnlabs/agent`

![Key Concept Gear Diagram](https://github.com/user-attachments/assets/2cdb6e05-77f0-419a-ab7f-539b3d2bcbc5)

[![GitHub license](https://img.shields.io/badge/license-MIT-blue.svg)](https://github.com/wrtnlabs/agent/blob/master/LICENSE)

[![npm version](https://img.shields.io/npm/v/@wrtnlabs/agent.svg)](https://www.npmjs.com/package/@wrtnlabs/agent)

[![Downloads](https://img.shields.io/npm/dm/@wrtnlabs/agent.svg)](https://www.npmjs.com/package/@wrtnlabs/agent)

[![Build Status](https://github.com/wrtnlabs/agent/workflows/build/badge.svg)](https://github.com/wrtnlabs/agent/actions?query=workflow%3Abuild)

The simplest AI agent library, specialized in **LLM function calling**.

`@wrtnlabs/agent` is the simplest AI agent library specialized for LLM (Large Language Model) function calling. You can provide functions to call by *Swagger/OpenAPI* document or *TypeScript class type*, and it will make everything possible. *Super AI Chatbot* development, or *Multi-Agent Orchestration*, all of them can be realized by the function calling.

For example, if you provide **Swagger document** of a Shopping Mall Server, `@wrtnlabs/agent` will compose **Super AI Chatbot** application. In the chatbot application, customers can purchase products just by conversation texts. If you wanna automate the counseling or refunding process, you also can do it just by delivering the Swagger document.

Also, the LLM function calling strategy is effective for the **Multi-Agent Orchestration**, and it is easier to develop than any other way. You don't need to learn any complicate framework and its specific paradigms and patterns. Just connect them through class, and deliver the **TypeScript class type**. `@wrtnlabs/agent` will centralize and realize the multi-agent orchestration through function calling.

> https://github.com/user-attachments/assets/01604b53-aca4-41cb-91aa-3faf63549ea6

>

> Demonstration video of Shopping AI Chatbot

## How to Use

### Setup

```bash

npm install @wrtnlabs/agent @samchon/openapi typia

npx typia setup

```

Install not only `@wrtnlabs/agent`, but also [`@samchon/openapi`](https://github.com/samchon/openapi) and [`typia`](https://github.com/samchon/typia).

`@samchon/openapi` is an OpenAPI specification library which can convert Swagger/OpenAPI document to LLM function calling schema. And `typia` is a transformer (compiler) library which can compose LLM function calling schema from a TypeScript class type.

By the way, as `typia` is a transformer library analyzing TypeScript source code in the compilation level, it needs additional setup command `npx typia setup`. Also, if you're not using non-standard TypeScript compiler (not `tsc`) or developing the agent in the frontend environment, you have to setup [`@ryoppippi/unplugin-typia`](https://typia.io/docs/setup/#unplugin-typia) too.

### Chat with Backend Server

```typescript

import { IHttpLlmApplication } from "@samchon/openapi";

import { WrtnAgent, createHttpApplication } from "@wrtnlabs/agent";

import OpenAI from "openai";

import { IValidation } from "typia";

const main = async (): Promise => {

  // LOAD SWAGGER DOCUMENT, AND CONVERT TO LLM APPLICATION SCHEMA

  const application: IValidation> =

    createHttpApplication({

      model: "chatgpt",

      document: OpenApi.convert(

        await fetch("https://shopping-be.wrtn.ai/editor/swagger.json").then(

          (r) => r.json()

        )

      ),

    });

  if (application.success === false) {

    console.error(application.errors);

    throw new Error("Type error on the target swagger document");

  }

  // CREATE AN AGENT WITH THE APPLICATION

  const agent: WrtnAgent = new WrtnAgent({

    provider: {

      type: "chatgpt",

      model: "gpt-4o-mini",

      api: new OpenAI({

        apiKey: "YOUR_OPENAI_API_KEY",

      }),

    },

    controllers: [

      {

        protocol: "http",

        name: "shopping",

        application: application.data,

        connection: {

          host: "https://shopping-be.wrtn.ai",

        },

      },

    ],

    config: {

      locale: "en-US",

    },

  });

  // ADD EVENT LISTENERS

  agent.on("select", async (select) => {

    console.log("selected function", select.operation.function.name);

  });

  agent.on("execute", async (execute) => {

    consoe.log("execute function", {

      function: execute.operation.function.name,

      arguments: execute.arguments,

      value: execute.value,

    });

  });

  // CONVERSATE TO AI CHATBOT

  await agent.conversate("What you can do?");

};

main().catch(console.error);

```

Just load your swagger document, and put it into the `@wrtnlabs/agent`.

Then you can start conversation with your backend server, and the API functions of the backend server would be automatically called. AI chatbot will analyze your conversation texts, and executes proper API functions by the LLM (Large Language Model) function calling feature.

From now on, every backend developer is also an AI developer.

### Chat with TypeScript Class

```typescript

import { WrtnAgent } from "@wrtnlabs/agent";

import typia, { tags } from "typia";

import OpenAI from "openai";

class BbsArticleService {

  /**

   * Create a new article.

   *

   * Writes a new article and archives it into the DB.

   *

   * @param props Properties of create function

   * @returns Newly created article

   */

  public async create(props: {

    /**

     * Information of the article to create

     */

    input: IBbsArticle.ICreate;

  }): Promise;

  /**

   * Update an article.

   *

   * Updates an article with new content.

   *

   * @param props Properties of update function

   * @param input New content to update

   */

  public async update(props: {

    /**

     * Target article's {@link IBbsArticle.id}.

     */

    id: string & tags.Format<"uuid">;

    /**

     * New content to update.

     */

    input: IBbsArticle.IUpdate;

  }): Promise;

}

const main = async (): Promise => {

  const api: OpenAI = new OpenAI({

    apiKey: "YOUR_OPENAI_API_KEY",

  });

  const agent: WrtnAgent = new WrtnAgent({

    provider: {

      type: "chatgpt",

      model: "gpt-4o-mini",

      api: new OpenAI({

        apiKey: "YOUR_OPENAI_API_KEY",

      }),

    },

    controllers: [

      {

        protocol: "class",

        name: "vectorStore",

        application: typia.llm.applicationOfValidate<

          BbsArticleService,

          "chatgpt"

        >(),

        execute: new BbsArticleService(),

      },

    ],

  });

  await agent.conversate("I wanna write an article.");

};

main().catch(console.error);

```

You also can chat with a TypeScript class.

Just deliver the TypeScript type to the `@wrtnlabs/agent`, and start conversation. Then `@wrtnlabs/agent` will call the proper class functions by analyzing your conversation texts with LLM function calling feature.

From now on, every TypeScript classes you've developed can be the AI chatbot.

### Multi Agent Orchestration

```typescript

import { WrtnAgent } from "@wrtnlabs/agent";

import typia from "typia";

import OpenAI from "openai";

class OpenAIVectorStoreAgent {

  /**

   * Retrieve Vector DB with RAG.

   *

   * @param props Properties of Vector DB retrievelance

   */

  public query(props: {

    /**

     * Keywords to look up.

     *

     * Put all the keywords you want to look up. However, keywords

     * should only be included in the core, and all ambiguous things

     * should be excluded to achieve accurate results.

     */

    keywords: string;

  }): Promise;

}

const main = async (): Promise => {

  const api: OpenAI = new OpenAI({

    apiKey: "YOUR_OPENAI_API_KEY",

  });

  const agent: WrtnAgent = new WrtnAgent({

    provider: {

      type: "chatgpt",

      model: "gpt-4o-mini",

      api: new OpenAI({

        apiKey: "YOUR_OPENAI_API_KEY",

      }),

    },

    controllers: [

      {

        protocol: "class",

        name: "vectorStore",

        application: typia.llm.applicationOfValidate<

          OpenAIVectorStoreAgent,

          "chatgpt"

        >(),

        execute: new OpenAIVectorStoreAgent({

          api,

          id: "YOUR_OPENAI_VECTOR_STORE_ID",

        }),

      },

    ],

  });

  await agent.conversate("I wanna research economic articles");

};

main().catch(console.error);

```

In the `@wrtnlabs/agent`, you can implement multi-agent orchestration super easily.

Just develop a TypeScript class which contains agent feature like Vector Store, and just deliver the TypeScript class type to the `@wrtnlabs/agent` like above. The `@wrtnlabs/agent` will centralize and realize the multi-agent orchestration by LLM function calling strategy to the TypeScript class.

## Principles

### Agent Strategy

```mermaid

sequenceDiagram

actor User

actor Agent

participant Selector

participant Executor

participant Describer

activate User

User-->>Agent: Conversate:
user says

activate Agent

Agent->>Selector: Deliver conversation text

activate Selector

deactivate User

Note over Selector: Select or remove candidate functions

alt No candidate

  Selector->>Agent: Talk like plain ChatGPT

  deactivate Selector

  Agent->>User: Conversate:
agent says

  activate User

  deactivate User

end

deactivate Agent

loop Candidate functions exist

  activate Agent

  Agent->>Executor: Deliver conversation text

  activate Executor

  alt Contexts are enough

    Note over Executor: Call fulfilled functions

    Executor->>Describer: Function call histories

    deactivate Executor

    activate Describer

    Describer->>Agent: Describe function calls

    deactivate Describer

    Agent->>User: Conversate:
agent describes

    activate User

    deactivate User

  else Contexts are not enough

    break

      Executor->>Agent: Request more information

    end

    Agent->>User: Conversate:
agent requests

    activate User

    deactivate User

  end

  deactivate Agent

end

```

When user says, `@wrtnlabs/agent` delivers the conversation text to the `selector` agent, and let the `selector` agent to find (or cancel) candidate functions from the context. If the `selector` agent could not find any candidate function to call and there is not any candidate function previously selected either, the `selector` agent will work just like a plain ChatGPT.

And `@wrtnlabs/agent` enters to a loop statement until the candidate functions to be empty. In the loop statement, `executor` agent tries to LLM function calling by analyzing the user's conversation text. If context is enough to compose arguments of candidate functions, the `executor` agent actually calls the target functions, and let `decriber` agent to explain the function calling results. Otherwise the context is not enough to compose arguments, `executor` agent requests more information to user.

Such LLM (Large Language Model) function calling strategy separating `selector`, `executor`, and `describer` is the key logic of `@wrtnlabs/agent`.

### Validation Feedback

```typescript

import { FunctionCall } from "pseudo";

import { ILlmFunctionOfValidate, IValidation } from "typia";

export const correctFunctionCall = (p: {

  call: FunctionCall;

  functions: Array>;

  retry: (reason: string, errors?: IValidation.IError[]) => Promise;

}): Promise => {

  // FIND FUNCTION

  const func: ILlmFunctionOfValidate<"chatgpt"> | undefined =

    p.functions.find(f => f.name === p.call.name);

  if (func === undefined) {

    // never happened in my experience

    return p.retry(

      "Unable to find the matched function name. Try it again.",

    );

  }

  // VALIDATE

  const result: IValidation = func.validate(p.call.arguments);

  if (result.success === false) {

    // 1st trial: 50% (gpt-4o-mini in shopping mall chatbot)

    // 2nd trial with validation feedback: 99%

    // 3nd trial with validation feedback again: never have failed

    return p.retry(

      "Type errors are detected. Correct it through validation errors",

      {

        errors: result.errors,

      },

    );

  }

  return result.data;

}

```

Is LLM function calling perfect? 

The answer is not, and LLM (Large Language Model) providers like OpenAI take a lot of type level mistakes when composing the arguments of the target function to call. Even though an LLM function calling schema has defined an `Array` type, LLM often fills it just by a `string` typed value.

Therefore, when developing an LLM function calling agent, the validation feedback process is essentially required. If LLM takes a type level mistake on arguments composition, the agent must feedback the most detailed validation errors, and let the LLM to retry the function calling referencing the validation errors.

About the validation feedback, `@wrtnlabs/agent` is utilizing [`typia.validate()`](https://typia.io/docs/validators/validate) and [`typia.llm.applicationOfValidate()`](https://typia.io/docs/llm/application/#applicationofvalidate) functions. They construct validation logic by analyzing TypeScript source codes and types in the compilation level, so that detailed and accurate than any other validators like below.

Such validation feedback strategy and combination with `typia` runtime validator, `@wrtnlabs/agent` has achieved the most ideal LLM function calling. In my experience, when using OpenAI's `gpt-4o-mini` model, it tends to construct invalid function calling arguments at the first trial about 50% of the time. By the way, if correct it through validation feedback with `typia`, success rate soars to 99%. And I've never had a failure when trying validation feedback twice.

Components               | `typia` | `TypeBox` | `ajv` | `io-ts` | `zod` | `C.V.`

-------------------------|--------|-----------|-------|---------|-------|------------------

**Easy to use**          | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ 

[Object (simple)](https://github.com/samchon/typia/blob/master/test/src/structures/ObjectSimple.ts)          | ✔ | ✔ | ✔ | ✔ | ✔ | ✔

[Object (hierarchical)](https://github.com/samchon/typia/blob/master/test/src/structures/ObjectHierarchical.ts)    | ✔ | ✔ | ✔ | ✔ | ✔ | ✔

[Object (recursive)](https://github.com/samchon/typia/blob/master/test/src/structures/ObjectRecursive.ts)       | ✔ | ❌ | ✔ | ✔ | ✔ | ✔ | ✔

[Object (union, implicit)](https://github.com/samchon/typia/blob/master/test/src/structures/ObjectUnionImplicit.ts) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌

[Object (union, explicit)](https://github.com/samchon/typia/blob/master/test/src/structures/ObjectUnionExplicit.ts) | ✔ | ✔ | ✔ | ✔ | ✔ | ❌

[Object (additional tags)](https://github.com/samchon/typia/#comment-tags)        | ✔ | ✔ | ✔ | ✔ | ✔ | ✔

[Object (template literal types)](https://github.com/samchon/typia/blob/master/test/src/structures/TemplateUnion.ts) | ✔ | ✔ | ✔ | ❌ | ❌ | ❌

[Object (dynamic properties)](https://github.com/samchon/typia/blob/master/test/src/structures/DynamicTemplate.ts) | ✔ | ✔ | ✔ | ❌ | ❌ | ❌

[Array (rest tuple)](https://github.com/samchon/typia/blob/master/test/src/structures/TupleRestAtomic.ts) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌

[Array (hierarchical)](https://github.com/samchon/typia/blob/master/test/src/structures/ArrayHierarchical.ts)     | ✔ | ✔ | ✔ | ✔ | ✔ | ✔

[Array (recursive)](https://github.com/samchon/typia/blob/master/test/src/structures/ArrayRecursive.ts)        | ✔ | ✔ | ✔ | ✔ | ✔ | ❌

[Array (recursive, union)](https://github.com/samchon/typia/blob/master/test/src/structures/ArrayRecursiveUnionExplicit.ts) | ✔ | ✔ | ❌ | ✔ | ✔ | ❌

[Array (R+U, implicit)](https://github.com/samchon/typia/blob/master/test/src/structures/ArrayRecursiveUnionImplicit.ts)    | ✅ | ❌ | ❌ | ❌ | ❌ | ❌

[Array (repeated)](https://github.com/samchon/typia/blob/master/test/src/structures/ArrayRepeatedNullable.ts)    | ✅ | ❌ | ❌ | ❌ | ❌ | ❌

[Array (repeated, union)](https://github.com/samchon/typia/blob/master/test/structures/ArrayRepeatedUnionWithTuple.ts)    | ✅ | ❌ | ❌ | ❌ | ❌ | ❌

[**Ultimate Union Type**](https://github.com/samchon/typia/blob/master/src/schemas/IJsonSchema.ts)  | ✅ | ❌ | ❌ | ❌ | ❌ | ❌

> `C.V.` means `class-validator`

### OpenAPI Specification

```mermaid

flowchart

  subgraph "OpenAPI Specification"

    v20("Swagger v2.0") --upgrades--> emended[["OpenAPI v3.1 (emended)"]]

    v30("OpenAPI v3.0") --upgrades--> emended

    v31("OpenAPI v3.1") --emends--> emended

  end

  subgraph "OpenAPI Generator"

    emended --normalizes--> migration[["Migration Schema"]]

    migration --"Artificial Intelligence"--> lfc{{"LLM Function Calling"}}

    lfc --"OpenAI"--> chatgpt("ChatGPT")

    lfc --"Anthropic"--> claude("Claude")

    lfc --"Google"--> gemini("Gemini")

    lfc --"Meta"--> llama("Llama")

  end

```

`@wrtnlabs/agent` obtains LLM function calling schemas from both Swagger/OpenAPI documents and TypeScript class types. The TypeScript class type can be converted to LLM function calling schema by [`typia.llm.applicationOfValidate()`](https://typia.io/docs/llm/application#applicationofvalidate) function. Then how about OpenAPI document? How Swagger document can be LLM function calling schema.

The secret is on the above diagram. 

In the OpenAPI specification, there are three versions with different definitions. And even in the same version, there are too much ambiguous and duplicated expressions. To resolve these problems, [`@samchon/openapi`](https://github.com/samchon/openapi) is transforming every OpenAPI documents to v3.1 emended specification. The `@samchon/openapi`'s emended v3.1 specification has removed every ambiguous and duplicated expressions for clarity.

With the v3.1 emended OpenAPI document, `@samchon/openapi` converts it to a migration schema that is near to the function structure. And as the last step, the migration schema will be transformed to a specific LLM provider's function calling schema. LLM function calling schemas are composed like this way.

> **Why do not directly convert, but intermediate?**

>

> If directly convert from each version of OpenAPI specification to specific LLM's function calling schema, I have to make much more converters increased by cartesian product. In current models, number of converters would be 12 = 3 x 4.

>

> However, if define intermediate schema, number of converters are shrunk to plus operation. In current models, I just need to develop only (7 = 3 + 4) converters, and this is the reason why I've defined intermediate specification. This way is economic.

## Roadmap

### Guide Documents

In here README document, `@wrtnlabs/agent` is introducing its key concepts, principles, and demonstrating some examples. 

However, this contents are not fully enough for new comers of AI Chatbot development. We need much more guide documents and example projects are required for education. We have to guide backend developers to write proper definitions optimized for LLM function calling. We should introduce the best way of multi-agent orchestration implementation.

We'll write such fully detailed guide documents until 2025-03-31, and we will continuously release documents that are in the middle of being completed.

### Playground

https://nestia.io/chat/playground

I had developed Swagger AI chatbot playground website for a long time ago.

However, the another part obtaining function schemas from TypeScript class type, it is not prepared yet. I'll make the TypeScript class type based playground website by embedding TypeScript compiler (`tsc`).

The new playground website would be published until 2025-03-15.

### Optimization

As I've concenstrated on POC (Proof of Concept) development on the early stage level, internal agents composing `@wrtnlabs/agent` are not cost optimized yet. Especially, `selector` agent is consuming LLM tokens too much repeatedly. We'll optimize the `selector` agent by RAG (Retrieval Augmented Generation) skills.

Also, we will support dozens of useful add-on agents which can connect with `@wrtnlabs/agent` by TypeScript class function calling. One of them is `@wrtnlabs/hive` which optimizes `selector` agent so that reducing LLM costs dramatically. The others would be OpenAI Vector Store handler and Postgres based RAG engine.

With these `@wrtnlabs/agent` providing add-on agents, you can learn how to implement the Multi-agent orchestration through TypeScript class function calling, and understand how `@wrtnlabs/agent` makes the Multi agent system interaction super easily.

### WebSocket Module

```typescript

import { 

  IWrtnAgentEvent,

  IWrtnAgentHeader,

  IWrtnAgentListener,

  IWrtnAgentService 

} from "@wrtnlabs/agent";

import { Driver, WebSocketConnector } from "tgrid";

const main = async (): Promise => {

  const connector: WebSocketConnector<

    IWrtnAgentHeader,

    IWrtnAgentListener,

    IWrtnAgentService

  > = new WebSocketConnector(

    {

      locale: "en-US",

    } satisfies IWrtnAgentHeader,

    {

      // EVENT LISTENERS

      conversate: async (event: IWrtnAgentEvent.IConversate) => {},

      select: async (event: IWrtnAgentEvent.ISelect) => {},

      execute: async (event: IWrtnAgentEvent.IExecute) => {},

      describe: async (event: IWrtnAgentEvent.IDescribe) => {},

    } satisfies IWrtnAgentService,

  );

  await connector.connect("http://localhost:3000/chat");

  // CALL SERVER'S FUNCTION

  const service: Driver = connector.getDriver();

  await service.conversate("Hello, what you can do?");

};

main().catch(console.error);

```

`@wrtnlabs/agent` will provider WebSocket module of RPC (Remote Procedure Call) paradigm.

It will help easy integration with agent backend server and AI chatbot frontend application.

We're going to support this feature until 2025-02-28.

### LLM Providers

```mermaid

flowchart

Agent["@wrtnlabs/agent"]

OpenAI("OpenAI")

Claude

Llama

Gemini

Agent==supports==>OpenAI

Agent-.not yet.->Claude

Agent-.not yet.->Llama

Agent-.not yet.->Gemini

```

Currently, `@wrtnlabs/agent` supports only OpenAI. 

It is because `@wrtnlabs/agent` is still in the POC (Proof of Concept) and demonstration stage. However, even nthough OpenAI is the most famous model in the AI world, `@wrtnlabs/agent` have to support much more models for broad users.

We're going to support much more models until 2025-04-30.
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/wrtnlabs/agent

Awesome Lists containing this project

README