Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/unifyai/unify
LLMs Run Riot in Production. Get Back in The Driving Seat. Build Your Own Evals, Iterate Quickly, and Go from Prototype to Production in No Time ⚡
- Host: GitHub
- URL: https://github.com/unifyai/unify
- Owner: unifyai
- License: apache-2.0
- Created: 2024-03-26T00:59:24.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2024-11-08T12:48:56.000Z (about 1 month ago)
- Last Synced: 2024-11-30T00:33:52.652Z (13 days ago)
- Topics: ai, claude, gpt, gpt-4, llama2, llm, llm-inference, llms, mixtral, openai, python
- Language: Python
- Homepage: https://unify.ai
- Size: 1.24 MB
- Stars: 187
- Watchers: 4
- Forks: 23
- Open Issues: 5
- Metadata Files:
  - Readme: README.md
  - License: LICENSE
Awesome Lists containing this project
- jimsghstars - unifyai/unify - LLMs Run Riot in Production. Get Back in The Driving Seat. Build Your Own Evals, Iterate Quickly, and Go from Prototype to Production in No Time ⚡ (Python)
README
# Unify
We're on a mission to simplify the LLM landscape. Unify lets you:
* **🔑 Use any LLM from any Provider**: With a single interface, you can use all LLMs from all providers by simply changing one string (see the sketch just after this list). No need to manage several API keys or handle different input-output formats. Unify handles all of that for you!
* **📊 Improve LLM Performance**: Add your own custom tests and evals, and benchmark your own prompts across all models and providers. Compare quality, cost and speed, and iterate on your system prompt until all test cases pass and you can deploy your app!
* **🔀 Route to the Best LLM**: Improve quality, cost and speed by routing to the perfect model and provider for each individual prompt.
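
As a quick taste of the single-interface idea, here is a minimal sketch (it uses the client introduced in the Quickstart below; the endpoint strings are just examples, and your `UNIFY_KEY` is assumed to be set as an environment variable):

```python
import unify

# The same application code works across models and providers:
# only the "model@provider" endpoint string changes.
for endpoint in ("gpt-4o@openai", "claude-3-haiku@anthropic", "llama-3-8b-chat@fireworks-ai"):
    client = unify.Unify(endpoint)  # assumes UNIFY_KEY is set in your environment
    print(endpoint, "->", client.generate("Say hello in five words."))
```
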
## Quickstart
Simply install the package:

```bash
pip install unifyai
```

Then [sign up](https://console.unify.ai) to get your API key, and you're ready to go! 🚀
```python
import unify
client = unify.Unify("gpt-4o@openai", api_key="YOUR_API_KEY")  # replace with your Unify API key
client.generate("hello world!")
```

> [!NOTE]
> We recommend using [python-dotenv](https://pypi.org/project/python-dotenv/)
> to add `UNIFY_KEY="My API Key"` to your `.env` file, avoiding the need to use the `api_key` argument as above.
> For the rest of the README, **we will assume you set your key as an environment variable.**
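
For instance, a minimal sketch of this setup might look as follows (assuming a `.env` file containing `UNIFY_KEY="My API Key"` sits in your working directory, and that python-dotenv is installed):

```python
import unify
from dotenv import load_dotenv  # pip install python-dotenv

load_dotenv()  # loads UNIFY_KEY from the .env file into the environment

# No api_key argument needed: the key is picked up from the environment.
client = unify.Unify("gpt-4o@openai")
print(client.generate("hello world!"))
```
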
### Listing Models, Providers and Endpoints

You can list all models, providers and endpoints (`model@provider` pairs) as follows:
```python
models = unify.list_models()
providers = unify.list_providers()
endpoints = unify.list_endpoints()
```

You can also filter within these functions as follows:
```python
import random
anthropic_models = unify.list_models("anthropic")
client.set_endpoint(random.choice(anthropic_models) + "@anthropic")

latest_llama3p1_providers = unify.list_providers("llama-3.1-405b-chat")
client.set_endpoint("llama-3.1-405b-chat@" + random.choice(latest_llama3p1_providers))

openai_endpoints = unify.list_endpoints("openai")
client.set_endpoint(random.choice(openai_endpoints))

mixtral8x7b_endpoints = unify.list_endpoints("mixtral-8x7b-instruct-v0.1")
client.set_endpoint(random.choice(mixtral8x7b_endpoints))
```
### Changing Models, Providers and Endpoints
If you want to change the `endpoint`, `model` or `provider`, you can do so using the `.set_endpoint`, `.set_model` and `.set_provider` methods respectively.
```python
client.set_endpoint("mistral-7b-instruct-v0.3@deepinfra")
client.set_model("mistral-7b-instruct-v0.3")
client.set_provider("deepinfra")
```

### Custom Prompting
You can influence the model's persona using the `system_message` argument in the `.generate` function:
```python
response = client.generate(
    user_message="Hello Llama! Who was Isaac Newton?",
    system_message="You should always talk in rhymes",
)
```

If you'd like to send multiple messages using the `.generate` function, you should use the `messages` argument as follows:
```python
messages = [
    {"role": "user", "content": "Who won the world series in 2020?"},
    {"role": "assistant", "content": "The Los Angeles Dodgers won the World Series in 2020."},
    {"role": "user", "content": "Where was it played?"},
]
res = client.generate(messages=messages)
```

### Default Arguments
When querying LLMs, you often want to keep many aspects of your prompt fixed,
and only change a small subset of the prompt on each subsequent call.

For example, you might want to fix the temperature, the system prompt,
and the tools available, whilst passing different user messages coming from a downstream
application. All of the clients in unify make this very simple via default arguments,
which can be specified in the constructor,
and can also be set at any time using setter methods.

For example, the following code will pass `temperature=0.5` to all subsequent requests,
without needing to be repeatedly passed into the `.generate()` method.

```python
client = unify.Unify("claude-3-haiku@anthropic", temperature=0.5)
client.generate("Hello world!")
client.generate("What a nice day.")
```

All parameters can also be retrieved by getters, and set via setters:
```python
client = unify.Unify("claude-3-haiku@anthropic", temperature=0.5)
print(client.temperature) # 0.5
client.set_temperature(1.0)
print(client.temperature) # 1.0
```

Passing a value to the `.generate()` method will *overwrite* the default value specified
for the client.

```python
client = unify.Unify("claude-3-haiku@anthropic", temperature=0.5)
client.generate("Hello world!") # temperature of 0.5
client.generate("What a nice day.", temperature=1.0) # temperature of 1.0
```

## Asynchronous Usage
For optimal performance in handling multiple user requests simultaneously,
such as in a chatbot application, processing them asynchronously is recommended.
A minimal example using `AsyncUnify` is given below:

```python
import unify
import asyncio
async_client = unify.AsyncUnify("llama-3-8b-chat@fireworks-ai")
asyncio.run(async_client.generate("Hello Llama! Who was Isaac Newton?"))
```

For a more applied example,
processing multiple requests in parallel can then be done as follows:

```python
import unify
import asyncio
clients = dict()
clients["gpt-4o@openai"] = unify.AsyncUnify("gpt-4o@openai")
clients["claude-3-opus@anthropic"] = unify.AsyncUnify("claude-3-opus@anthropic")
clients["llama-3-8b-chat@fireworks-ai"] = unify.AsyncUnify("llama-3-8b-chat@fireworks-ai")async def generate_responses(user_message: str):
    responses_ = dict()
    for endpoint_, client in clients.items():
        responses_[endpoint_] = await client.generate(user_message)
    return responses_


responses = asyncio.run(generate_responses("Hello, how's it going?"))
for endpoint, response in responses.items():
print("endpoint: {}".format(endpoint))
print("response: {}\n".format(response))
```

Functionality-wise, the asynchronous and synchronous clients are identical.
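
Note that the loop above awaits each request one after the other. If you want the requests to actually run concurrently, you can schedule them together with `asyncio.gather`. Below is a minimal sketch reusing the `clients` dictionary from the example above (the helper name is just illustrative):

```python
async def generate_responses_concurrently(user_message: str):
    # Fire off all requests at once and wait for them to complete together.
    results = await asyncio.gather(
        *(client.generate(user_message) for client in clients.values())
    )
    return dict(zip(clients.keys(), results))


responses = asyncio.run(generate_responses_concurrently("Hello, how's it going?"))
```
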
## Streaming Responses
You can enable streaming responses by setting `stream=True` in the `.generate` function.

```python
import unify
client = unify.Unify("llama-3-8b-chat@fireworks-ai")
stream = client.generate("Hello Llama! Who was Isaac Newton?", stream=True)
for chunk in stream:
    print(chunk, end="")
```

It works in exactly the same way with async clients.
```python
import unify
import asyncio
async_client = unify.AsyncUnify("llama-3-8b-chat@fireworks-ai")async def stream():
    async_stream = await async_client.generate("Hello Llama! Who was Isaac Newton?", stream=True)
    async for chunk in async_stream:
        print(chunk, end="")


asyncio.run(stream())
```
## Dive Deeper

To learn more about our more advanced API features, benchmarking, and LLM routing,
go check out our comprehensive [docs](https://unify.ai/docs)!