https://github.com/datastax/astra-assistants-api

Drop in replacement for the OpenAI Assistants API

assistants assistants-api claude cohere gemini gpt-4 groq llama3-1 ollama openai-assistants openai-assistants-api vector

Last synced: 8 months ago

Host: GitHub
URL: https://github.com/datastax/astra-assistants-api
Owner: datastax
License: apache-2.0
Created: 2023-11-15T06:10:19.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2025-03-17T15:01:16.000Z (9 months ago)
Last Synced: 2025-04-07T17:04:37.771Z (8 months ago)
Topics: assistants, assistants-api, claude, cohere, gemini, gpt-4, groq, llama3-1, ollama, openai-assistants, openai-assistants-api, vector
Language: Python
Homepage:
Size: 3.81 MB
Stars: 187
Watchers: 10
Forks: 22
Open Issues: 18
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE

Awesome Lists containing this project

jimsghstars - datastax/astra-assistants-api - Drop in replacement for the OpenAI Assistants API (Python)
awesome_ai_agents - Astra Assistants API - The `astra-assistants-api` provides a backend implementation of the OpenAI Assistants API with support for various features like persistent threads, files, assistants, streaming, function calling, and more, utilizing AstraDB powered by Apache Cassandra and jvector, and is compatible with existing OpenAI apps by changing a single line of code [github](https://github.com/datastax/astra-assistants-api) (Learning / Repositories)

README

# Astra Assistant API Service

[![commits](https://img.shields.io/github/commit-activity/m/datastax/astra-assistants-api)](https://github.com/datastax/astra-assistants-api/commits/main)

[![Github Last Commit](https://img.shields.io/github/last-commit/datastax/astra-assistants-api)](https://github.com/datastax/astra-assistants-api/commits/main)

[![Run tests](https://github.com/datastax/astra-assistants-api/actions/workflows/run-tests.yml/badge.svg?branch=main)](https://github.com/datastax/astra-assistants-api/actions/workflows/run-tests.yml)

[![Docker build and publish](https://github.com/datastax/astra-assistants-api/actions/workflows/docker.yml/badge.svg)](https://github.com/datastax/astra-assistants-api/actions/workflows/docker.yml)

[![PyPI - Downloads](https://img.shields.io/pypi/dw/astra-assistants?label=pypi%20downloads)](https://badge.fury.io/py/astra-assistants)

[![Dockerhub](https://img.shields.io/static/v1?label=Pull%20from&message=DockerHub&color=blue&logo=Docker&style=flat-square)](https://hub.docker.com/r/datastax/astra-assistants)

[![Discord chat](https://img.shields.io/static/v1?label=Chat%20on&message=Discord&color=blue&logo=Discord&style=flat-square)](https://discord.gg/MEFVXUvsuy)

[![Stars](https://img.shields.io/github/stars/datastax/astra-assistants-api?style=social)](https://github.com/datastax/astra-assistants-api/stargazers)

### An Open Source drop-in compatible service for the OpenAI Assistants API v2

![create_assistant](images/create_assistant.gif)

Astra Assistants supports streaming, persistent threads, files, vector_stores, assistants, retrieval, function calling and more using [AstraDB](https://astra.datastax.com) (DataStax's db as a service offering powered by [Apache Cassandra](https://cassandra.apache.org/_/index.html) and [jvector](https://github.com/jbellis/jvector)).

Supports dozens of third party LLM providers (or even local models) for both completion and embeddings (powered by [LiteLLM](https://github.com/BerriAI/litellm)). 

You can use our hosted Astra Assistants service, or host the open source API server yourself.

## Client Getting Started [![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/gist/phact/a80dc113dd637ba4c4193415e69198c6/assistants_api_overview_python.ipynb)

The client code lives under [clients/](./clients/). To build an app that uses the Astra Assistants service, install the [astra-assistants](https://pypi.org/project/astra-assistants/) Python library with your favorite package manager, for example:

```

poetry add astra_assistants

```

[Sign up for Astra and get an Admin API token](https://astra.datastax.com/signup).

Set your environment variables (depending on which LLMs you want to use); see the [.env.bkp](./.env.bkp) file for an example:

```

#!/bin/bash

# AstraDB -> https://astra.datastax.com/ --> tokens --> administrator user --> generate

export ASTRA_DB_APPLICATION_TOKEN=""

# OpenAI Models - https://platform.openai.com/api-keys --> create new secret key

export OPENAI_API_KEY=""

# Groq Models - https://console.groq.com/keys

export GROQ_API_KEY=""

# Anthropic claude models - https://console.anthropic.com/settings/keys

export ANTHROPIC_API_KEY=""

# Gemini models -> https://makersuite.google.com/app/apikey

export GEMINI_API_KEY=""

# Perplexity models -> https://www.perplexity.ai/settings/api  --> generate

export PERPLEXITYAI_API_KEY=""

# Cohere models -> https://dashboard.cohere.com/api-keys

export COHERE_API_KEY=""

# Bedrock models -> https://docs.aws.amazon.com/bedrock/latest/userguide/setting-up.html

export AWS_REGION_NAME=""

export AWS_ACCESS_KEY_ID=""

export AWS_SECRET_ACCESS_KEY=""

# vertexai models https://console.cloud.google.com/vertex-ai

export GOOGLE_JSON_PATH=""

export GOOGLE_PROJECT_ID=""

# ... for all models see the .env.bkp file

```
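Before patching the client, it can help to confirm that the keys you expect are actually exported. The following is a minimal sketch using only the standard library. `PROVIDER_ENV_VARS` and `missing_env_vars` are hypothetical names chosen for illustration (they are not part of `astra_assistants`), and the variable names are copied from the list above.

```python
import os

# Environment variables per provider, copied from the example above.
# Illustrative only, not exhaustive; see .env.bkp for the full list.
PROVIDER_ENV_VARS = {
    "astra": ["ASTRA_DB_APPLICATION_TOKEN"],
    "openai": ["OPENAI_API_KEY"],
    "groq": ["GROQ_API_KEY"],
    "anthropic": ["ANTHROPIC_API_KEY"],
    "gemini": ["GEMINI_API_KEY"],
    "perplexity": ["PERPLEXITYAI_API_KEY"],
    "cohere": ["COHERE_API_KEY"],
    "bedrock": ["AWS_REGION_NAME", "AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY"],
    "vertexai": ["GOOGLE_JSON_PATH", "GOOGLE_PROJECT_ID"],
}


def missing_env_vars(providers, environ=None):
    """Return the names of required variables that are unset or empty."""
    environ = os.environ if environ is None else environ
    missing = []
    for provider in providers:
        for name in PROVIDER_ENV_VARS[provider]:
            if not environ.get(name):
                missing.append(name)
    return missing


if __name__ == "__main__":
    # Check the database token plus whichever providers you plan to use.
    problems = missing_env_vars(["astra", "openai"])
    if problems:
        print("Missing environment variables:", ", ".join(problems))
    else:
        print("All required environment variables are set.")
```

Passing a plain dictionary as `environ` makes the helper easy to exercise without touching the real environment.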

Then import and patch your client:

```python
from openai import OpenAI
from astra_assistants import patch

client = patch(OpenAI())
```

Using your token, the system creates a database named `assistant_api_db` on your behalf. As a result, the first request will hang until the database is ready (this can take a couple of minutes). This only happens once.

Now you're ready to create an assistant:
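If something between your code and the service (a proxy, a notebook runtime, a client-side timeout) gives up during that initial wait, retrying the first call is one way to cope. The helper below is a generic sketch and not part of `astra_assistants`; the name `call_with_retries` and its defaults are assumptions made for illustration.

```python
import time


def call_with_retries(fn, attempts=5, delay_seconds=30.0, sleep=time.sleep):
    """Call fn() until it succeeds or attempts run out, pausing between tries."""
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return fn()
        except Exception as error:  # deliberately broad: this is only a sketch
            last_error = error
            if attempt < attempts:
                sleep(delay_seconds)
    # Every attempt failed, so surface the most recent error to the caller.
    raise last_error
```

With a patched client, the first request could then be wrapped as `call_with_retries(lambda: client.beta.assistants.create(...))`. In real code you would probably narrow the caught exception to timeout and connection errors.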

```python
assistant = client.beta.assistants.create(
  instructions="You are a personal math tutor. When asked a math question, write and run code to answer the question.",
  model="gpt-4-1106-preview",
  tools=[{"type": "retrieval"}]
)
```

By default, the service uses [AstraDB](https://astra.datastax.com/signup) as the database/vector store and OpenAI for embeddings and chat completion.

## Third party LLM Support

We now support [many third party models](https://docs.litellm.ai/docs/providers) for both embeddings and completion thanks to [litellm](https://github.com/BerriAI/litellm). Pass your provider's API key in the `api-key` header and the embedding model in the `embedding-model` header.

You can pass different models; just make sure the corresponding API key is set in your environment.

```python
model="gpt-4-1106-preview"
#model="gpt-3.5-turbo"
#model="cohere_chat/command-r"
#model="perplexity/mixtral-8x7b-instruct"
#model="perplexity/llama-3-sonar-large-32k-online"
#model="anthropic.claude-v2"
#model="gemini/gemini-1.5-pro-latest"
#model="meta.llama2-13b-chat-v1"

assistant = client.beta.assistants.create(
    name="Math Tutor",
    instructions="You are a personal math tutor. Answer questions briefly, in a sentence or less.",
    model=model,
)
```
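The `api-key` and `embedding-model` headers mentioned above can be assembled as a plain dictionary. This is a minimal sketch: `provider_headers` is a hypothetical helper, the key value is a placeholder, and whether you need to set these headers yourself when using `patch` is not stated here, so treat the usage comment as an assumption.

```python
def provider_headers(api_key, embedding_model=None):
    """Build the extra HTTP headers described above."""
    headers = {"api-key": api_key}
    if embedding_model is not None:
        headers["embedding-model"] = embedding_model
    return headers


# Placeholder values for illustration only.
headers = provider_headers("my-provider-key", "text-embedding-3-large")
print(headers)

# The openai Python SDK accepts a `default_headers` argument on the client,
# so the headers could be attached to every request like this (assumption:
# the server you are talking to reads them as described above):
#   client = OpenAI(default_headers=headers)
```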

For third party embedding models, `client.files.create` accepts an `embedding_model` argument:

```python
file = client.files.create(
    file=open(
        "./test/language_models_are_unsupervised_multitask_learners.pdf",
        "rb",
    ),
    purpose="assistants",
    embedding_model="text-embedding-3-large",
)
```

To run the examples using poetry, first install the dependencies from this directory:

    poetry install

Create your .env file and add your keys to it:

    cp .env.bkp .env

Then run any of the examples:

    poetry run python examples/python/chat_completion/basic.py

    poetry run python examples/python/retrieval/basic.py

    poetry run python examples/python/streaming_retrieval/basic.py

    poetry run python examples/python/function_calling/basic.py

## Running yourself

### Docker

With Docker, first pull the image from Docker Hub:

    docker pull datastax/astra-assistants

Or pull a specific version if you don't want `latest`:

    docker pull datastax/astra-assistants:v0.2.12

Then run it (`-p` maps the container's port 8080 to your host's port 8080):

    docker run -p 8080:8080 datastax/astra-assistants
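Once the container is running, a quick TCP check confirms that something is listening on the mapped port. This is a standard-library sketch; `port_is_open` is a hypothetical helper, and it only tests that the port accepts connections, not that the API itself is healthy.

```python
import socket


def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False


if __name__ == "__main__":
    print("server reachable:", port_is_open("localhost", 8080))
```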

### Locally with poetry

To run the server locally with poetry:

    poetry install

    poetry run python run.py

### Docker-compose with ollama

To integrate with ollama, use docker-compose:

    cd examples/ollama/gpu  # or examples/ollama/cpu for CPU only; GPU needs docker-toolkit

    docker-compose up -d

Before using a model, pull it into ollama:

    curl http://localhost:11434/api/pull -d '{ "name": "deepseek-coder-v2" }'

Your assistants client routes to the ollama container through the `OLLAMA_API_BASE_URL` setting. Set it to `http://ollama:11434` if you are using docker-compose, or to `http://localhost:11434` if ollama runs on your localhost.
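The pull step can also be scripted. The sketch below builds the same POST request as the `curl` command above using only the standard library; `ollama_base_url` and `build_pull_request` are hypothetical helper names. Note that `http://ollama:11434` only resolves inside the docker-compose network, so a script run from the host would normally use the localhost address.

```python
import json
import os
import urllib.request

DEFAULT_OLLAMA_URL = "http://localhost:11434"


def ollama_base_url(environ=None):
    """Read OLLAMA_API_BASE_URL, falling back to the localhost address."""
    environ = os.environ if environ is None else environ
    return environ.get("OLLAMA_API_BASE_URL", DEFAULT_OLLAMA_URL).rstrip("/")


def build_pull_request(model, base_url):
    """Build (but do not send) the request issued by the curl command above."""
    body = json.dumps({"name": model}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/api/pull",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    request = build_pull_request("deepseek-coder-v2", ollama_base_url())
    print(request.get_method(), request.full_url)
    # To actually start the download (this can take a long time):
    #   with urllib.request.urlopen(request) as response:
    #       for line in response:
    #           print(line.decode().strip())
```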

## Feedback / Help

For help or feedback, file an [issue](https://github.com/datastax/astra-assistants-api/issues) or reach out to us on [Discord](https://discord.com/invite/MEFVXUvsuy).

## Contributing

Check out our [contributing guide](./CONTRIBUTING.md)

## Coverage

See our coverage report [here](./coverage.md)

## Roadmap

 - [X] Support for other embedding models and LLMs

 - [X] function calling

 - [X] Streaming support

 - [X] Assistants V2 with vector_store support

This project is not associated with OpenAI or any of the third party models we support. It is an open source project that aims to provide a drop-in compatible service for the OpenAI Assistants API.
