Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/weaviate/verba
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
https://github.com/weaviate/verba
Last synced: 6 days ago
JSON representation
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
- Host: GitHub
- URL: https://github.com/weaviate/verba
- Owner: weaviate
- License: bsd-3-clause
- Created: 2023-07-28T16:53:42.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-10-21T11:46:15.000Z (3 months ago)
- Last Synced: 2024-10-29T15:34:33.825Z (3 months ago)
- Language: TypeScript
- Homepage:
- Size: 43.3 MB
- Stars: 6,207
- Watchers: 68
- Forks: 667
- Open Issues: 44
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
README
# Verba
## The Golden RAGtriever - Community Edition β¨
[![Weaviate](https://img.shields.io/static/v1?label=powered%20by&message=Weaviate%20%E2%9D%A4&color=green&style=flat-square)](https://weaviate.io/)
[![PyPi downloads](https://static.pepy.tech/personalized-badge/goldenverba?period=total&units=international_system&left_color=grey&right_color=orange&left_text=pip%20downloads)](https://pypi.org/project/goldenverba/) [![Docker support](https://img.shields.io/badge/Docker_support-%E2%9C%93-4c1?style=flat-square&logo=docker&logoColor=white)](https://docs.docker.com/get-started/) [![Demo](https://img.shields.io/badge/Check%20out%20the%20demo!-yellow?&style=flat-square&logo=react&logoColor=white)](https://verba.weaviate.io/)Welcome to Verba: The Golden RAGtriever, an community-driven open-source application designed to offer an end-to-end, streamlined, and user-friendly interface for Retrieval-Augmented Generation (RAG) out of the box. In just a few easy steps, explore your datasets and extract insights with ease, either locally with Ollama and Huggingface or through LLM providers such as Anthrophic, Cohere, and OpenAI. This project is built with and for the community, please be aware that it might not be maintained with the same urgency as other Weaviate production applications. Feel free to contribute to the project and help us make Verba even better! <3
```
pip install goldenverba
```![Demo of Verba](https://github.com/weaviate/Verba/blob/2.0.0/img/verba.gif)
- [Verba](#verba)
- [π― What Is Verba?](#what-is-verba)
- [β¨ Features](#feature-lists)
- [β¨ Getting Started with Verba](#getting-started-with-verba)
- [π API Keys](#api-keys)
- [Weaviate](#weaviate)
- [Ollama](#ollama)
- [Unstructured](#unstructured)
- [AssemblyAI](#assemblyai)
- [OpenAI](#openai)
- [HuggingFace](#huggingface)
- [Groq](#groq)
- [Quickstart: Deploy with pip](#how-to-deploy-with-pip)
- [Quickstart: Build from Source](#how-to-build-from-source)
- [Quickstart: Deploy with Docker](#how-to-install-verba-with-docker)
- [πΎ Verba Walkthrough](#οΈverba-walkthrough)
- [π Open Source Contribution](#open-source-contribution)
- [π© Known Issues](#known-issues)
- [βFAQ](#faq)## What Is Verba?
Verba is a fully-customizable personal assistant utilizing [Retrieval Augmented Generation (RAG)](https://weaviate.io/rag#:~:text=RAG%20with%20Weaviate,accuracy%20of%20AI%2Dgenerated%20content.) for querying and interacting with your data, **either locally or deployed via cloud**. Resolve questions around your documents, cross-reference multiple data points or gain insights from existing knowledge bases. Verba combines state-of-the-art RAG techniques with Weaviate's context-aware database. Choose between different RAG frameworks, data types, chunking & retrieving techniques, and LLM providers based on your individual use-case.
## Open Source Spirit
**Weaviate** is proud to offer this open-source project for the community. While we strive to address issues as fast as we can, please understand that it may not be maintained with the same rigor as production software. We welcome and encourage community contributions to help keep it running smoothly. Your support in fixing open issues quickly is greatly appreciated.
### Watch our newest Verba video here:
[![VIDEO LINK](https://github.com/weaviate/Verba/blob/main/img/thumbnail.png)](https://www.youtube.com/watch?v=2VCy-YjRRhA&t=40s&ab_channel=Weaviate%E2%80%A2VectorDatabase)
## Feature Lists
| π€ Model Support | Implemented | Description |
| --------------------------------- | ----------- | ------------------------------------------------------- |
| Ollama (e.g. Llama3) | β | Local Embedding and Generation Models powered by Ollama |
| HuggingFace (e.g. MiniLMEmbedder) | β | Local Embedding Models powered by HuggingFace |
| Cohere (e.g. Command R+) | β | Embedding and Generation Models by Cohere |
| Anthrophic (e.g. Claude Sonnet) | β | Embedding and Generation Models by Anthrophic |
| OpenAI (e.g. GPT4) | β | Embedding and Generation Models by OpenAI |
| Groq (e.g. Llama3) | β | Generation Models by Groq (LPU inference) |
| Upstage (e.g. Solar) | β | Embedding and Generation Models by Upstage || π€ Embedding Support | Implemented | Description |
| -------------------- | ----------- | ---------------------------------------- |
| Weaviate | β | Embedding Models powered by Weaviate |
| Ollama | β | Local Embedding Models powered by Ollama |
| SentenceTransformers | β | Embedding Models powered by HuggingFace |
| Cohere | β | Embedding Models by Cohere |
| VoyageAI | β | Embedding Models by VoyageAI |
| OpenAI | β | Embedding Models by OpenAI |
| Upstage | β | Embedding Models by Upstage || π Data Support | Implemented | Description |
| -------------------------------------------------------- | ----------- | ---------------------------------------------- |
| [UnstructuredIO](https://docs.unstructured.io/welcome) | β | Import Data through Unstructured |
| [Firecrawl](https://www.firecrawl.dev/) | β | Scrape and Crawl URL through Firecrawl |
| [UpstageDocumentParse](https://upstage.ai/) | β | Parse Documents through Upstage Document AI |
| PDF Ingestion | β | Import PDF into Verba |
| GitHub & GitLab | β | Import Files from Github and GitLab |
| CSV/XLSX Ingestion | β | Import Table Data into Verba |
| .DOCX | β | Import .docx files |
| Multi-Modal (using [AssemblyAI](https://assemblyai.com)) | β | Import and Transcribe Audio through AssemblyAI || β¨ RAG Features | Implemented | Description |
| ----------------------- | --------------- | ------------------------------------------------------------------------- |
| Hybrid Search | β | Semantic Search combined with Keyword Search |
| Autocomplete Suggestion | β | Verba suggests autocompletion |
| Filtering | β | Apply Filters (e.g. documents, document types etc.) before performing RAG |
| Customizable Metadata | β | Free control over Metadata |
| Async Ingestion | β | Ingest data asynchronously to speed up the process |
| Advanced Querying | planned β±οΈ | Task Delegation Based on LLM Evaluation |
| Reranking | planned β±οΈ | Rerank results based on context for improved results |
| RAG Evaluation | planned β±οΈ | Interface for Evaluating RAG pipelines |
| Agentic RAG | out of scope β | Agentic RAG pipelines |
| Graph RAG | out of scope β | Graph-based RAG pipelines || π‘οΈ Chunking Techniques | Implemented | Description |
| ---------------------- | ----------- | ------------------------------------------------------- |
| Token | β | Chunk by Token powered by [spaCy](https://spacy.io/) |
| Sentence | β | Chunk by Sentence powered by [spaCy](https://spacy.io/) |
| Semantic | β | Chunk and group by semantic sentence similarity |
| Recursive | β | Recursively chunk data based on rules |
| HTML | β | Chunk HTML files |
| Markdown | β | Chunk Markdown files |
| Code | β | Chunk Code files |
| JSON | β | Chunk JSON files || π Cool Bonus | Implemented | Description |
| ------------------------ | --------------- | ------------------------------------------------------- |
| Docker Support | β | Verba is deployable via Docker |
| Customizable Frontend | β | Verba's frontend is fully-customizable via the frontend |
| Vector Viewer | β | Visualize your data in 3D |
| Multi-User Collaboration | out of scope β | Multi-User Collaboration in Verba || π€ RAG Libraries | Implemented | Description |
| ---------------- | ----------- | ---------------------------------- |
| LangChain | β | Implement LangChain RAG pipelines |
| Haystack | planned β±οΈ | Implement Haystack RAG pipelines |
| LlamaIndex | planned β±οΈ | Implement LlamaIndex RAG pipelines |> Something is missing? Feel free to create a new issue or discussion with your idea!
![Showcase of Verba](https://github.com/weaviate/Verba/blob/2.0.0/img/verba_screen.png)
---
# Getting Started with Verba
You have three deployment options for Verba:
- Install via pip
```
pip install goldenverba
```- Build from Source
```
git clone https://github.com/weaviate/Verbapip install -e .
```- Use Docker for Deployment
**Prerequisites**: If you're not using Docker, ensure that you have `Python >=3.10.0,<3.13.0` installed on your system.
```
git clone https://github.com/weaviate/Verbadocker compose --env-file up -d --build
```If you're unfamiliar with Python and Virtual Environments, please read the [python tutorial guidelines](./PYTHON_TUTORIAL.md).
# API Keys and Environment Variables
You can set all API keys in the Verba frontend, but to make your life easier, we can also prepare a `.env` file in which Verba will automatically look for the keys. Create a `.env` in the same directory you want to start Verba in. You can find an `.env.example` file in the [goldenverba](./goldenverba/.env.example) directory.
> Make sure to only set environment variables you intend to use, environment variables with missing or incorrect values may lead to errors.
Below is a comprehensive list of the API keys and variables you may require:
| Environment Variable | Value | Description |
| ---------------------- | ---------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------- |
| WEAVIATE_URL_VERBA | URL to your hosted Weaviate Cluster | Connect to your [WCS](https://console.weaviate.cloud/) Cluster |
| WEAVIATE_API_KEY_VERBA | API Credentials to your hosted Weaviate Cluster | Connect to your [WCS](https://console.weaviate.cloud/) Cluster |
| ANTHROPIC_API_KEY | Your Anthropic API Key | Get Access to [Anthropic](https://www.anthropic.com/) Models |
| OPENAI_API_KEY | Your OpenAI Key | Get Access to [OpenAI](https://openai.com/) Models |
| OPENAI_BASE_URL | URL to OpenAI instance | Models |
| COHERE_API_KEY | Your API Key | Get Access to [Cohere](https://cohere.com/) Models |
| GROQ_API_KEY | Your Groq API Key | Get Access to [Groq](https://groq.com/) Models |
| OLLAMA_URL | URL to your Ollama instance (e.g. http://localhost:11434 ) | Get Access to [Ollama](https://ollama.com/) Models |
| UNSTRUCTURED_API_KEY | Your API Key | Get Access to [Unstructured](https://docs.unstructured.io/welcome) Data Ingestion |
| UNSTRUCTURED_API_URL | URL to Unstructured Instance | Get Access to [Unstructured](https://docs.unstructured.io/welcome) Data Ingestion |
| ASSEMBLYAI_API_KEY | Your API Key | Get Access to [AssemblyAI](https://assemblyai.com) Data Ingestion |
| GITHUB_TOKEN | Your GitHub Token | Get Access to Data Ingestion via GitHub |
| GITLAB_TOKEN | Your GitLab Token | Get Access to Data Ingestion via GitLab |
| FIRECRAWL_API_KEY | Your Firecrawl API Key | Get Access to Data Ingestion via Firecrawl |
| VOYAGE_API_KEY | Your VoyageAI API Key | Get Access to Embedding Models via VoyageAI |
| EMBEDDING_SERVICE_URL | URL to your Embedding Service Instance | Get Access to Embedding Models via [Weaviate Embedding Service](https://weaviate.io/developers/wcs/embeddings) |
| EMBEDDING_SERVICE_KEY | Your Embedding Service Key | Get Access to Embedding Models via [Weaviate Embedding Service](https://weaviate.io/developers/wcs/embeddings) |
| UPSTAGE_API_KEY | Your Upstage API Key | Get Access to [Upstage](https://upstage.ai/) Models |
| UPSTAGE_BASE_URL | URL to Upstage instance | Models |
| DEFAULT_DEPLOYMENT | Local, Weaviate, Custom, Docker | Set the default deployment mode |![API Keys in Verba](https://github.com/weaviate/Verba/blob/2.0.0/img/api_screen.png)
## Weaviate
Verba provides flexibility in connecting to Weaviate instances based on your needs. You have three options:
1. **Local Deployment**: Use Weaviate Embedded which runs locally on your device (except Windows, choose the Docker/Cloud Deployment)
2. **Docker Deployment**: Choose this option when you're running Verba's Dockerfile.
3. **Cloud Deployment**: Use an existing Weaviate instance hosted on WCD to run Verba**π» Weaviate Embedded**
Embedded Weaviate is a deployment model that runs a Weaviate instance from your application code rather than from a stand-alone Weaviate server installation. When you run Verba in `Local Deployment`, it will setup and manage Embedded Weaviate in the background. Please note that Weaviate Embedded is not supported on Windows and is in Experimental Mode which can bring unexpected errors. We recommend using the Docker Deployment or Cloud Deployment instead. You can read more about Weaviate Embedded [here](https://weaviate.io/developers/weaviate/installation/embedded).**π©οΈ Weaviate Cloud Deployment (WCD)**
If you prefer a cloud-based solution, Weaviate Cloud (WCD) offers a scalable, managed environment. Learn how to set up a cloud cluster and get the API keys by following the [Weaviate Cluster Setup Guide](https://weaviate.io/developers/wcs/guides/create-instance).
**π³ Docker Deployment**
Another local alternative is deploying Weaviate using Docker. For more details, follow the [How to install Verba with Docker](#how-to-install-verba-with-docker) section.![Deployment in Verba](https://github.com/weaviate/Verba/blob/2.0.0/img/verba_deployment.png)
**βοΈ Custom Weaviate Deployment**
If you're hosting Weaviate yourself, you can use the `Custom` deployment option in Verba. This will allow you to specify the URL, PORT, and API key of your Weaviate instance.
## Ollama
Verba supports Ollama models. Download and Install Ollama on your device (https://ollama.com/download). Make sure to install your preferred LLM using `ollama run `.
Tested with `llama3`, `llama3:70b` and `mistral`. The bigger models generally perform better, but need more computational power.
> Make sure Ollama Server runs in the background and that you don't ingest documents with different ollama models since their vector dimension can vary that will lead to errors
You can verify that by running the following command
```
ollama run llama3
```## Unstructured
Verba supports importing documents through Unstructured IO (e.g plain text, .pdf, .csv, and more). To use them you need the `UNSTRUCTURED_API_KEY` and `UNSTRUCTURED_API_URL` environment variable. You can get it from [Unstructured](https://unstructured.io/)
> UNSTRUCTURED_API_URL is set to `https://api.unstructuredapp.io/general/v0/general` by default
## AssemblyAI
Verba supports importing documents through AssemblyAI (audio files or audio from video files). To use them you need the `ASSEMBLYAI_API_KEY` environment variable. You can get it from [AssemblyAI](https://assemblyai.com)
## OpenAI
Verba supports OpenAI Models such as Ada, GPT3, and GPT4. To use them, you need to specify the `OPENAI_API_KEY` environment variable. You can get it from [OpenAI](https://openai.com/)
You can also add a `OPENAI_BASE_URL` to use proxies such as LiteLLM (https://github.com/BerriAI/litellm)
```
OPENAI_BASE_URL=YOUR-OPENAI_BASE_URL
```## HuggingFace
If you want to use the HuggingFace Features, make sure to install the correct Verba package. It will install required packages to use the local embedding models.
Please note that on startup, Verba will automatically download and install embedding models when used.```bash
pip install goldenverba[huggingface]or
pip install `.[huggingface]`
```> If you're using Docker, modify the `Dockerfile` accordingly. It's not possible to install a custom Verba installation if you pull the Docker Image from the Docker Hub, as of now, you'd need to install the Docker deployment from the source code and modify the `Dockerfile` beforehand.
## Groq
To use Groq LPUs as generation engine, you need to get an API key from [Groq](https://console.groq.com/keys).
> Although you can provide it in the graphical interface when Verba is up, it is recommended to specify it as `GROQ_API_KEY` environment variable before you launch the application.
> It will allow you to choose the generation model in an up-to-date available models list.# How to deploy with pip
`Python >=3.10.0`
1. (Very Important) **Initialize a new Python Environment**
```
python3 -m virtualenv venv
source venv/bin/activate
```2. **Install Verba**
```
pip install goldenverba
```3. **Launch Verba**
```
verba start
```> You can specify the --port and --host via flags
4. **Access Verba**
```
Visit localhost:8000
```5. (Optional)**Create .env file and add environment variables**
# How to build from Source
1. **Clone the Verba repos**
```
git clone https://github.com/weaviate/Verba.git
```2. **Initialize a new Python Environment**
```
python3 -m virtualenv venv
source venv/bin/activate
```3. **Install Verba**
```
pip install -e .
```4. **Launch Verba**
```
verba start
```> You can specify the --port and --host via flags
5. **Access Verba**
```
Visit localhost:8000
```6. (Optional) **Create .env file and add environment variables**
# How to install Verba with Docker
Docker is a set of platform-as-a-service products that use OS-level virtualization to deliver software in packages called containers. To get started with deploying Verba using Docker, follow the steps below. If you need more detailed instructions on Docker usage, check out the [Docker Curriculum](https://docker-curriculum.com/).
You can use `docker pull semitechnologies/verba` to pull the latest Verba Docker Image. Please note, that by pulling directly from Docker Hub you're only able to install the vanilla Verba version that does not include packages e.g `HuggingFace`. If you want to use Docker and `HuggingFace` please follow the steps below.
To build the image yourself, you can clone the Verba repository and run `docker build -t verba .` inside the Verba directory.
0. **Clone the Verba repos**
Ensure you have Git installed on your system. Then, open a terminal or command prompt and run the following command to clone the Verba repository:```
git clone https://github.com/weaviate/Verba.git
```1. **Set necessary environment variables**
Make sure to set your required environment variables in the `.env` file. You can read more about how to set them up in the [API Keys Section](#api-keys)2. **Adjust the docker-compose file**
You can use the `docker-compose.yml` to add required environment variables under the `verba` service and can also adjust the Weaviate Docker settings to enable Authentification or change other settings of your database instance. You can read more about the Weaviate configuration in our [docker-compose documentation](https://weaviate.io/developers/weaviate/installation/docker-compose). You can also uncomment the `ollama` service to use Ollama within the same docker compose.> Please make sure to only add environment variables that you really need.
2. **Deploy using Docker**
With Docker installed and the Verba repository cloned, navigate to the directory containing the Docker Compose file in your terminal or command prompt. Run the following command to start the Verba application in detached mode, which allows it to run in the background:```bash
docker compose up -d
```
```bash
docker compose --env-file goldenverba/.env up -d --build
```
This command will download the necessary Docker images, create containers, and start Verba.
Remember, Docker must be installed on your system to use this method. For installation instructions and more details about Docker, visit the official Docker documentation.4. **Access Verba**
- You can access your local Weaviate instance at `localhost:8080`
- You can access the Verba frontend at `localhost:8000`
If you want your Docker Instance to install a specific version of Verba you can edit the `Dockerfile` and change the installation line.
```
RUN pip install -e '.'
```## Verba Walkthrough
### Select your Deployment
The first screen you'll see is the deployment screen. Here you can select between `Local`, `Docker`, `Weaviate Cloud`, or `Custom` deployment. The `Local` deployment is using Weaviate Embedded under the hood, which initializes a Weaviate instance behind the scenes. The `Docker` deployment is using a separate Weaviate instance that is running inside the same Docker network. The `Weaviate Cloud` deployment is using a Weaviate instance that is hosted on Weaviate Cloud Services (WCS). The `Custom` deployment allows you to specify your own Weaviate instance URL, PORT, and API key.
You can skip this part by setting the `DEFAULT_DEPLOYMENT` environment variable to `Local`, `Docker`, `Weaviate`, or `Custom`.
### Import Your Data
First thing you need to do is to add your data. You can do this by clicking on `Import Data` and selecting either `Add Files`, `Add Directory`, or `Add URL` tab. Here you can add all your files that you want to ingest.
You can then configure every file individually by selecting the file and clicking on `Overview` or `Configure` tab.
![Demo of Verba](https://github.com/weaviate/Verba/blob/2.0.0/img/verba_data.png)### Query Your Data
With Data imported, you can use the `Chat` page to ask any related questions. You will receive relevant chunks that are semantically relevant to your question and an answer generated by your choosen model. You can configure the RAG pipeline under the `Config` tab.
![Demo of Verba](https://github.com/weaviate/Verba/blob/2.0.0/img/verba_rag.png)
## Open Source Contribution
Your contributions are always welcome! Feel free to contribute ideas, feedback, or create issues and bug reports if you find any! Before contributing, please read the [Contribution Guide](./CONTRIBUTING.md). Visit our [Weaviate Community Forum](https://forum.weaviate.io/) if you need any help!
### Project Architecture
You can learn more about Verba's architecture and implementation in its [technical documentation](./TECHNICAL.md) and [frontend documentation](./FRONTEND.md). It's recommended to have a look at them before making any contributions.
## Known Issues
- **Weaviate Embeeded** currently not working on Windows yet
- Will be fixed in future versions, until then please use the Docker or WCS Deployment## FAQ
- **Can I use pre-existing data from my Weaviate instance?**
- No, unfortunatley not. Verba requires the data to be in a specific format to work. And as of now, this is only possible by importing data through the Verba UI.
- **Is Verba Multi-Lingual?**
- This depends on your choosen Embedding and Generation Model whether they support multi-lingual data.
- **Can I use my Ollama Server with the Verba Docker?**
- Yes, you can! Make sure the URL is set to: `OLLAMA_URL=http://host.docker.internal:11434`
- If you're running on Linux, you might need to get the IP Gateway of the Ollama server: `OLLAMA_URL="http://YOUR-IP-OF-OLLAMA:11434"`- **How to clear Weaviate Embedded Storage?**
- You'll find the stored data here: `~/.local/share/weaviate`
- **How can I specify the port?**
- You can use the port and host flag `verba start --port 9000 --host 0.0.0.0`
- **Can multiple users use Verba at the same time? How about role based access?**
- Verba is designed and optimized for single user usage only. There are no plans on supporting multiple users or role based access in the near future.
- **Does Verba offer a API endpoint to use externally?**
- No, right now Verba does not offer any useful API endpoints to interact with the application. The current FastAPI setup is optimized for the internal communication between the frontend and backend. It is not recommended to use it as a API endpoint. There are plans to add user-friendly
- **How to connect to your custom OpenAI Server?**
- Set your custom OpenAI API Key and URL in the `.env` file, this will allow Verba to start up and retrieve the models from your custom OpenAI Server. `OPENAI_BASE_URL` is set to `https://api.openai.com/v1` by default.