https://github.com/mbappeenjoyer/dynamic-multi-agent-rag-pathway

Dynamic Multi-Agent RAG solution tailor made for long documents in finance and legal domains
agentic-ai agi autoprompt docker jinaai multi-agent-systems page-level-retrieval rag raptor unstructured

Last synced: 2 months ago

Host: GitHub
URL: https://github.com/mbappeenjoyer/dynamic-multi-agent-rag-pathway
Owner: mbappeenjoyer
License: mit
Created: 2024-12-21T12:44:31.000Z (6 months ago)
Default Branch: main
Last Pushed: 2024-12-25T06:46:43.000Z (6 months ago)
Last Synced: 2025-04-08T22:47:33.242Z (2 months ago)
Topics: agentic-ai, agi, autoprompt, docker, jinaai, multi-agent-systems, page-level-retrieval, rag, raptor, unstructured
Language: HTML
Homepage:
Size: 6.77 MB
Stars: 5
Watchers: 1
Forks: 4
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE.txt

# Dynamic Agentic RAG with Pathway

In this work, we develop a dynamic agentic RAG system for long, intricate legal and financial documents. Such a system requires orchestrating multiple agents for efficient context retrieval and reasoning, along with tool-reasoning capabilities to handle domain-specific tasks. The system should be robust and capable of decision-making across different user use cases and failure scenarios.

# Approach

![alt text](web_application/images/system.png)

The workflow begins by receiving a query Q, a set of documents D, and a set of tools T from the user. The tools can be user-provided or pre-defined as well. Given the above information, the supervisor agent first activates the Code & Reasoning agent, which can interact with tools and the RAG agent. Upon activation, the RAG Agent builds a document index for D using Pathway's VectorStore Server. It then utilizes Jina Embeddings to perform page-level retrieval and extract the top-k most relevant pages for Q. The pages are chunked and indexed using RAPTOR, forming a hierarchical structure over the summary of the chunks.
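
The page-level retrieval step described above can be illustrated with a minimal sketch. It assumes each page and the query have already been embedded as vectors and that pages are ranked by cosine similarity; the similarity metric, the function names `cosine` and `top_k_pages`, and the toy vectors are assumptions for illustration, not the repository's actual code.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def top_k_pages(query_vec, page_vecs, k=2):
    """Return the indices of the k pages most similar to the query, best first."""
    scored = sorted(
        enumerate(page_vecs),
        key=lambda item: cosine(query_vec, item[1]),
        reverse=True,
    )
    return [idx for idx, _ in scored[:k]]

# Toy 3-dimensional vectors standing in for real page embeddings (hypothetical data).
page_vecs = [
    [1.0, 0.0, 0.0],  # page 0
    [0.0, 1.0, 0.0],  # page 1
    [0.7, 0.7, 0.0],  # page 2
]
query_vec = [1.0, 0.2, 0.0]

selected = top_k_pages(query_vec, page_vecs, k=2)
print(selected)
```

Only the selected pages would then be passed on to chunking and hierarchical indexing, which keeps the later stages small even for very long documents.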

![alt text](web_application/images/rag.png)
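
To make the idea of a hierarchy over chunk summaries concrete, here is a simplified sketch. RAPTOR groups chunks by clustering their embeddings and summarizes each group with a language model; this sketch swaps in fixed-size grouping and a placeholder `summarize` function, so it shows only the bottom-up tree shape, not the actual method. All names here are hypothetical.

```python
def summarize(texts):
    """Placeholder summarizer: joins and truncates. A real system would call an LLM."""
    return " ".join(texts)[:60]

def build_tree(chunks, group_size=2):
    """Build levels bottom-up: level 0 holds the raw chunks, and each higher
    level summarizes fixed-size groups of the level below, until one root remains."""
    if group_size < 2:
        raise ValueError("group_size must be at least 2 so each level shrinks")
    levels = [list(chunks)]
    while len(levels[-1]) > 1:
        current = levels[-1]
        parents = [
            summarize(current[i:i + group_size])
            for i in range(0, len(current), group_size)
        ]
        levels.append(parents)
    return levels

# Hypothetical chunks from a retrieved page.
chunks = ["revenue rose", "costs fell", "margin improved", "debt repaid", "dividend declared"]
levels = build_tree(chunks)
for depth, nodes in enumerate(levels):
    print(depth, len(nodes))
```

Nodes from every level could then be indexed together, so a query can match either a specific chunk or a broader summary.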

Once indexing is complete, the RAG agent uses an interleaving approach to iterate between reasoning and retrieval to perform multi-hop contextual reasoning and return the RAG response. This allows for dynamic decision-making and real-time transformation of retrieval queries to ensure the successful retrieval of relevant context, eliminating the need for a separate critic agent.

![alt text](web_application/images/interleaving.png)
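
The interleaving of reasoning and retrieval can be sketched as a loop in which each reasoning step either answers or rewrites the retrieval query. In this sketch `retrieve` is a keyword-overlap stub and `reason` is a set of hard-coded rules standing in for an LLM call; both functions, the corpus, and the question are hypothetical and only illustrate the control flow.

```python
def retrieve(query, corpus):
    """Stub retriever: return the passage sharing the most words with the query."""
    q = set(query.lower().split())
    return max(corpus, key=lambda p: len(q & set(p.lower().split())))

def reason(question, context):
    """Stub reasoner: hard-coded rules standing in for an LLM call.
    Returns ("answer", text) or ("search", new_query)."""
    for passage in context:
        if "reported revenue of " in passage:
            return ("answer", passage.split("reported revenue of ")[1])
    for passage in context:
        if "parent company is " in passage:
            parent = passage.split("parent company is ")[1]
            return ("search", f"{parent} revenue")
    return ("search", question)

def answer_with_interleaving(question, corpus, max_steps=4):
    """Alternate retrieval and reasoning until an answer is produced."""
    context, query, trace = [], question, []
    for _ in range(max_steps):
        trace.append(query)
        passage = retrieve(query, corpus)
        if passage not in context:
            context.append(passage)
        action, payload = reason(question, context)
        if action == "answer":
            return payload, trace
        query = payload  # the reasoner rewrote the retrieval query
    return None, trace

corpus = [
    "Acme parent company is Globex",
    "Globex reported revenue of 5 million",
    "Initech leases office space",
]
answer, trace = answer_with_interleaving(
    "What revenue did the parent company of Acme report", corpus
)
print(answer, trace)
```

The second hop is only possible because the first retrieval revealed the parent company's name, which is the kind of query transformation the interleaved loop is meant to enable.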

The Code & Reasoning (C&R) agent can further utilize the tools for any tool-specific task based on the RAG agent's response and user query. Finally, the Supervisor Agent consolidates the outputs and returns the response to the user. It follows a Chain of Function Call (CoFC) policy where it invokes a single tool call at each step.
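
A rough sketch of a policy that issues one tool call per step is shown below. The tool functions, the `TOOLS` registry, and the scripted `choose_next_call` policy (which stands in for the model's decision) are all hypothetical; the point is only that each step executes exactly one call and feeds its result into the next decision.

```python
def percent_change(old, new):
    """Percentage change from old to new."""
    return (new - old) / old * 100

def round_to(value, digits):
    """Round value to the given number of decimal places."""
    return round(value, digits)

TOOLS = {"percent_change": percent_change, "round_to": round_to}

def choose_next_call(history):
    """Stub policy standing in for the LLM: returns exactly one call, or None when done."""
    if not history:
        return ("percent_change", {"old": 80.0, "new": 92.0})
    if len(history) == 1:
        return ("round_to", {"value": history[-1][1], "digits": 1})
    return None

def run_chain(max_steps=5):
    """Execute one tool call per step until the policy signals completion."""
    history = []  # list of (tool_name, result) pairs
    for _ in range(max_steps):
        call = choose_next_call(history)
        if call is None:
            break
        name, kwargs = call
        history.append((name, TOOLS[name](**kwargs)))
    return history

history = run_chain()
print(history)
```

Keeping each step to a single call makes the chain easy to trace and lets a failed call be handled before any dependent call is issued.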

## Files
`base_pipeline:` Python modular code for our pipeline

`Codebase_Documentation:` Interactive codebase documentation for our system and Pathway integration

`web_application:` Codebase for web application

`videos:` Summary Video and Solution Demonstration Video

`test.ipynb:` Python Notebook containing Pathway integration

`utils:` Utility Python files for executing `test.ipynb`

# Usage 1 (Colab Notebook)
> [!IMPORTANT]
> Use a Google Colab Runtime.

- First, download the Jupyter notebook `test.ipynb` and connect to a Colab standard runtime.
- Run the **Imports** cell. When prompted to restart the runtime, do so.
- In the **API Keys** section, please provide all the relevant API keys.
- Run each section one by one.
- Once you run the **Inference** section, you will be asked to provide the path of the PDF over which you wish to perform question answering, along with your query. Enter both when prompted.
- After a few minutes, the response to your query will be provided.
> [!IMPORTANT]
> While running the Colab notebook, initialize the two vector servers by running `utils/app_jina.py` and `utils/app_raptor.py` in two separate terminals, and override Pathway's built-in VectorStore Server code with our updated `utils/VectorSever.py`.

> [!NOTE]
> The tool list in the "Tools" section contains a few default tools; one can also provide custom tools (with proper docstrings).
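
As an illustration of a custom tool with a proper docstring, the sketch below defines a plain Python function and shows how its name, signature, and docstring could be collected for an agent to read. The tool `compound_interest` and the helper `describe_tool` are hypothetical; the notebook's actual tool registration mechanism may differ.

```python
import inspect

def compound_interest(principal: float, annual_rate: float, years: int) -> float:
    """Compute the final amount under annual compounding.

    Args:
        principal: Initial amount.
        annual_rate: Interest rate per year as a fraction (0.05 for 5%).
        years: Number of whole years.

    Returns:
        The amount after `years` years.
    """
    return principal * (1 + annual_rate) ** years

def describe_tool(fn):
    """Collect the name, signature, and docstring that an agent could be shown."""
    return {
        "name": fn.__name__,
        "signature": str(inspect.signature(fn)),
        "description": inspect.getdoc(fn),
    }

spec = describe_tool(compound_interest)
print(spec["name"], spec["signature"])
```

A docstring that states the units and meaning of each argument gives the agent what it needs to decide when and how to call the tool.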

Interactive documentation of our modular codebase is available at `Codebase_Documentation\build\index.html`. Refer to it for a detailed description of the entire codebase and the integration of the various system components with
$\textsf{\color{blue} Pathway}$.

# Usage 2 (Web Application)

## Installation

You can run the frontend, backend, model microservice, and database using the commands below. Note that all paths are relative to the `web_application` directory inside the submission folder.

### Frontend
```bash
make frontend-all
```
(or)
```bash
cd frontend
npm i
npm run dev
```
### Database
```bash
make database-up-create
```
### Model Microservice
Fill in all the environment variables in the following file before running the microservice in development mode:

`./model-microservice/rag_agent/.env`
```py
RAPTOR_GROQ_API_KEY=
RAG_GROQ_API_KEY=
SUPERVISOR_GROQ_API_KEY=
CLIENT_GROQ_API_KEY=
UNSTRUCTURED_API_KEY=
UNSTRUCTURED_API_URL=
JINAAI_API_KEY=
EMBED_JINA_API_KEY=
TOGETHER_API_KEY=
TAVILY_API_KEY=
```
```bash
make model-microservice-all
```
(or)

For Linux
```bash
cd model-microservice
# Create and activate the virtual environment first so dependencies install into it
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python main.py
```
For Windows
```bash
cd model-microservice
# Create and activate the virtual environment first so dependencies install into it
python -m venv venv
.\venv\Scripts\activate
pip install -r requirements.txt
python main.py
```
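
Before starting the microservice, it can help to confirm that the `.env` file is filled in. The sketch below parses `KEY=VALUE` lines with the standard library and reports which of the keys listed above are missing or empty. The helper names `parse_env` and `missing_keys` and the sample text are hypothetical, and the check assumes every listed key is required.

```python
REQUIRED_KEYS = [
    "RAPTOR_GROQ_API_KEY", "RAG_GROQ_API_KEY", "SUPERVISOR_GROQ_API_KEY",
    "CLIENT_GROQ_API_KEY", "UNSTRUCTURED_API_KEY", "UNSTRUCTURED_API_URL",
    "JINAAI_API_KEY", "EMBED_JINA_API_KEY", "TOGETHER_API_KEY", "TAVILY_API_KEY",
]

def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values

def missing_keys(text, required=REQUIRED_KEYS):
    """Return the required keys that are absent or have an empty value."""
    values = parse_env(text)
    return [key for key in required if not values.get(key)]

# Hypothetical partially filled file contents.
sample = "RAPTOR_GROQ_API_KEY=abc\nRAG_GROQ_API_KEY=\n# comment\nJINAAI_API_KEY=xyz\n"
missing = missing_keys(sample)
print(missing)
```

To check the real file, pass its contents instead of `sample`, for example `missing_keys(open("model-microservice/rag_agent/.env").read())`.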

### Backend
Fill in all the environment variables in the following file before running the backend in development mode:

`./backend/.env`
```py
POSTGRES_USER=
POSTGRES_PASSWORD=
POSTGRES_HOST=
POSTGRES_DB=
AWS_ACCESS_KEY_ID=
AWS_SECRET_ACCESS_KEY=
AWS_REGION=
AWS_BUCKET_NAME=
```
```bash
make backend-all
```
(or)

For Linux
```bash
cd backend
# Create and activate the virtual environment first so dependencies install into it
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python main.py
```
For Windows
```bash
cd backend
# Create and activate the virtual environment first so dependencies install into it
python -m venv venv
.\venv\Scripts\activate
pip install -r requirements.txt
python main.py
```

# System Architecture

(Figure: system architecture diagram)

- The web application is fully functional and uses a modern tech stack chosen for scalability, performance, and ease of deployment.
- The frontend is built with Next.js, providing a seamless and dynamic user experience. It connects to a FastAPI-based backend gateway, which serves as the central communication hub and manages interactions with the other subsystems.
- The backend gateway integrates with an Amazon S3 bucket for efficient and secure storage of PDFs.
- Additionally, it connects to a Flask-based microservice that powers the agentic Retrieval-Augmented Generation (RAG) system, enabling intelligent and context-aware document processing.
- For data management, we use PostgreSQL as the database management system (DBMS), ensuring reliable and scalable data storage and retrieval.
- All services, including the frontend, backend, microservices, and database, have been containerized using Docker.

This approach simplifies development workflows and ensures the application is production-ready, with consistent and isolated environments across the various stages of deployment.

# Web UI

(Figure: web UI screenshot)
