https://github.com/garvitjain-02/modular-rag-chatbot

A Modular Retrieval-Augmented Generation (RAG) application that allows users to upload PDF documents and chat with an AI assistant that answers queries based on the document content. It features a microservice architecture with a decoupled FastAPI backend and Streamlit frontend, using ChromaDB as the vector store and Groq's LLaMA3 model as the LLM.
https://github.com/garvitjain-02/modular-rag-chatbot

chromadb fastapi generative-ai groq langchain llama3 llm python rag-chatbot streamlit

Last synced: 3 months ago
JSON representation

Host: GitHub
URL: https://github.com/garvitjain-02/modular-rag-chatbot
Owner: garvitjain-02
Created: 2025-06-12T05:00:43.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2025-08-05T04:49:25.000Z (11 months ago)
Last Synced: 2025-08-05T06:28:34.194Z (11 months ago)
Topics: chromadb, fastapi, generative-ai, groq, langchain, llama3, llm, python, rag-chatbot, streamlit
Language: Python
Homepage:
Size: 6.36 MB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Modular RAG PDF Chatbot with FastAPI, ChromaDB & Streamlit

This project is a modular **Retrieval-Augmented Generation (RAG)** application that allows users to upload PDF documents and chat with an AI assistant that answers queries based on the document content. It features a microservice architecture with a decoupled **FastAPI backend** and **Streamlit frontend**, using **ChromaDB** as the vector store and **Groq's LLaMA3 model** as the LLM.

---

## 📂 Project Structure

---

## ✨ Features

- 📄 Upload and parse PDFs
- 🧠 Embed document chunks with HuggingFace embeddings
- 💂️ Store embeddings in ChromaDB
- 💬 Query documents using LLaMA3 via Groq
- 🌍 Microservice architecture (Streamlit client + FastAPI server)

---

## 🎓 How RAG Works

Retrieval-Augmented Generation (RAG) enhances LLMs by injecting external knowledge. Instead of relying solely on pre-trained data, the model retrieves relevant information from a vector database (like ChromaDB) and uses it to generate accurate, context-aware responses.

---

## 🚀 Getting Started Locally

### 1. Clone the Repository

```bash
git clone https://github.com/garvitjain-02/Modular-RAG-Chatbot.git
cd Modular-RAG-Chatbot
```

### 2. Setup the Backend (FastAPI)

```bash
cd server
python -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activate
pip install -r requirements.txt

# Set your Groq API Key (.env)
GROQ_API_KEY="your_key_here"

# Run the FastAPI server
uvicorn main:app --reload
```

### 3. Setup the Frontend (Streamlit)

```bash
cd ../client
pip install -r requirements.txt # if you use a separate venv for client
streamlit run app.py
```

---

## 🌐 API Endpoints (FastAPI)

- `POST /upload_pdfs/` — Upload PDFs and build vectorstore
- `POST /ask/` — Send a query and receive answers

Testable via Postman or directly from the Streamlit frontend.

---

## 🌟 References

- [LangChain](https://www.langchain.com/)
- [ChromaDB](https://www.trychroma.com/)
- [Groq](https://groq.com/)
- [Streamlit](https://streamlit.io/)

---

## ✉️ Contact

For questions or suggestions, open an issue or contact at [garvitjainjnv@gmail.com]

---

> Happy Building! 🚀

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/garvitjain-02/modular-rag-chatbot

Awesome Lists containing this project

README