https://github.com/jannik-droid/pdf-rag-chatbot
Local PDF RAG with Ollama, ChromaDB and StreamLit
https://github.com/jannik-droid/pdf-rag-chatbot
chromadb ollama streamlit
Last synced: about 2 months ago
JSON representation
Local PDF RAG with Ollama, ChromaDB and StreamLit
- Host: GitHub
- URL: https://github.com/jannik-droid/pdf-rag-chatbot
- Owner: Jannik-droid
- Created: 2025-03-08T10:09:42.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-08T11:13:22.000Z (over 1 year ago)
- Last Synced: 2025-03-08T11:20:22.527Z (over 1 year ago)
- Topics: chromadb, ollama, streamlit
- Language: Python
- Homepage:
- Size: 4.16 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# 📄 Local PDF RAG with Ollama, ChromaDB and StreamLit
A locally running **Retrieval-Augmented Generation (RAG)** system that processes PDFs, stores embeddings in **ChromaDB**, and queries an **Ollama-powered LLM** with a StreamLit UI for intelligent search.
## 🚀 Features
- Extracts & cleans text from PDFs 📄
- Generates **embeddings** using `nomic-embed-text`
- Stores & retrieves **vectors** with **ChromaDB**
- Queries an **Ollama LLM** (`mistral`)
- Userfriendly **StreamLit UI**
## 🦾 How to use
Since this is a StreamLit App, just type: `stremalit run app.py` and follow the instructions in the app.
## ⚙️ Requirements
Make sure to have Ollama and the corresponding LLMs installed (e.g. `Mistral` and `nomic-embed-text`)
## 💡 Upcoming Features
- Being able to select which LLM model to use for generating embeddings and for text generation
- Rework UI