https://github.com/mnds18/sydney-rag-chatbot
End-to-end RAG chatbot powered by LangChain, FAISS, OpenAI GPT-3.5, and Flask. Scrapes live Sydney Wikipedia data and answers questions in real-time
https://github.com/mnds18/sydney-rag-chatbot
ai chatbot datascience faiss flask generativeai langchain openai python rag vectorsearch webscraping
Last synced: 2 months ago
JSON representation
End-to-end RAG chatbot powered by LangChain, FAISS, OpenAI GPT-3.5, and Flask. Scrapes live Sydney Wikipedia data and answers questions in real-time
- Host: GitHub
- URL: https://github.com/mnds18/sydney-rag-chatbot
- Owner: mnds18
- Created: 2025-04-13T06:15:00.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-04-13T07:03:15.000Z (about 1 year ago)
- Last Synced: 2025-04-15T01:17:47.872Z (about 1 year ago)
- Topics: ai, chatbot, datascience, faiss, flask, generativeai, langchain, openai, python, rag, vectorsearch, webscraping
- Language: Python
- Homepage:
- Size: 6.84 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Mr. Mrig Sydney RAG Chatbot đ¨đŠī¸
A full-stack **Retrieval-Augmented Generation (RAG)** chatbot trained on live content scraped from Wikipedia (Sydney page).
Built to demonstrate practical AI product engineering: scraping â vectorizing â retrieval â real-time chat.
---
## đ Features
- đ Live **web scraping** with BeautifulSoup
- đ§ **Vector embedding** using HuggingFace models + **FAISS** vector database
- đ¤ **RAG pipeline** with **OpenAI GPT-3.5 Turbo**
- đĨī¸ Minimalistic **Flask web app** with live chat interface
- đ **Auto vectorstore management** (create, load, save)
- âī¸ **One-click deploy to Render**
---
## đ Demo Screenshot Chat 1

## đ Demo Screenshot Chat 1

## đ Project Structure
sydney-rag-chatbot/ âââ app.py # Main application (Flask + RAG logic) âââ requirements.txt # Python dependencies âââ render.yaml # Render deployment configuration âââ templates/ â âââ index.html # Simple frontend for chatting âââ website_text.txt # (Generated) scraped Wikipedia text âââ faiss_db_folder/ # (Generated) FAISS vector database âââ .env.example # Template for environment variables âââ README.md # Documentation
---
## đ Environment Variables
To run this project, you need to configure the following environment variable:
| Variable | Purpose |
|:-----------------|:-------------------------------------------------|
| `OPENAI_API_KEY` | Your OpenAI API key for accessing GPT-3.5 models |
---
### đģ How to Set It Locally
- Create a `.env` file in your project root.
- Add this line:
```plaintext
OPENAI_API_KEY=your-openai-api-key-here