https://github.com/RobinMillford/Cortex-AI-Multi-Model-Insights-Hub

This project creates a Retrieval-Augmented Generation (RAG) chatbot for summarizing and interacting with articles. The system processes articles provided as PDFs or URLs, extracts text, splits the content into chunks, generates embeddings, and stores them in a vector database.

Host: GitHub
URL: https://github.com/RobinMillford/Cortex-AI-Multi-Model-Insights-Hub
Owner: RobinMillford
License: agpl-3.0
Created: 2025-01-23T16:25:01.000Z (11 months ago)
Default Branch: main
Last Pushed: 2025-01-28T16:43:53.000Z (11 months ago)
Last Synced: 2025-01-28T17:36:06.255Z (11 months ago)
Topics: article-extractor, chatbot, llama3, llm, pdf-document-processor, rag, streamlit, summarizer, vector-database
Language: Python
Homepage: https://multi-model-rag-powered-article-chatbot.streamlit.app/
Size: 341 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

# Cortex AI: Multi-Model Insights Hub

🤖 **Advanced AI-Powered Document Analysis with Multimodal RAG Capabilities**

Cortex AI Hub integrates multiple Large Language Models (LLMs) with a **Multimodal Retrieval-Augmented Generation (RAG)** system, enabling you to extract insights from both **text and visual content** in documents.

**✨ NEW: Multimodal Capabilities** - Now with support for images, charts, graphs, and infographics!

---

## 🌟 **Key Features**

### 🖼️ **Multimodal RAG**

- **📊 Visual Content Understanding**: Analyze images, charts, graphs, and infographics
- **🔗 Unified Text-Image Search**: Search across both textual and visual content
- **🎯 Context-Aware Analysis**: Enhanced understanding with specialized prompts
- **💾 Persistent Storage**: Multimodal embeddings saved to a FAISS-based store for reuse
- **🆓 Free & Local**: Uses open-source models (BLIP, BLIP-2, GIT, CLIP)

### 🔍 **Advanced Search & RAG**

- **🧠 Hybrid Search**: Combines semantic vector search with BM25 keyword search
- **📂 Multi-Document Support**: Upload PDFs or provide URLs
- **💾 Persistent Vector Database**: ChromaDB-powered storage
- **✅ Accurate Citations**: Source-linked responses with references
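
The hybrid search idea above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the project's implementation: it scores a toy corpus with a hand-written BM25, ranks the same corpus by cosine similarity over made-up vectors, and merges the two rankings with reciprocal rank fusion (RRF). The fusion method, the `k=60` constant, the toy data, and every function name are assumptions; this README does not say how the two result lists are combined.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each tokenized document against the query with Okapi BM25."""
    n = len(docs)
    avg_len = sum(len(d) for d in docs) / n
    # Document frequency: number of documents containing each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Smoothed IDF variant that never goes negative.
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def cosine(a, b):
    """Cosine similarity between two dense vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def ranking(scores):
    """Return document indices ordered from best to worst score."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


def reciprocal_rank_fusion(rankings, k=60):
    """Merge rankings: each document earns 1 / (k + rank) per list."""
    fused = Counter()
    for r in rankings:
        for rank, doc_id in enumerate(r, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    return [doc_id for doc_id, _ in fused.most_common()]


# Toy corpus and hand-made "embeddings" (assumptions for illustration only).
docs = [
    "faiss stores multimodal embeddings".split(),
    "bm25 ranks documents by keyword overlap".split(),
    "streamlit renders the chat interface".split(),
]
doc_vectors = [[0.9, 0.1, 0.0], [0.2, 0.9, 0.1], [0.0, 0.1, 0.9]]
query_tokens = "keyword ranks".split()
query_vector = [0.1, 0.8, 0.1]

keyword_rank = ranking(bm25_scores(query_tokens, docs))
semantic_rank = ranking([cosine(query_vector, v) for v in doc_vectors])
top = reciprocal_rank_fusion([keyword_rank, semantic_rank])
```

RRF is used here because it needs only ranks, so the BM25 and cosine scores never have to be put on a common scale. A weighted sum of normalized scores would be another reasonable choice.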

### 🤖 **AI-Powered Search Agent**

- **🌐 Real-Time Research**: ArXiv, Wikipedia, and web search tools
- **📰 Current Information**: Up-to-date news and research insights
- **⚡ Instant Responses**: Fast, context-aware answers

---

## 🚀 **Supported AI Models**

| Model | Provider | Best For |
| ----------------------------- | -------- | ----------------------------- |
| llama-3.3-70b-versatile | Meta | Complex reasoning, analysis |
| llama-3.1-8b-instant | Meta | Quick queries, fast responses |
| deepseek-r1-distill-llama-70b | DeepSeek | Extended conversations |
| qwen/qwen3-32b | Alibaba | Document summarization |
| openai/gpt-oss-120b | OpenAI | Complex analysis tasks |

### 🖼️ **Vision Models**

| Model | Description | Best For |
| ------ | ---------------------- | ---------------------------- |
| BLIP | Quick image captioning | Speed, basic analysis |
| BLIP-2 | Advanced understanding | Complex visual content |
| GIT | Detailed descriptions | Charts, graphs, infographics |

---

## 📸 **Application Screenshots**

### 🤖 **RAG Chatbot Interface**

![RAG Chatbot Interface](images/Ragbot_interface.png)
_Traditional RAG chatbot with document upload and multi-LLM selection_

### 🖼️ **Multimodal RAG Interface**

![Multimodal RAG Interface](images/MultiModel_Rag_Interface.png)
_Enhanced multimodal interface with vision model selection and image analysis_

### 🔍 **Search Agent Interface**

![Search Agent Interface](images/Search_Agent_Interface.png)
_AI-powered search agent with real-time research capabilities_

---

## 🔄 **System Architecture**

### 📊 **RAG Chatbot Workflow**

![RAG Chatbot Workflow](images/Ragchotbot_diagram.png)
_Complete RAG chatbot workflow with document processing, hybrid search, and multi-LLM response generation_

### 🤖 **Search Agent Workflow**

![Search Agent Workflow](images/Search_Agent_Diagram.png)
_AI-powered search agent workflow with multi-tool research and intelligent orchestration_

### 🖼️ **Multimodal RAG Workflow**

![Multimodal RAG Workflow](images/Multimodel_Rag.png)
_Enhanced multimodal workflow combining text and visual content analysis_

---

## 🚀 **Getting Started**

### 📋 **Prerequisites**

- Python 3.12+
- Git
- API keys: Groq and Tavily

### 📥 **Installation**

1. **Clone Repository**

```bash
git clone https://github.com/RobinMillford/Cortex-AI-Multi-Model-Insights-Hub.git
cd Cortex-AI-Multi-Model-Insights-Hub
```

2. **Setup Environment**

```bash
python -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activate
pip install -r requirements.txt
```

3. **Configure API Keys**

```bash
cp .env.template .env
# Add your GROQ_API_KEY and TAVILY_API_KEY to .env
```

4. **Run Application**
```bash
streamlit run Main_Page.py
```
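
Before launching, it can help to confirm that both keys are visible to the process. The following is a minimal sketch, assuming the app reads `GROQ_API_KEY` and `TAVILY_API_KEY` from the environment; how the project loads `.env` is not shown in this README. `missing_keys` and `REQUIRED_KEYS` are hypothetical names, not part of the repository.

```python
import os

# Key names taken from the .env step above.
REQUIRED_KEYS = ("GROQ_API_KEY", "TAVILY_API_KEY")


def missing_keys(env=None):
    """Return the required key names that are unset or blank in `env`.

    `env` defaults to the current process environment; passing a plain
    dict makes the check easy to test.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_KEYS if not env.get(name, "").strip()]
```

Calling `missing_keys()` with no arguments checks `os.environ`. An empty list means both keys are set.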

### 🌐 **Live Demo**

**[🚀 Try it now](https://cortex-ai-multi-model-insights-app.streamlit.app/)**

---

## 📖 **Usage Guide**

### 🖼️ **Multimodal Document Analysis**

1. Navigate to **"Multimodal RAG"** page
2. Choose vision model (BLIP for speed, GIT for accuracy)
3. Upload PDF with images/charts
4. Enable **"Extract and analyze images"**
5. Ask questions about text and visual content

### 📄 **Traditional Document Chat**

1. Go to **"RAG Chatbot"** page
2. Upload PDFs or enter URLs
3. Configure retrieval parameters
4. Select LLM models for comparison
5. Ask questions and get cited responses
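
The retrieval parameters in step 3 are not listed in this README. As a rough illustration of what such settings typically control, the sketch below splits text into fixed-size character chunks with overlap. `chunk_text`, `chunk_size`, and `overlap` are hypothetical names chosen for this example; the project may use a different splitter and different defaults.

```python
def chunk_text(text, chunk_size=500, overlap=50):
    """Split `text` into chunks of at most `chunk_size` characters.

    Consecutive chunks share `overlap` characters, so text near a
    boundary appears in both neighbouring chunks.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
    return chunks
```

Larger chunks give the model more context per retrieved passage. Smaller chunks make each match more specific, so the right values depend on the documents.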

### 🔍 **Research & Web Search**

1. Visit **"Search Agent"** page
2. Enter research queries
3. Choose preferred LLM model
4. Get real-time answers with sources

---

## 🛠️ **Technology Stack**

- **Frontend**: Streamlit with dark theme
- **Backend**: Python, LangChain/LangGraph
- **Vector DB**: ChromaDB (text), FAISS (multimodal)
- **Embeddings**: HuggingFace sentence-transformers, CLIP
- **Vision**: BLIP, BLIP-2, GIT (Hugging Face)
- **LLMs**: Groq API
- **Search**: Tavily, ArXiv, Wikipedia APIs

### 📁 **Project Structure**

```
├── Main_Page.py # App entry point
├── multimodal_helpers.py # Multimodal processing
├── helpers.py # Text utilities
├── chain_setup.py # LLM configuration
├── pages/
│ ├── 1_RAG_Chatbot.py # Traditional RAG
│ ├── 2_Search_Agent.py # Web search agent
│ └── 3_Multimodal_RAG.py # Multimodal interface
├── chroma_db/ # Text vector storage
├── multimodal_stores/ # Multimodal storage
└── requirements.txt # Dependencies
```

---

## 🔧 **Key Technical Features**

### 🧠 **Architecture Highlights**

- **Two-Layer Vision**: Vision models → descriptions, CLIP → embeddings
- **Hybrid Search**: Semantic + BM25 for optimal retrieval
- **Model Caching**: Global cache prevents reloading
- **Session Management**: Streamlit state for persistence
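
The two-layer idea can be sketched without any model downloads. In the illustration below, `describe_image` stands in for a captioning model (layer 1) and `embed` stands in for an embedding model (layer 2); both are toy placeholders, and `UnifiedIndex`, `Entry`, and `fig1.png` are hypothetical. This README does not specify whether CLIP embeds the raw image, the generated description, or both; the sketch embeds the description so that text and images share one search space.

```python
import math
from collections import Counter
from dataclasses import dataclass


def describe_image(image_name):
    """Stand-in for layer 1 (a captioning model such as BLIP or GIT)."""
    canned = {"fig1.png": "bar chart of quarterly revenue"}
    return canned.get(image_name, "unlabelled image")


def embed(text):
    """Stand-in for layer 2 (an embedding model): sparse bag of words."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


@dataclass
class Entry:
    kind: str        # "text" or "image"
    source: str      # chunk id or image file name
    content: str     # chunk text or generated description
    vector: Counter  # embedding of `content`


class UnifiedIndex:
    """Single in-memory index holding text chunks and image descriptions."""

    def __init__(self):
        self.entries = []

    def add_text(self, source, text):
        self.entries.append(Entry("text", source, text, embed(text)))

    def add_image(self, image_name):
        description = describe_image(image_name)  # layer 1: image -> text
        vector = embed(description)               # layer 2: text -> vector
        self.entries.append(Entry("image", image_name, description, vector))

    def search(self, query, top_k=3):
        q = embed(query)
        ranked = sorted(
            self.entries, key=lambda e: cosine(q, e.vector), reverse=True
        )
        return ranked[:top_k]


index = UnifiedIndex()
index.add_text("page-1", "revenue increased in the third quarter")
index.add_image("fig1.png")
hits = index.search("bar chart")
```

Because every entry carries its `kind` and `source`, a result can be traced back to either a text chunk or an image, which is what a cited answer needs.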

### ⚡ **Performance Optimizations**

- Vision models cached globally
- Processed embeddings saved for reuse
- Lazy loading: resources are loaded only when first needed
- Real-time progress feedback
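
One common way to get both behaviours (load on first use, never reload) is a memoized loader. The sketch below uses `functools.lru_cache` from the standard library with a dummy loader, so it runs without downloading anything; `load_vision_model` and `LOAD_COUNT` are hypothetical. In a Streamlit app, `st.cache_resource` serves a similar purpose, though this README does not say which mechanism the project uses.

```python
from functools import lru_cache

# Counts real loads per model; only here to make the caching visible.
LOAD_COUNT = {"blip": 0, "git": 0}


@lru_cache(maxsize=None)
def load_vision_model(name):
    """Pretend to load a heavy model; the body runs once per distinct name."""
    if name not in LOAD_COUNT:
        raise ValueError(f"unknown model: {name}")
    LOAD_COUNT[name] += 1
    return {"name": name}  # placeholder for a real model object


first = load_vision_model("blip")   # loaded on first use
second = load_vision_model("blip")  # served from the cache, no reload
```

Nothing is loaded for `"git"` until it is requested, so unused models cost no startup time or memory.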

---

## 🤝 **Contributing**

1. Fork the repository
2. Create feature branch: `git checkout -b feature/your-feature`
3. Make changes and test locally
4. Commit and push: `git commit -m "Add feature"`, then `git push origin feature/your-feature`
5. Create Pull Request

### 🎯 **Areas for Contribution**

- 🖼️ New vision models or analysis techniques
- 🔍 Better retrieval algorithms
- 🎨 UI/UX improvements
- 📊 Analytics and metrics
- 🧪 Testing and documentation

---

## 📝 **License**

This project is licensed under the **AGPL-3.0 License**.

---

## 🆘 **Support**

- **🐛 Issues**: [GitHub Issues](https://github.com/RobinMillford/Cortex-AI-Multi-Model-Insights-Hub/issues)
- **💬 Discussions**: [GitHub Discussions](https://github.com/RobinMillford/Cortex-AI-Multi-Model-Insights-Hub/discussions)

---

## 🙏 **Acknowledgments**

- **🤗 Hugging Face**: Free open-source vision models
- **🦙 Meta**: Llama models
- **🧠 OpenAI**: CLIP embedding model
- **🔍 Salesforce**: BLIP vision models
- **🏢 Microsoft**: GIT vision model
- **⚡ Groq**: Fast LLM inference
- **🌐 Streamlit**: Amazing app framework

---
