https://github.com/mahtabranjbar/ml-papers-rag
A Retrieval-Augmented Generation (RAG) chatbot that answers questions about machine learning by leveraging a dataset of 100 LLM papers. It provides contextually relevant responses, supports configurable model parameters,
https://github.com/mahtabranjbar/ml-papers-rag
generative-ai llms rag
Last synced: 2 months ago
JSON representation
A Retrieval-Augmented Generation (RAG) chatbot that answers questions about machine learning by leveraging a dataset of 100 LLM papers. It provides contextually relevant responses, supports configurable model parameters,
- Host: GitHub
- URL: https://github.com/mahtabranjbar/ml-papers-rag
- Owner: MahtabRanjbar
- License: mit
- Created: 2024-09-12T11:30:29.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-09-12T11:33:10.000Z (8 months ago)
- Last Synced: 2025-01-24T11:44:42.198Z (4 months ago)
- Topics: generative-ai, llms, rag
- Language: Python
- Homepage:
- Size: 7.81 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# ML Papers RAG Chatbot
## Overview
This project implements a Retrieval-Augmented Generation (RAG) chatbot designed to answer questions about machine learning papers. It uses a dataset of 100 LLM (Large Language Model) papers to provide informative and contextually relevant responses to user queries about recent advancements in the field of machine learning, particularly focusing on LLMs.
## Features
- Retrieval-Augmented Generation (RAG) for accurate responses about ML papers
- Utilizes a dataset of 100 recent LLM papers
- Configurable model parameters and pipeline components
- Efficient PDF processing, document splitting, and embedding
- FAISS vector store for fast similarity search
- Command-line interface for quick queries
- Streamlit web interface for interactive chatting (optional)## Dataset
This project uses the "100 LLM Papers to Explore" dataset from Kaggle:
[https://www.kaggle.com/datasets/ruchi798/100-llm-papers-to-explore](https://www.kaggle.com/datasets/ruchi798/100-llm-papers-to-explore)The dataset includes 100 PDF papers covering various aspects of Large Language Models, providing a rich knowledge base for the chatbot.
### Installation
1. Clone the repository:
```bash
git clone https://github.com/MahtabRanjbar/ml-papers-rag.git
cd ml-papers-rag
```2. Install the required packages:
```bash
pip install -r requirements.txt
```3. Configure the project:
- Open `config/config.yml` and adjust the settings as needed, especially the path to the downloaded papers### Usage
#### Command-line Interface
Run a single query:```bash
python src/main.py "What are the recent advancements in LLM fine-tuning techniques?"
```
## ModelsThis project uses the following models:
- LLM: Meta-Llama-3-8B-Instruct
- Embeddings: BAAI/bge-base-en-v1.5You can change these in the `config.yml` file.
## Dataset
The chatbot's knowledge base is built from a collection of machine learning papers. You can replace or expand this dataset by adding PDF files to the configured input directory.
## Configuration
All configurable parameters are stored in `config/config.yml`. This includes:
- Model settings (name, temperature, top_p, etc.)
- Text splitting parameters
- Embedding model selection
- File paths for input and output