https://github.com/humankernel/rag-revamped

Advanced Iterative RAG pipeline + (Gradio UI), local
https://github.com/humankernel/rag-revamped

gradio nlp retrieval-augmented-generation vllm

Last synced: 16 days ago
JSON representation

Advanced Iterative RAG pipeline + (Gradio UI), local

Host: GitHub
URL: https://github.com/humankernel/rag-revamped
Owner: humankernel
Created: 2025-03-21T09:53:07.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2025-06-20T00:10:33.000Z (about 1 year ago)
Last Synced: 2025-06-20T01:21:30.319Z (about 1 year ago)
Topics: gradio, nlp, retrieval-augmented-generation, vllm
Language: Python
Homepage:
Size: 1.21 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Advanced RAG Chatbot

This project is a simple prototype developed for a [research paper](https://github.com/humankernel/rag-paper/blob/main/rcci_template.pdf) focused on democratizing AI tools for managing PDF documents in resource-limited contexts. It serves as an advanced Retrieval-Augmented Generation (RAG) pipeline example, showcasing how AI can facilitate document management.

## Features

## Installation

To set up the project locally, follow these steps:

1. Clone the repository:

```bash
git clone https://github.com/humankernel/rag.git
cd rag
```

2. Install the dependencies: `uv sync`

(Optional) Setup PyPi cuban repos
```shell
# uv will use the cuban national pypi repos
# if you don't want this open the `pyproject.toml`
# and remove the following:
# [[tool.uv.index]]
# url = "http://nexus.prod.uci.cu/repository/pypi-all/simple"
# default = true
```

3. Setup `.env` variables
4. Execute `uv run main`, this will start the Gradio UI

(Optional) In a local environment (e.g dev mode) you can start a separated `vLLM` instance.
```shell
# https://docs.vllm.ai/en/stable/serving/engine_args.html
❯ vllm serve ./qwen2.5-1.5b-instruct-q8_0.gguf --device cuda --gpu-memory-utilization 0.6 --max-model-len 2048
```

## Tests

```sh
pytest .
```

## License
This project is licensed under the MIT License - see the LICENSE file for details

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/humankernel/rag-revamped

Awesome Lists containing this project

README