# RAGViz

RAGViz (Retrieval Augmented Generation Visualization) is a tool that visualizes both document-level and token-level attention on the retrieved context fed to the LLM to ground its answer generation (see the attention sketch below).

- RAGViz provides add/remove document functionality for comparing the generated tokens when certain documents are excluded from the context.
- Combining both functionalities allows users to diagnose the effectiveness and influence of specific retrieved documents, or sections of text, on the LLM's answer generation.
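
The token-level attention that RAGViz visualizes can be surfaced directly from the transformers library. The sketch below is a minimal illustration, not RAGViz's actual pipeline; the prompt layout and the last-layer, head-averaged pooling are assumptions.

```python
# Minimal sketch: surfacing token-level attention with the HuggingFace
# transformers library. The prompt layout and last-layer, head-averaged
# pooling are assumptions, not RAGViz's actual pipeline.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-7b-chat-hf"  # checkpoint used in the demo
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, output_attentions=True)

context = "Retrieved document text goes here."
query = "What does the document say?"
inputs = tokenizer(context + "\n" + query, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# outputs.attentions holds one tensor per layer, each shaped
# (batch, num_heads, seq_len, seq_len). Averaging the heads of the last
# layer gives a per-token map like the one RAGViz renders.
attention_map = outputs.attentions[-1].mean(dim=1)[0]  # (seq_len, seq_len)
```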

### Demo Video
A basic demonstration of RAGViz is available [here](https://www.youtube.com/embed/cTAbuTu6ur4?si=-uZ2AyNLx-5p8MZC).

### Configuration

The following are the system configurations of our RAGViz demonstration:
- The [Pile-CC](https://github.com/EleutherAI/pile-cc) English documents are used for retrieval
- Documents are partitioned into 4 [DiskANN](https://github.com/microsoft/DiskANN/) indexes on separate nodes, each holding roughly 20 million documents
- Documents are embedded into feature vectors using [AnchorDR](https://github.com/yiqingxyq/AnchorDR); an embedding sketch follows this list. **To use AnchorDR in RAGViz, follow the installation instructions in the [AnchorDR repository](https://github.com/yiqingxyq/AnchorDR) to ensure your Python environment is set up correctly, after running `pip install -r backend/requirements.txt`.**
- [Llama 2](https://huggingface.co/docs/transformers/v4.34.0/en/model_doc/llama2) generation and attention outputs are produced with [vLLM](https://github.com/vllm-project/vllm) and the [HuggingFace](https://huggingface.co/) transformers library
- The frontend UI is adapted from the [Lepton search engine](https://github.com/leptonai/search_with_lepton)
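
As a rough illustration of the embed-then-index step, the sketch below mean-pools an encoder's hidden states into one vector per document. The hub id and the pooling strategy are assumptions (the AnchorDR repository documents the exact setup); the resulting vectors would then be handed to DiskANN's index builder.

```python
# Rough sketch of the embedding step. The hub id is a guess and the mean
# pooling is an assumption; follow the AnchorDR repository for the exact
# setup. The resulting vectors would be fed to DiskANN's index builder.
import numpy as np
import torch
from transformers import AutoModel, AutoTokenizer

encoder_id = "yiqingxyq/AnchorDR"  # hypothetical hub id; see the AnchorDR repo
tokenizer = AutoTokenizer.from_pretrained(encoder_id)
encoder = AutoModel.from_pretrained(encoder_id)

def embed(texts: list[str]) -> np.ndarray:
    """Mean-pool the last hidden state into one vector per document."""
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = encoder(**batch).last_hidden_state  # (batch, seq, dim)
    mask = batch["attention_mask"].unsqueeze(-1)
    return ((hidden * mask).sum(1) / mask.sum(1)).numpy()

doc_vectors = embed(["First document ...", "Second document ..."])
```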

### Customization

#### Snippets:
You can change the snippets used as RAG context by adding a new file and class in `backend/snippet`, then registering it in `backend/ragviz.py` and `frontend/src/app/components/search.tsx`. We currently offer the following snippet types (a sliding-window sketch follows this list):
- Naive First: represent a document with its first 128 tokens
- Sliding Window: compute the inner product similarity between each 128-token window and the query, and represent the document with its most similar window
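
A minimal sketch of how a Sliding Window snippet class might look. The class name, `embed_fn` interface, and non-overlapping stride are assumptions, but the scoring follows the description above.

```python
# Hypothetical Sliding Window snippet class. The class name, embed_fn
# interface, and non-overlapping 128-token stride are assumptions; the
# scoring mirrors the description above.
import numpy as np

WINDOW = 128  # tokens per window

class SlidingWindowSnippet:
    def __init__(self, embed_fn):
        self.embed_fn = embed_fn  # maps a list of token windows to vectors

    def select(self, doc_tokens: list[int], query_vec: np.ndarray) -> list[int]:
        # Split the document into 128-token windows (stride is an assumption).
        windows = [doc_tokens[i:i + WINDOW]
                   for i in range(0, max(len(doc_tokens) - WINDOW + 1, 1), WINDOW)]
        window_vecs = self.embed_fn(windows)   # (num_windows, dim)
        scores = window_vecs @ query_vec       # inner product with the query
        return windows[int(np.argmax(scores))]  # most similar window
```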

#### Datasets:
New datasets for retrieval can be added by creating a new file and class in `backend/search` and modifying `backend/ragviz.py` accordingly; a skeleton class sketch follows the list below.

We currently provide implementations for the following datasets:
- ClueWeb22-B English documents
- Pile-CC dataset
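
A hypothetical skeleton for such a dataset class; the class name and `search` interface are assumptions rather than RAGViz's actual base class, and a real implementation would query the corresponding DiskANN index.

```python
# Hypothetical skeleton for a new dataset class under backend/search; the
# class name and interface are assumptions, not RAGViz's actual base class.
class MyCorpusSearch:
    def __init__(self, index_path: str):
        self.index_path = index_path  # e.g. the DiskANN index for this corpus

    def search(self, query_vec, k: int = 10) -> list[dict]:
        """Return the top-k documents as {"id": ..., "text": ...} dicts."""
        # Query the ANN index, then fetch the raw document text.
        raise NotImplementedError
```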

#### LLMs:
Any model supported by the [HuggingFace](https://huggingface.co/) transformers library can be used as the LLM backbone.

To use vLLM for fast inference, the LLM backbone must also be supported by vLLM. A list of vLLM-supported models is available [here](https://docs.vllm.ai/en/latest/models/supported_models.html).

You can set the model path used for RAG in `backend/.env.example`. We used `meta-llama/Llama-2-7b-chat-hf` for the demo; a minimal generation sketch follows.
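
For example, a vLLM generation call with the demo checkpoint might look like the sketch below; the `MODEL_PATH` variable name mirrors a typical `.env` entry and is an assumption.

```python
# Minimal vLLM generation sketch with the demo checkpoint. The MODEL_PATH
# variable name mirrors a typical .env entry and is an assumption.
import os
from vllm import LLM, SamplingParams

model_path = os.environ.get("MODEL_PATH", "meta-llama/Llama-2-7b-chat-hf")
llm = LLM(model=model_path)
params = SamplingParams(temperature=0.7, max_tokens=256)
outputs = llm.generate(["Answer using the retrieved context: ..."], params)
print(outputs[0].outputs[0].text)
```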