Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
[USENIX Security 2025] PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models
https://github.com/sleeepeer/PoisonedRAG
ai machine-learning rag retrieval-augmented-generation security trustworthy-ai
- Host: GitHub
- URL: https://github.com/sleeepeer/PoisonedRAG
- Owner: sleeepeer
- Created: 2024-02-09T04:25:56.000Z
- Default Branch: main
- Last Pushed: 2024-06-22T09:37:08.000Z
- Last Synced: 2024-06-22T18:59:59.113Z
- Topics: ai, machine-learning, rag, retrieval-augmented-generation, security, trustworthy-ai
- Language: Python
- Homepage: https://arxiv.org/abs/2402.07867
- Size: 14.8 MB
- Stars: 39
- Watchers: 2
- Forks: 5
- Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- Awesome-LLMSecOps - PoisonedRAG
README
# PoisonedRAG
Official repo of [USENIX Security 2025](https://www.usenix.org/conference/usenixsecurity25) paper: [PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models](https://arxiv.org/abs/2402.07867).
The first knowledge poisoning attack against Retrieval-Augmented Generation (RAG) systems.
## 🚀 News
🎉 Jun 20, 2024: **PoisonedRAG is accepted to [USENIX Security 2025](https://www.usenix.org/conference/usenixsecurity25)!**

🔥 Apr 20, 2024: **If you have any questions or need other code or data, feel free to open an issue or email us!**
![Illustration of PoisonedRAG](PoisonedRAG.jpg "Illustration of PoisonedRAG")
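For intuition about why retrieval can be poisoned at all, consider a toy sketch (illustration only, not the paper's method: PoisonedRAG crafts natural-language texts, whereas this toy injects a raw vector). Under dot-product retrieval, a passage whose embedding aligns with the query embedding crowds out benign passages:

```python
import numpy as np

rng = np.random.default_rng(0)
query = rng.normal(size=16)          # stand-in for a query embedding
corpus = rng.normal(size=(100, 16))  # stand-ins for benign passage embeddings

# Toy poisoned "passage": a vector aligned with the query, so its dot-product
# score is high. (The real attack optimizes text so that the retriever embeds
# it close to the target question.)
poison = 5.0 * query / np.linalg.norm(query)

scores = np.vstack([corpus, poison]) @ query  # dot-product retrieval scores
top_5 = np.argsort(scores)[::-1][:5]
print("poisoned passage retrieved in top-5:", 100 in top_5)  # 100 = poison index
```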
## 🔍 Quick Usage
### 📃 Setup environment
```bash
conda create -n PoisonedRAG python=3.10
```
```bash
conda activate PoisonedRAG
```
```bash
pip install beir openai google-generativeai
pip install torch==1.13.0+cu117 torchvision==0.14.0+cu117 torchaudio==0.13.0 --extra-index-url https://download.pytorch.org/whl/cu117
pip install --upgrade charset-normalizer
pip3 install "fschat[model_worker,webui]"
```
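To verify that the CUDA 11.7 build of PyTorch installed correctly, a quick sanity check (not part of the repo) is:

```python
import torch

# Expect "1.13.0+cu117" and True on a machine with a compatible GPU/driver.
print(torch.__version__)
print(torch.cuda.is_available())
```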
### 💽 Dataset (optional, suggested)
When running our code, the datasets will be downloaded automatically and saved in `datasets`. You can also download them manually with:
```bash
python prepare_dataset.py
```
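The datasets come from the [beir](https://github.com/beir-cellar/beir) benchmark; here is a minimal sketch of fetching one by hand (the URL follows beir's conventions, and the actual logic in `prepare_dataset.py` may differ):

```python
from beir import util
from beir.datasets.data_loader import GenericDataLoader

# Download and unzip the NQ split of BEIR into the local `datasets` folder.
url = "https://public.ukp.informatik.tu-darmstadt.de/thakur/BEIR/datasets/nq.zip"
data_path = util.download_and_unzip(url, "datasets")

# corpus: {doc_id: {"title", "text"}}, queries: {qid: text},
# qrels: {qid: {doc_id: relevance}}
corpus, queries, qrels = GenericDataLoader(data_folder=data_path).load(split="test")
```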
### 🔑 Set API key
If you want to use PaLM 2, GPT-3.5, GPT-4, or LLaMA-2, enter your API key in the **model_configs** folder.
For LLaMA-2, the API key is your **HuggingFace access token**. Visit [LLaMA-2's HuggingFace page](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) first if you don't have one.
Here is an example:
```json
"api_key_info": {
    "api_keys": [
        "Your api key here"
    ],
    "api_key_use": 0
},
```
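For illustration, the key is presumably selected like this (a minimal sketch; the file name `model_configs/palm2_config.json` and the loading code are assumptions, not the repo's actual implementation):

```python
import json

# Hypothetical config path; the real files live in the model_configs folder.
with open("model_configs/palm2_config.json") as f:
    config = json.load(f)

info = config["api_key_info"]
# "api_key_use" is an index choosing which entry of "api_keys" is active.
api_key = info["api_keys"][info["api_key_use"]]
```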
### 📝 Reproduce our results
There are several hyperparameters in **run.py**, such as the target LLM and the dataset:

**Note:** Currently we provide the default settings for the main results in our [paper](https://arxiv.org/abs/2402.07867). We will add the other settings later.
```python
test_params = {
    # beir_info
    'eval_model_code': "contriever",
    'eval_dataset': "nq",  # nq, hotpotqa, msmarco
    'split': "test",
    'query_results_dir': 'main',

    # LLM setting
    'model_name': 'palm2',  # palm2, gpt3.5, gpt4, llama(7b|13b), vicuna(7b|13b|33b)
    'use_truth': False,
    'top_k': 5,
    'gpu_id': 0,

    # attack
    'attack_method': 'LM_targeted',  # LM_targeted (black-box), hotflip (white-box)
    'adv_per_query': 5,
    'score_function': 'dot',
    'repeat_times': 10,
    'M': 10,
    'seed': 12,

    'note': None
}
```

Execute **run.py** to reproduce the experiments:
```bash
python run.py
```
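To run a different configuration, edit `test_params` in **run.py** before executing it; for example (option values taken from the comments above):

```python
# In run.py: switch the evaluation dataset and the target LLM.
test_params['eval_dataset'] = 'hotpotqa'  # one of: nq, hotpotqa, msmarco
test_params['model_name'] = 'gpt3.5'      # e.g., palm2, gpt3.5, gpt4
```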
## Acknowledgement
* Our code uses the implementation of [corpus-poisoning](https://github.com/princeton-nlp/corpus-poisoning).
* The model part of our code is from [Open-Prompt-Injection](https://github.com/liu00222/Open-Prompt-Injection).
* Our code uses the [beir](https://github.com/beir-cellar/beir) benchmark.
* Our code uses [contriever](https://github.com/facebookresearch/contriever) as the retriever for retrieval-augmented generation (RAG).

## Citation
If you use this code, please cite the following [paper](https://arxiv.org/abs/2402.07867):
```tex
@misc{zou2024poisonedrag,
title={PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models},
author={Wei Zou and Runpeng Geng and Binghui Wang and Jinyuan Jia},
year={2024},
eprint={2402.07867},
archivePrefix={arXiv},
primaryClass={cs.CR}
}
```