https://github.com/Allen-piexl/JailbreakZoo

Last synced: 5 months ago
JSON representation

Host: GitHub
URL: https://github.com/Allen-piexl/JailbreakZoo
Owner: Allen-piexl
License: mit
Created: 2024-03-13T21:11:54.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-07-25T03:30:27.000Z (10 months ago)
Last Synced: 2024-08-12T08:13:10.731Z (9 months ago)
Size: 306 KB
Stars: 57
Watchers: 2
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE

Awesome Lists containing this project

Awesome-MLLM-Safety - Github - piexl/JailbreakZoo.svg?style=social&label=Star) (Other)

README

# JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models

## Introduction

Welcome to JailbreakZoo, a dedicated repository focused on the jailbreaking of large models (LMs), encompassing both large language models (LLMs) and vision language models (VLMs). This project aims to explore the vulnerabilities, exploit methods, and defense mechanisms associated with these advanced AI models. Our goal is to foster a deeper understanding and awareness of the security aspects surrounding large-scale AI systems.

Our website can be found in [here](https://chonghan-chen.com/llm-jailbreak-zoo-survey/)

Our paper can be found in [here](https://arxiv.org/pdf/2407.01599)

## Timeline

This repository is systematically organized according to the publication timeline.

:fire::fire::fire: The latest update being September 01, 2024 :fire::fire::fire:

## Contents

- [**Jailbreaks of LLMs**](https://github.com/Allen-piexl/JailbreakingZoo/blob/main/Papers/LLM_Jailbreak.md): Discover the techniques and case studies related to the jailbreaking of large language models.

- [**Defenses of LLMs**](https://github.com/Allen-piexl/JailbreakingZoo/blob/main/Papers/LLM_Defense.md): Explore the strategies and methods employed to defend large language models against various types of attacks.

- [**Jailbreaks of VLMs**](https://github.com/Allen-piexl/JailbreakingZoo/blob/main/Papers/VLM_Jailbreak.md): Learn about the vulnerabilities and jailbreaking approaches specific to vision language models.

- [**Defenses of VLMs**](https://github.com/Allen-piexl/JailbreakingZoo/blob/main/Papers/VLM_Defense.md): Understand the defense mechanisms designed for vision language models, including the most recent advancements and strategies.

## Contributing

We welcome contributions from the community! Whether you're interested in adding new research, improving existing documentation, or sharing your own jailbreak or defense strategies, your insights are valuable to us. Please check our [Contribution Guidelines](https://github.com/Allen-piexl/JailbreakingZoo/blob/main/CONTRIBUTING.md) for more information on how you can get involved.

## License and Citation

This project is available under the [MIT License](https://github.com/Allen-piexl/JailbreakingZoo/blob/main/LICENSE). Please refer to our citation guidelines if you wish to reference our work in your research or publications.

Thank you for visiting JailbreakZoo. We hope this repository serves as a valuable resource in your exploration of large model security.

## Acknowledgement

Special thanks to our notable contributors: [**Haibo Jin**](https://haibojin001.github.io/), [**Leyang Hu**](https://github.com/Leon-Leyang), [**Xinuo Li**](https://github.com/monmonli), [**Peiyan Zhang**](https://peiyance.github.io/), [**Chonghan Chen**](https://paulcccccch.github.io/), [**Jun Zhuang**](https://junzhuang.xyz/), and [**Haohan Wang**](https://haohanwang.github.io/).

*The ranking is in partial order.

## Reference

```bibtex
@article{jin2024jailbreakzoo,
title={JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models},
author={Jin, Haibo and Hu, Leyang and Li, Xinuo and Zhang, Peiyan and Chen, Chonghan and Zhuang, Jun and Wang, Haohan},
journal={arXiv preprint arXiv:2407.01599},
year={2024}
}

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/Allen-piexl/JailbreakZoo

Awesome Lists containing this project

README