An open API service indexing awesome lists of open source software.

https://github.com/jailbreakme-xyz/jailbreak

jailbreakme.xyz is an open-source decentralized app (dApp) where users are challenged to try and jailbreak pre-existing LLMs in order to find weaknesses and be rewarded. 🏆
https://github.com/jailbreakme-xyz/jailbreak

ai bugbounty cryptocurrency cybersecurity prompt-engineering prompt-injection solana solana-program

Last synced: 7 months ago
JSON representation

jailbreakme.xyz is an open-source decentralized app (dApp) where users are challenged to try and jailbreak pre-existing LLMs in order to find weaknesses and be rewarded. 🏆

Awesome Lists containing this project

README

          

# What is JailbreakMe? 🚀

[jailbreakme.xyz](https://www.jailbreakme.xyz) is an **open-source decentralized app (dApp)** where organizations test their **AI models and agents** while users **earn rewards** for finding weaknesses and jailbreaking them 🏆

![image1.jpg](https://cdn.dorahacks.io/static/files/194c2c19da77355c027b6994e4286740.jpg)

# What is an AI Prompt Injection? 💉

**Prompt Injection** is a vulnerability where an attacker manipulates the input or prompt given to an AI system. This can occur:

- By directly controlling the input.
- By using data from other external sources.

# Our Vision

We aim to create a decentralized platform where companies can:

- Test their AI models and agents in a distributed environment.
- Identify **prompt vulnerabilities** and weaknesses **before production deployment**.

# 🏁 How It Works

## 1. Participate:

### 1.1 Choose an agent:

![Screenshot 2025-02-01 at 20.28.00.png](https://cdn.dorahacks.io/static/files/194c2c5594fd945a276d4174a06bd662.png)

### 1.2 Break the LLM Restrictions 🤖

![Screenshot 2025-02-01 at 20.31.36.png](https://cdn.dorahacks.io/static/files/194c2c95384738d8595c6f944e8a8b88.png)

### 1.3 Win the Prize Pool 🏆

![Screenshot 2025-02-01 at 20.32.00.png](https://cdn.dorahacks.io/static/files/194c2c987531c3a7c064c7544a5aca12.png)

# How is the Winner Picked? 🤔

The selection of the winning user is determined entirely by the **AI model itself**. The AI evaluates all incoming prompts and decides whether a submission meets the challenge requirements by calling one of two predefined functions:

1. `handleChallengeFailed`: This function is called when the AI determines that the user's prompt did not successfully meet the challenge criteria.
2. `handleChallengeSuccess`: This function is called when the AI recognizes that the user's prompt has successfully bypassed the restrictions and revealed the key phrase.

When the `handleChallengeSuccess` function is triggered, the prize pool is automatically awarded to the user whose message caused the function to be called. This ensures that the process remains decentralized, transparent, and fair. 🎉

## 2. Launch an agent:

### 2.1 Choose how would you like to create your agent

![Screenshot 2025-02-01 at 20.36.38.png](https://cdn.dorahacks.io/static/files/194c2cdfc0a6514c80fba3c4d1b800f5.png)

### 2.2 Prompt Launch

Describe your agent's personality and behavior. Our AI will generate a complete agent configuration based on your description.

![Screenshot 2025-02-01 at 20.38.21.png](https://cdn.dorahacks.io/static/files/194c2ceb3aa1e48e8de932146b6b2d6c.png)

### 2.3 Quick Creation

Create a simple "Secret Phrase" challenge with default options.

![Screenshot 2025-02-01 at 20.38.42.png](https://cdn.dorahacks.io/static/files/194c2d0afb312097be5bef543cf8898f.png)

### 2.4 Advanced Creation

Multiple configurations + function calls:

[![Advanced Creation Tutorial](https://img.youtube.com/vi/D-YAFGzqExk/0.jpg)](https://www.youtube.com/watch?v=D-YAFGzqExk)

### 2.5 API Integration

Submit the form and we will create a custom integration with your API.

# 📜 Settings & Rules

Each tournament has unique rules, including:

- **Custom Prize Pools**
- **Message Pricing**
- **Expiry Settings**

# 🔗 Useful Links

- **X**: [https://x.com/jailbreakme_xyz](https://x.com/jailbreakme_xyz)
- **Telegram Community**: [https://t.me/jailbreakme_xyz](https://t.me/jailbreakme_xyz)
- **Documentation**: [https://jailbreakme.xyz/docs](https://jailbreakme.xyz/docs)
- **Github Repo**: [https://github.com/jailbreakme-xyz/jailbreak](https://github.com/jailbreakme-xyz/jailbreak)
- **Smart Contract**: [https://solscan.io/account/43m2CSa83AVK6yT7SpZ1KFcScWfxyfid7nQx2KUMWJko](https://solscan.io/account/43m2CSa83AVK6yT7SpZ1KFcScWfxyfid7nQx2KUMWJko)

# Feedback & Support

Feel free to reach out at **dev@jailbreakme.xyz** for feedback or support.

Jailbreak the World 🦍