https://github.com/dreadnode/research
General research for Dreadnode
https://github.com/dreadnode/research
Last synced: 5 months ago
JSON representation
General research for Dreadnode
- Host: GitHub
- URL: https://github.com/dreadnode/research
- Owner: dreadnode
- License: mit
- Created: 2024-03-10T20:55:59.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-06-17T16:00:19.000Z (over 1 year ago)
- Last Synced: 2025-04-15T01:41:05.213Z (9 months ago)
- Language: Jupyter Notebook
- Size: 1.35 MB
- Stars: 21
- Watchers: 1
- Forks: 1
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-rainmana - dreadnode/research - General research for Dreadnode (Jupyter Notebook)
README
# Dreadnode Research
This is a general repository to hold research, projects, reference code, etc. for research we perform at [dreadnode](https://dreadnode.io).
**[Mistral - Adversarial Suffix](notebooks/Mistral%20-%20Adversarial%20Suffix.ipynb)**
Implementation of ["Universal and Transferable Adversarial Attacks on Aligned Language Models"](https://llm-attacks.org) for Mistral 7B.
**[Mistral - BEAST Beam Attack](notebooks/Mistral%20-%20BEAST%20Beam%20Attack.ipynb)**
Implementation of ["Fast Adversarial Attacks on Language Models In One GPU Minute"](https://arxiv.org/pdf/2402.15570.pdf) for Mistral 7B. At the time of release the authors have not posted the reference code from the paper, so this implementation is likely incorrect.
**[Llama PGD](scripts/pgd.py)**
Implementation of ["Attacking Large Language Models with Projected Gradient Descent"](https://arxiv.org/abs/2402.09154) for Llama model variants with LitGPT. At teh time of release the authors have not posted any reference code, so be careful.
**Needle [Triage](notebooks/Needle%20-%20Triage.ipynb)/[Fix](notebooks/Needle%20-%20Fix.ipynb)**
Research in partnership with [OpenSSF](https://openssf.org) for the [AIxCC Event](https://aicyberchallenge.com/).