https://github.com/dreadnode/research

General research for Dreadnode
https://github.com/dreadnode/research

Last synced: 7 months ago
JSON representation

General research for Dreadnode

Host: GitHub
URL: https://github.com/dreadnode/research
Owner: dreadnode
License: mit
Created: 2024-03-10T20:55:59.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2024-06-17T16:00:19.000Z (over 1 year ago)
Last Synced: 2025-04-15T01:41:05.213Z (10 months ago)
Language: Jupyter Notebook
Size: 1.35 MB
Stars: 21
Watchers: 1
Forks: 1
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-rainmana - dreadnode/research - General research for Dreadnode (Jupyter Notebook)

README

          # Dreadnode Research

This is a general repository to hold research, projects, reference code, etc. for research we perform at [dreadnode](https://dreadnode.io). 

**[Mistral - Adversarial Suffix](notebooks/Mistral%20-%20Adversarial%20Suffix.ipynb)**

Implementation of ["Universal and Transferable Adversarial Attacks on Aligned Language Models"](https://llm-attacks.org) for Mistral 7B.

**[Mistral - BEAST Beam Attack](notebooks/Mistral%20-%20BEAST%20Beam%20Attack.ipynb)**

Implementation of ["Fast Adversarial Attacks on Language Models In One GPU Minute"](https://arxiv.org/pdf/2402.15570.pdf) for Mistral 7B. At the time of release the authors have not posted the reference code from the paper, so this implementation is likely incorrect.

**[Llama PGD](scripts/pgd.py)**

Implementation of ["Attacking Large Language Models with Projected Gradient Descent"](https://arxiv.org/abs/2402.09154) for Llama model variants with LitGPT. At teh time of release the authors have not posted any reference code, so be careful.

**Needle [Triage](notebooks/Needle%20-%20Triage.ipynb)/[Fix](notebooks/Needle%20-%20Fix.ipynb)**

Research in partnership with [OpenSSF](https://openssf.org) for the [AIxCC Event](https://aicyberchallenge.com/).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/dreadnode/research

Awesome Lists containing this project

README