An open API service indexing awesome lists of open source software.

https://github.com/dreadnode/research

General research for Dreadnode
https://github.com/dreadnode/research

Last synced: 5 months ago
JSON representation

General research for Dreadnode

Awesome Lists containing this project

README

          

# Dreadnode Research

This is a general repository to hold research, projects, reference code, etc. for research we perform at [dreadnode](https://dreadnode.io).

**[Mistral - Adversarial Suffix](notebooks/Mistral%20-%20Adversarial%20Suffix.ipynb)**

Implementation of ["Universal and Transferable Adversarial Attacks on Aligned Language Models"](https://llm-attacks.org) for Mistral 7B.

**[Mistral - BEAST Beam Attack](notebooks/Mistral%20-%20BEAST%20Beam%20Attack.ipynb)**

Implementation of ["Fast Adversarial Attacks on Language Models In One GPU Minute"](https://arxiv.org/pdf/2402.15570.pdf) for Mistral 7B. At the time of release the authors have not posted the reference code from the paper, so this implementation is likely incorrect.

**[Llama PGD](scripts/pgd.py)**

Implementation of ["Attacking Large Language Models with Projected Gradient Descent"](https://arxiv.org/abs/2402.09154) for Llama model variants with LitGPT. At teh time of release the authors have not posted any reference code, so be careful.

**Needle [Triage](notebooks/Needle%20-%20Triage.ipynb)/[Fix](notebooks/Needle%20-%20Fix.ipynb)**

Research in partnership with [OpenSSF](https://openssf.org) for the [AIxCC Event](https://aicyberchallenge.com/).