Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome_LLM-harmful-fine-tuning-papers
A survey on harmful fine-tuning attacks for large language models
https://github.com/git-disl/awesome_LLM-harmful-fine-tuning-papers
Content
Defenses
- 2024/10/13
- 2024/3/8
- 2024/2/2 - disl/Vaccine)]
- 2024/5/23 - noising)]
- 2024/5/24 - Safety-41C2)] [[Openreview](https://openreview.net/forum?id=NrfP7zZNiG)]
- 2024/8/1 - tamirisa/tamper-resistance)]
- 2024/9/3 - disl/Booster)] [[Openreview](https://openreview.net/forum?id=tTPHgb0EtV)]
- 2024/10/13 - Vaccine)]
- 2023/8/25
- 2023/9/14 - tuned-llamas)]
- 2024/2/3 - zong/VLGuard)]
- 2024/2/7 - attribution-code)]
- 2024/2/22 - Enhanced-Alignment)]
- 2024/2/28
- 2024/5/28 - disl/Lisa)]
- 2024/6/10 - vs-deep-alignment)] [[Openreview](https://openreview.net/forum?id=6Mxhg9PtDE)]
- 2024/6/12
- 2024/8/27
- 2024/10/05
- 2024/10/05
- 2024/10/05
- 2024/10/05
- 2024/10/05
- 2024/5/15
- 2024/5/23
- 2024/5/27
- 2024/8/18
- 2024/10/05
- 2024/10/05
- 2024/8/1 - tamirisa/tamper-resistance)]
- 2024/8/30
Attacks and Defenses for Federated Fine-tuning
Attacks
- 2024/7/29 - editing/editing-attack)]
- 2023/10/5
- 2024/10/21
- 2024/10/23
- 2023/10/4
- 2023/10/5 - Tuning-Safety/LLMs-Finetuning-Safety)]
- 2023/10/31
- 2023/11/9
- 2024/4/1 - nlp/benign-data-breaks-safety)]
- 2024/5/28
- 2024/6/28
Other awesome resources on LLM safety
Mechanical Study
- 2024/5/25
- 2024/5/27
- 2024/10/05
- 2024/10/05
- 2024/11/13
- 2024/10/05 - Finetuning-Attacks)]
Benchmark
- 2024/9/19 - noising-xpo)]