"malicious" Awesome Lists
awesome_LLM-harmful-fine-tuning-papers
A survey on harmful fine-tuning attacks for large language models
alignment attack defense emergent fine-tuning finetuning harmful llms malicious misalignment
208 stars
6 forks
125 projects
Last updated: 22 Sep 2025