"misalignment" Awesome Lists
awesome_LLM-harmful-fine-tuning-papers
A survey on harmful fine-tuning attacks for large language models
alignment attack defense emergent fine-tuning finetuning harmful llms malicious misalignment
232 stars
7 forks
169 projects
Last updated: 02 Feb 2026