0 "misalignment" Awesome Lists
awesome_LLM-harmful-fine-tuning-papers
A survey on harmful fine-tuning attack for large language model
alignment attack defense emergent fine-tuning finetuning harmful llms malicious misalignment
230 stars
7 forks
159 projects
Last updated: 10 Jan 2026