An open API service indexing awesome lists of open source software.

https://github.com/git-disl/Virus

This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"
https://github.com/git-disl/Virus

attack defense fine-tuning guardrail harmful llms moderation safety

Last synced: 7 months ago
JSON representation

This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"

Awesome Lists containing this project