This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation" - View it on GitHub
Star
53
Rank
466056