This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024) - View it on GitHub
Star
25
Rank
802498