Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
IBM
Fetched on 2025/01/20 09:02
IBM
/
Gradient-Cuff
Repo for NeurIPS 2024 paper "Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes" -
View it on GitHub
Star
1
Rank
5327228