Code for reproducing the results from the paper 'Effective Red-Teaming of Policy-Adherent Agents' - View it on GitHub
Star
1
Rank
5731044