This is a code base that implements the core algorithm of the paper Contrastive Activation Steering for Activation Training (CASAL). - View it on GitHub
Star
5
Rank
2475999