This codebase implements the core algorithm of the paper "Contrastive Activation Steering for Activation Training" (CASAL).