This repository contains the code for most of the experiments in the paper "Understanding the Effects of RLHF on LLM Generalisation and Diversity".