This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity - View it on GitHub
Star
38
Rank
515046