This is the code for most of the experiments in the paper "Understanding the Effects of RLHF on LLM Generalisation and Diversity".