nanoRLHF: from-scratch journey into how LLMs and RLHF really work.
Stars: 180, Rank: 192780