Gitstar Ranking
Fetched on 2026/03/14 07:16
ddworken/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" - https://arxiv.org/abs/2204.05862
Stars: 0
Rank: 13752213