Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
win4r
Fetched on 2026/06/23 05:50
win4r
/
RLHF-make-review-positive
RLHF-based text generation optimization using PPO and reward modeling with Hugging Face TRL. -
View it on GitHub
Star
0
Rank
14037453