Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
lucidrains
Fetched on 2025/10/31 04:15
lucidrains
/
llama-qrlhf
Implementation of the Llama architecture with RLHF + Q-learning -
View it on GitHub
Star
167
Rank
187271