Implementation of the Llama architecture with RLHF + Q-learning - View it on GitHub
Star
168
Rank
189776