ashishpatel26/LLM-RLHF-Tuning - Gitstar Ranking

Fetched on 2026/07/12 04:44

ashishpatel26 / LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Stars: 1
Rank: 6110868

Released by @k0kubun in December 2014.