jackaduma/Vicuna-LoRA-RLHF-PyTorch

jackaduma

Fetched on 2026/06/23 00:55

A full pipeline to fine-tune the Vicuna LLM with LoRA and RLHF on consumer hardware. It implements RLHF (Reinforcement Learning from Human Feedback) on top of the Vicuna architecture: essentially ChatGPT, but with Vicuna.

Stars: 220

Rank: 164213
