jackaduma/ChatGLM-LoRA-RLHF-PyTorch

jackaduma

Fetched on 2026/06/23 00:55

A full pipeline to fine-tune the ChatGLM LLM with LoRA and RLHF on consumer hardware. An implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT, but with ChatGLM.

Stars: 138

Rank: 239944
