mytechnotalent/gpt_from_scratch

mytechnotalent

Fetched on 2026/01/31 17:37

This notebook builds a complete GPT (Generative Pre-trained Transformer) model from scratch using PyTorch. It covers tokenization, self-attention, multi-head attention, transformer blocks, and text generation and all explained step-by-step with a simple nursery rhyme corpus. - View it on GitHub

Star

Rank

5730283

mytechnotalent

mytechnotalent / gpt_from_scratch