Implementation of a small GPT-style transformer from scratch in PyTorch. Learn how Large Language Models work by building, training, generating text, and visualizing attention. - View it on GitHub
Star
9
Rank
1711695