Implementation of a small GPT-style transformer from scratch in PyTorch. Learn how Large Language Models work by building, training, generating text, and visualizing attention. - View it on GitHub
Star
4
Rank
2766351