This is a simple implementation of a GPT-like decoder-only transformer model. The model is built up step by step, starting from a simple bigram language model and ending with a full transformer. The code is written in PyTorch and is meant to be as simple as possible; efficiency is not a concern.
The code is largely based on Andrej Karpathy's nanoGPT lecture. All credit goes to him.
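For reference, a minimal sketch of the bigram starting point might look like the following. This is only illustrative of the first step described above; the class and variable names are placeholders and not necessarily the ones used in this repo.

```python
import torch
import torch.nn as nn
from torch.nn import functional as F

class BigramLanguageModel(nn.Module):
    """Predicts the next token from the current token alone via an embedding lookup."""

    def __init__(self, vocab_size):
        super().__init__()
        # Each token directly reads off the logits for the next token from a lookup table.
        self.token_embedding_table = nn.Embedding(vocab_size, vocab_size)

    def forward(self, idx, targets=None):
        # idx: (B, T) tensor of token indices
        logits = self.token_embedding_table(idx)  # (B, T, vocab_size)
        if targets is None:
            return logits, None
        B, T, C = logits.shape
        loss = F.cross_entropy(logits.view(B * T, C), targets.view(B * T))
        return logits, loss

    def generate(self, idx, max_new_tokens):
        # Sample autoregressively, appending each new token to the running context.
        for _ in range(max_new_tokens):
            logits, _ = self(idx)
            probs = F.softmax(logits[:, -1, :], dim=-1)  # only the last time step matters
            idx_next = torch.multinomial(probs, num_samples=1)  # (B, 1)
            idx = torch.cat((idx, idx_next), dim=1)
        return idx
```

The later steps replace this single lookup table with token and position embeddings, self-attention blocks, and a language-model head, arriving at the full transformer.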