Caveman GPT

A very basic implementation of the GPT model described by Radford et al. (2019).
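For orientation, the core operation in a GPT-style model is causal self-attention: each position attends only to itself and to earlier positions. The sketch below is a rough illustration in plain Python and is not taken from this repository's code. It assumes a single head and takes the query, key, and value vectors directly, omitting the learned projections. The names `softmax` and `causal_self_attention` are hypothetical.

```python
import math

def softmax(xs):
    # Subtract the maximum before exponentiating for numerical stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def causal_self_attention(q, k, v):
    """Single-head causal attention over lists of vectors, one per position.

    Illustrative only: a real model would first derive q, k, v from the
    input with learned linear projections, which are omitted here.
    """
    d = len(q[0])
    out = []
    for t in range(len(q)):
        # Position t may only look at positions 0..t (the causal mask).
        scores = [
            sum(qi * ki for qi, ki in zip(q[t], k[s])) / math.sqrt(d)
            for s in range(t + 1)
        ]
        weights = softmax(scores)
        # Output is the attention-weighted average of the visible values.
        out.append([
            sum(w * v[s][j] for s, w in enumerate(weights))
            for j in range(len(v[0]))
        ])
    return out

# Toy input: three positions, two features each.
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
y = causal_self_attention(x, x, x)
# The first position can only attend to itself, so y[0] equals x[0].
```

Because of the mask, truncating the input does not change the outputs for the positions that remain, which is what lets such a model generate text one token at a time.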

References

@article{radford2019language,
  title={Language Models are Unsupervised Multitask Learners},
  author={Radford, Alec and Wu, Jeff and Child, Rewon and Luan, David and Amodei, Dario and Sutskever, Ilya},
  year={2019}
}