GPT-2 Model Implementation from Scratch

This project demonstrates how to implement the GPT-2 model (124M) from scratch using PyTorch. The script loads pre-trained weights released by OpenAI, configures the model, and generates text based on a provided prompt.

Requirements

Python 3.x
PyTorch (CUDA/MPS support recommended)
gpt_download3
LLMArchitecture (custom modules)
DataPreprocessing (custom modules)

Install Dependencies

Ensure all required dependencies are installed in your environment:

pip install -r requirements.txt

Run

The model with the GPT-2 weights can be run by the command:

python run.py

The input can be modified in the run.py to see different outputs.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.ipynb_checkpoints		.ipynb_checkpoints
BuildingGPT2		BuildingGPT2
README.md		README.md
gpt_download3.py		gpt_download3.py
requirements.txt		requirements.txt
the-verdict.txt		the-verdict.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPT-2 Model Implementation from Scratch

Requirements

Install Dependencies

Run

About

Releases

Packages

Languages

Priya-753/GPT2

Folders and files

Latest commit

History

Repository files navigation

GPT-2 Model Implementation from Scratch

Requirements

Install Dependencies

Run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages