Instructions to Run Our Code

Part 1

With quantization: run python run_part1_with_quant.py

Without quantization: run python run_part1_no_quant.py

Standard vs Speculative decoding: run python run_part1_decoding.py

Part 2

Protocol A:

run python run_part2_a.py --prune_method='individual' for individual weights pruning and python run_part2_a.py --prune_method='l2norm' for structured pruning

Protocol B: run python run_part2_b.py --prune_method='individual' for individual weights pruning and python run_part2_b.py --prune_method='l2norm' for structured pruning

Leaderboard

run python run_experiments.py which generates results.json

Name		Name	Last commit message	Last commit date
Latest commit History 223 Commits
assets		assets
config		config
data		data
optimization		optimization
.gitattributes		.gitattributes
.gitignore		.gitignore
CS229S_Final_Report.pdf		CS229S_Final_Report.pdf
LICENSE		LICENSE
README.md		README.md
README_old.md		README_old.md
bench.py		bench.py
configurator.py		configurator.py
inference.py		inference.py
model.py		model.py
results.json		results.json
run_experiments.py		run_experiments.py
run_part1_decoding.py		run_part1_decoding.py
run_part1_no_quant.py		run_part1_no_quant.py
run_part1_with_quant.py		run_part1_with_quant.py
run_part2_a.py		run_part2_a.py
run_part2_b.py		run_part2_b.py
sample.py		sample.py
scaling_laws.ipynb		scaling_laws.ipynb
train.py		train.py
transformer_sizing.ipynb		transformer_sizing.ipynb
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Instructions to Run Our Code

Part 1

Part 2

Leaderboard

About

Releases

Packages

Languages

License

blahBlahhhJ/cs229s-nanoGPT

Folders and files

Latest commit

History

Repository files navigation

Instructions to Run Our Code

Part 1

Part 2

Leaderboard

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages