Skip to content

LLM papers I'm reading, mostly on inference and model compression

Notifications You must be signed in to change notification settings

evanmiller/LLM-Reading-List

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 

Repository files navigation

Just helping myself keep track of LLM papers that I‘m reading, with an emphasis on inference and model compression.

Transformer Architectures

Foundation Models

Position Encoding

KV Cache

Activation

Pruning

Quantization

Normalization

Sparsity and rank compression

Fine-tuning

Sampling

Scaling

Mixture of Experts

Watermarking

More

About

LLM papers I'm reading, mostly on inference and model compression

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published