Fine-tuning an LLM to predict stock sentiment from headlines. This project trains distilbert-base-uncased on this dataset.
Well, all my GPUs are busy with CUDA experiments, so I have no VRAM to spare and am stuck training on my M1 MacBook. We could probably get better results with a different model.
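For context, here's roughly what the training loop looks like with the Hugging Face Trainer. This is a sketch, not the actual training script: the CSV path, column names, and label scheme below are placeholders. On Apple Silicon, recent PyTorch builds expose the GPU through the "mps" backend and the Trainer picks it up automatically.

```python
# Rough sketch: fine-tune distilbert-base-uncased for 3-way headline sentiment.
# The CSV file and its "headline"/"label" columns are placeholders.
from datasets import load_dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

MODEL = "distilbert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForSequenceClassification.from_pretrained(MODEL, num_labels=3)

# Placeholder dataset: integer labels, e.g. 0=negative, 1=neutral, 2=positive.
data = load_dataset("csv", data_files="headlines.csv")["train"].train_test_split(test_size=0.2)

def tokenize(batch):
    return tokenizer(batch["headline"], truncation=True, padding="max_length", max_length=64)

tokenized = data.map(tokenize, batched=True)

# No device flags needed: Trainer uses the MPS backend on Apple Silicon when available.
args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=16,
    num_train_epochs=3,
    logging_steps=50,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["test"],
)
trainer.train()
print(trainer.evaluate())
```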
- Add checks for overfitting (see the early-stopping sketch below)
- Refactor, isolate, and clean up the training and inference code
- Preferably find a better model
- Switch from full fine-tuning to LoRA or QLoRA (see the LoRA sketch below)
- More training data
- Add a way to pull in headlines from GNews (dreading using bs4; see the RSS sketch below)
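On the overfitting checks: the cheapest option is probably a held-out split plus early stopping on eval loss. A rough sketch with the Trainer's built-in callback, reusing the model and splits from the sketch above; note the eval_strategy argument was named evaluation_strategy in older transformers releases.

```python
# Sketch: early stopping on validation loss as a basic overfitting guard.
from transformers import EarlyStoppingCallback, Trainer, TrainingArguments

args = TrainingArguments(
    output_dir="out",
    num_train_epochs=10,
    eval_strategy="epoch",           # evaluate every epoch
    save_strategy="epoch",
    load_best_model_at_end=True,     # keep the checkpoint with the best eval loss
    metric_for_best_model="eval_loss",
    greater_is_better=False,
)

trainer = Trainer(
    model=model,                     # model / tokenized splits from the sketch above
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["test"],
    callbacks=[EarlyStoppingCallback(early_stopping_patience=2)],  # stop after 2 epochs with no improvement
)
trainer.train()
```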
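On the LoRA/QLoRA item: peft makes LoRA a thin wrapper around the same model. The target modules below are my guess at DistilBERT's attention projections (q_lin/v_lin), and as far as I know bitsandbytes (needed for QLoRA's 4-bit quantization) is still effectively CUDA-only, so plain LoRA is the realistic option on the MacBook.

```python
# Sketch: wrap DistilBERT in a LoRA adapter with peft instead of full fine-tuning.
from peft import LoraConfig, TaskType, get_peft_model
from transformers import AutoModelForSequenceClassification

base = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=3)

lora_config = LoraConfig(
    task_type=TaskType.SEQ_CLS,
    r=8,                                # adapter rank
    lora_alpha=16,
    lora_dropout=0.1,
    target_modules=["q_lin", "v_lin"],  # DistilBERT's query/value projection layers (my guess)
)

model = get_peft_model(base, lora_config)
model.print_trainable_parameters()      # should report well under 1% of weights as trainable
# `model` then drops straight into the existing Trainer setup.
```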
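On pulling headlines: one way to avoid bs4 entirely is an RSS feed plus feedparser. A rough sketch against the Google News RSS search endpoint; the URL and query format are assumptions on my part, and if GNews here means the gnews.io API instead, that's a plain JSON REST call and even simpler.

```python
# Sketch: pull recent headlines for a query from the Google News RSS search feed.
# The feed URL is an assumption; swap in the gnews.io API if that's what we end up using.
from urllib.parse import quote_plus

import feedparser

def fetch_headlines(query: str, limit: int = 20) -> list[str]:
    url = f"https://news.google.com/rss/search?q={quote_plus(query)}&hl=en-US&gl=US&ceid=US:en"
    feed = feedparser.parse(url)
    return [entry.title for entry in feed.entries[:limit]]

if __name__ == "__main__":
    for headline in fetch_headlines("AAPL stock"):
        print(headline)
```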
I'm not super well versed in the LLM game (yet), so if anyone wants to help or has ideas, feel free to reach out.