# SMART-LLaVa

Code for training and evaluating a prompt-tuned LLaVa model on multimodal SMART-101 data.

## SMART Dataset

Download the dataset using:

```sh
wget https://zenodo.org/records/7775984/files/SMART101-release-v1.zip
unzip SMART101-release-v1.zip
```

## Zero-Shot Model Baselines

Run Model_baselines.ipynb to evaluate each of the models on the test data; these results form the zero-shot generalization baselines.
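For reference, a minimal sketch of what one zero-shot pass looks like with the transformers LLaVa API; the checkpoint id, image path, and question below are placeholders, and the notebook's own setup may differ:

```python
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

# Placeholder checkpoint; the notebook may evaluate several models.
model_id = "llava-hf/llava-1.5-7b-hf"
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Placeholder puzzle image and question from the SMART-101 release.
image = Image.open("SMART101-release-v1/puzzle.png")
prompt = "USER: <image>\nWhat is the answer to this puzzle? ASSISTANT:"

inputs = processor(text=prompt, images=image, return_tensors="pt")
inputs = inputs.to(model.device, torch.float16)
output = model.generate(**inputs, max_new_tokens=64)
print(processor.decode(output[0], skip_special_tokens=True))
```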

## LLaVa model prompt tuning

LLaVa_baselines.py prompt-tunes a LLaVa/BakLLaVa model on SMART-101 data. The learnable prompt tokens are injected between the image tokens and the language-instruction tokens.
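The injection itself happens inside LLaVa_baselines.py; as a rough illustration of the idea (not the repo's actual code), a hand-rolled soft-prompt module might splice learnable embeddings between the two token streams like this:

```python
import torch
import torch.nn as nn

class SoftPromptInjector(nn.Module):
    """Hypothetical sketch: learnable prompt embeddings spliced between
    the image embeddings and the instruction-text embeddings."""

    def __init__(self, num_tokens: int, hidden_size: int):
        super().__init__()
        # Randomly initialized soft prompt; only these weights are trained.
        self.prompt = nn.Parameter(torch.randn(num_tokens, hidden_size) * 0.02)

    def forward(self, image_embeds: torch.Tensor, text_embeds: torch.Tensor) -> torch.Tensor:
        # image_embeds: (batch, n_img, d); text_embeds: (batch, n_txt, d)
        batch = image_embeds.size(0)
        prompt = self.prompt.unsqueeze(0).expand(batch, -1, -1)
        # Sequence layout: [image tokens] [soft prompt] [instruction tokens]
        return torch.cat([image_embeds, prompt, text_embeds], dim=1)
```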

The script trains the model and uploads it to the Hugging Face Hub. Export your Hub token as $HUGGINGFACE_TOKEN before running, and install the following packages first:

```sh
pip install transformers
pip install peft
pip install bitsandbytes==0.41.3 accelerate==0.25.0
python3 LLaVa_baselines.py
```
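A hedged sketch of how the token might be consumed, assuming the script reads it from the environment; the repo id shown is a placeholder, not the name the script actually uses:

```python
import os
from huggingface_hub import login

# Hypothetical usage: authenticate with the token expected in $HUGGINGFACE_TOKEN.
login(token=os.environ["HUGGINGFACE_TOKEN"])

# After training, the tuned model can be pushed to the Hub;
# "your-username/smart-llava" is a placeholder repo id.
# model.push_to_hub("your-username/smart-llava")
```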

## Evaluating puzzle types

For the per-puzzle-type evaluation, see LLaVa_types.ipynb and resnet.py.
