Finetune Clip

Installing

Run the following

cd open_clip/
make install

Then install pytorch and torchvision and run.

make install-training

To install the benchmarking stuff run the following in the project root.

make install

Current Project Structure

Benchmarks: Separate scripts and files for testing all open_clip models

Misc-Testing: random visualizations

Scripts: Scripts to finetune CLIP

Benchmark-scripts: Scripts to run benchmarks on different CLIP models

Benchmark-results: Results from benchmarking open_clip models

Open-Clip: src

Running Finetuning Scripts

Very similar to open_clip, but with a separate script for finetuning instead of training. i.e.

export MASTER_PORT=12802

master_addr=$(scontrol show hostnames "$SLURM_JOB_NODELIST" | head -n 1)
export MASTER_ADDR=$master_addr

torchrun --nproc_per_node 1 -m finetune.main \
    --dataset-type "csv" \
    --train-data "/path/to/dataset.csv" \
    --warmup 1000 \
    --batch-size 128 \
    --lr 1e-5 \
    --wd 0.1 \
    --epochs 1 \
    --workers 2 \
    --model "ViT-B-32" \
    --lora "10:1" \
    --report-to "wandb" \
    --log-every-n-steps 100

Additional Scripts can be found in scripts/, I will try to make slurm and sh pairs for every script so that it can easily be run on TACC.

Running Benchmarking Scripts

python benchmark/main.py \
    --dataset "/path/to/dataset" \
    --dataset-ann "/path/to/annotation.csv" \
    --pretrained "path/to/checkpoint" \ # This can also be the name of an online source
    --finetune-path "logs/run-name" \ # Path to Log directory of a finetune run
    --name "Name_of_Benchmark_Run" \
    --k 1 5 \
    --batchsize 256

Running Benchmarking on local logs will infer how to construct the model. Otherwise, just specify model specifics when calling the benchmarking script.

Adding New Finetuning Methods

Go to finetune.configure_finetune to add a new method to finetune clip models. Currently the following methods are supported.

LoRA
Layer Freezing

ColBERT Style Loss Functions

We have several customizations to Colbert style losses.

    --colbert \ # Use a ColBert style loss
    --colbert-dropout 0.1 \ # The probability of randomly dropping tokens before taking the MaxSim Loss
    --colbert-local-contrastive "loss-type" \ # Whether the MaxSim will be taken with respect to "patch-wise", "token-wise", or "all"
    --colbert-global-contrastive "loss-type" \ # Whether the contrastive loss will be taken w.r.t. "image-wise", "text-wise", or "all"

SPARC Style Loss Functions

We also support loss functions for Sparc

    --sparc \ # Use SPARC style loss
    --sparc-global-lambda 1.0 \ # Set the global importance for SPARC Loss
    --sparc-local-lambda 1.0 \ # Set the local importance for SPARC Loss

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
benchmark-results		benchmark-results
benchmark-scripts		benchmark-scripts
benchmark		benchmark
misc_test		misc_test
open_clip		open_clip
scripts		scripts
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
convert_dataset.py		convert_dataset.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Finetune Clip

Installing

Current Project Structure

Running Finetuning Scripts

Running Benchmarking Scripts

Adding New Finetuning Methods

ColBERT Style Loss Functions

SPARC Style Loss Functions

About

Releases

Packages

Contributors 2

Languages

Hprairie/finetune-clip

Folders and files

Latest commit

History

Repository files navigation

Finetune Clip

Installing

Current Project Structure

Running Finetuning Scripts

Running Benchmarking Scripts

Adding New Finetuning Methods

ColBERT Style Loss Functions

SPARC Style Loss Functions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages