Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A question about finetuning on RSICD #12

Open
mcx-mcx opened this issue Jan 3, 2024 · 5 comments
Open

A question about finetuning on RSICD #12

mcx-mcx opened this issue Jan 3, 2024 · 5 comments

Comments

@mcx-mcx
Copy link

mcx-mcx commented Jan 3, 2024

Thank you very much for your outstanding work!
I have a question that I haven't quite understood. When fine-tuning your RS5M model on RSICD or RSITMD using the methods outlined in the paper (infoNCE, lr=1e-6), I did not achieve the expected performance. Taking RSICD as an example, the paper and the weights you provided for RS5M RET-2 version result in an accuracy around 38, but when I fine-tuned using my own RS5M VitB32 version, the result was around 34. Could you provide more details on fine-tuning RET-2 or RSICD so that I can better replicate the process? Thank you very much.

@zilunzhang
Copy link
Collaborator

Thank you very much for your outstanding work! I have a question that I haven't quite understood. When fine-tuning your RS5M model on RSICD or RSITMD using the methods outlined in the paper (infoNCE, lr=1e-6), I did not achieve the expected performance. Taking RSICD as an example, the paper and the weights you provided for RS5M RET-2 version result in an accuracy around 38, but when I fine-tuned using my own RS5M VitB32 version, the result was around 34. Could you provide more details on fine-tuning RET-2 or RSICD so that I can better replicate the process? Thank you very much.

Hi

We used the ITRA codebase to tune the model.

The RET-2 data was made by: ChenDelong1999/RemoteCLIP#13 (comment)

Hyperparams for RET-2 model:

--lr: 5e-06
--weight_decay: 0.5
--batch_size: 600
--epochs: 7
--warmup 100
--max_grad_norm: 50.0

Best,

Zilun

@mcx-mcx
Copy link
Author

mcx-mcx commented Jan 3, 2024

Thank you very much!

@mcx-mcx mcx-mcx closed this as completed Jan 3, 2024
@lemyx
Copy link

lemyx commented Feb 20, 2024

Thanks for your great work!

I wanna reproduce the following results, could you share the hyperparameters related to lr, weight_decay, bs, epochs, warmup and max_grad_norm?

image image

@zilunzhang
Copy link
Collaborator

Thanks for your great work!

I wanna reproduce the following results, could you share the hyperparameters related to lr, weight_decay, bs, epochs, warmup and max_grad_norm?

image image

Sure. Could you leave us an email address?

@r-deo
Copy link

r-deo commented Nov 28, 2024

@zilunzhang I was trying the reproduce FT on RS5M. But, I was not able to reproduce. Would it be possible to share the hyper parameter and training related modifications?

Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants