add qwen2 support for pretraining and finetuning #1573

TobyYang7 · 2024-06-25T07:54:57Z

No description provided.

NicoZenith · 2024-07-31T16:24:35Z

Hi thank you for sharing!
I checked your commit, looks good!
However, the slurm script for fine-tuning with qwen is calling the model_path 1.5 13b. Could you provide the training script with the corresponding qwen model?

How did the fine-tuning work so far with qwen 2?

Many thanks!

TobyYang7 · 2024-07-31T17:30:36Z

Hi,

I’ve updated the script as requested.

Due to computational resource limitations, I only tested the Qwen2-1.5B model. Considering model's parameters, the performance was still quite satisfactory.

Here are the MMMU (validation) results:

Subject	Data Num	Acc
Overall-Art and Design	120	0.35
Art	30	0.3
Art_Theory	30	0.467
Design	30	0.467
Music	30	0.167
Overall-Business	150	0.22
Accounting	30	0.267
Economics	30	0.133
Finance	30	0.2
Manage	30	0.3
Marketing	30	0.2
Overall-Science	150	0.267
Biology	30	0.167
Chemistry	30	0.267
Geography	30	0.233
Math	30	0.333
Physics	30	0.333
Overall-Health and Medicine	150	0.267
Basic_Medical_Science	30	0.233
Clinical_Medicine	30	0.333
Diagnostics_and_Laboratory_Medicine	30	0.167
Pharmacy	30	0.267
Public_Health	30	0.333
Overall-Humanities and Social Science	120	0.458
History	30	0.467
Literature	30	0.7
Sociology	30	0.4
Psychology	30	0.267
Overall-Tech and Engineering	210	0.3
Agriculture	30	0.367
Architecture_and_Engineering	30	0.3
Computer_Science	30	0.1
Electronics	30	0.2
Energy_and_Power	30	0.4
Materials	30	0.333
Mechanical_Engineering	30	0.4
Overall	900	0.303

Many thanks!

NicoZenith · 2024-07-31T20:23:11Z

amazing thanks for your commit!
Btw, have you tried with Lora fine-tuning?

TobyYang7 · 2024-08-01T08:15:51Z

yes, the script is as same as llava-1.5

TobyYang7 · 2024-08-01T08:16:13Z

also, you can continue sft on the existing qwen model

add qwen2 support for pretraining and finetuning

2d7f0e0

update slurm script for qwen2

24c4e35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add qwen2 support for pretraining and finetuning #1573

add qwen2 support for pretraining and finetuning #1573

TobyYang7 commented Jun 25, 2024

NicoZenith commented Jul 31, 2024

TobyYang7 commented Jul 31, 2024 •

edited

Loading

NicoZenith commented Jul 31, 2024

TobyYang7 commented Aug 1, 2024

TobyYang7 commented Aug 1, 2024

add qwen2 support for pretraining and finetuning #1573

Are you sure you want to change the base?

add qwen2 support for pretraining and finetuning #1573

Conversation

TobyYang7 commented Jun 25, 2024

NicoZenith commented Jul 31, 2024

TobyYang7 commented Jul 31, 2024 • edited Loading

NicoZenith commented Jul 31, 2024

TobyYang7 commented Aug 1, 2024

TobyYang7 commented Aug 1, 2024

TobyYang7 commented Jul 31, 2024 •

edited

Loading