Recommended approach for custom validation #1115
Comments
The easiest way, I suppose, which modifies no fairseq code, is to run a separate script over each of the validation checkpoints to calculate your SARI/FKGL, then select the best checkpoint that way. The second easiest way, which involves small modifications, is to insert this calculation at validation time, so that when the model prints validation metrics your metric is calculated as well, and you can grep the log files for your desired checkpoint. Otherwise, yes, you would need to modify the fairseq code to change to your custom metric if you want to override it.
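A minimal sketch of that first approach (a separate script over saved checkpoints). The data-bin path, the checkpoints/ directory, and the compute_sari helper are assumptions you would adapt to your setup; only the fairseq-generate flags (--path, --gen-subset, --beam) are standard.

```python
#!/usr/bin/env python
"""Sketch: pick the best checkpoint by an external metric (e.g. SARI)."""
import glob
import re
import subprocess


def generate(checkpoint, data_bin="data-bin", subset="valid"):
    """Run fairseq-generate for one checkpoint and return parallel lists of
    (sources, references, hypotheses) parsed from the S-/T-/H- output lines."""
    out = subprocess.run(
        ["fairseq-generate", data_bin,
         "--path", checkpoint,
         "--gen-subset", subset,
         "--beam", "5"],
        capture_output=True, text=True, check=True,
    ).stdout

    def grab(prefix):
        # Lines look like "S-12<tab>text", "T-12<tab>text", "H-12<tab>score<tab>text"
        return {int(m.group(1)): m.group(2).split("\t")[-1]
                for m in re.finditer(rf"^{prefix}-(\d+)\t(.*)$", out, re.M)}

    srcs, refs, hyps = grab("S"), grab("T"), grab("H")
    ids = sorted(hyps)
    return [srcs[i] for i in ids], [refs[i] for i in ids], [hyps[i] for i in ids]


def compute_sari(sources, hypotheses, references):
    """Placeholder: plug in a SARI implementation of your choice here."""
    raise NotImplementedError


best = None
for ckpt in sorted(glob.glob("checkpoints/checkpoint*.pt")):
    sources, references, hypotheses = generate(ckpt)
    sari = compute_sari(sources, hypotheses, references)
    print(f"{ckpt}\tSARI={sari:.2f}")
    if best is None or sari > best[1]:
        best = (ckpt, sari)

print("Best checkpoint by SARI:", best)
```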
Thanks for the suggestions @huihuifan! The solution of using a separate script is what I've been using. It works, but I was curious whether there was a more 'elegant' way of implementing it in the code. I guess directly hacking the code is the only way to go.
I think you'll need to make the slightly larger changes to be more elegant; the hardcoding of validation loss and other metrics will need to be changed.
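To make the "slightly larger change" concrete, here is a rough sketch (not an official recipe, and hook names can differ between fairseq versions) of a custom task that logs a SARI value during validation; once the value appears in the validation stats it can be selected with --best-checkpoint-metric sari --maximize-best-checkpoint-metric. The _generate_for_sample helper and compute_sari are assumptions you would have to implement yourself.

```python
# Rough sketch only: assumes you implement _generate_for_sample() and
# compute_sari() yourself, and that your fairseq version exposes
# fairseq.metrics (newer versions use fairseq.logging.metrics).
from fairseq import metrics
from fairseq.tasks import register_task
from fairseq.tasks.translation import TranslationTask


def compute_sari(sources, hypotheses, references):
    """Placeholder for a SARI implementation of your choice."""
    raise NotImplementedError


@register_task("translation_with_sari")
class TranslationWithSariTask(TranslationTask):

    def valid_step(self, sample, model, criterion):
        loss, sample_size, logging_output = super().valid_step(sample, model, criterion)
        # Hypothetical helper: decode this validation batch to get
        # (sources, hypotheses, references) as lists of strings.
        srcs, hyps, refs = self._generate_for_sample(model, sample)
        logging_output["sari"] = compute_sari(srcs, hyps, refs) * sample_size
        return loss, sample_size, logging_output

    def reduce_metrics(self, logging_outputs, criterion):
        super().reduce_metrics(logging_outputs, criterion)
        sari_sum = sum(log.get("sari", 0) for log in logging_outputs)
        sample_size = sum(log.get("sample_size", 0) for log in logging_outputs)
        # Anything logged here shows up in the validation stats, which is what
        # --best-checkpoint-metric reads when deciding checkpoint_best.pt.
        metrics.log_scalar("sari", sari_sum / max(sample_size, 1), sample_size, round=2)
```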
Hello @feralvam, I have the same problem as you. Could you share more about "the solution of using a separate script" that you've been using, for the case where the metric is SARI instead of BLEU?
Hi,
Currently, the "best" model at validation time is chosen according to the value of the "loss". However, I would like it to be chosen using another metric (a custom one that could behave like BLEU, for example). What would be the best way to implement this?
I noticed there is the best-checkpoint-metric parameter for fairseq-train, but I am unsure about where this new function should be implemented so that it can be used by the trainer. The only example I could find is for fine-tuning RoBERTa on GLUE using accuracy. But there, accuracy is hardcoded in train.py (as far as I could tell). In addition, save_checkpoint in checkpoint_utils.py has "val_loss" hardcoded, too. Does this mean that I would need to change the code of core modules to incorporate the new validation metric?
Thanks for any guidance you could provide.