miracle24 changed the title from "The performance of att2in2 with scst bothers me." to "Ask for advice for att2in2 under scst training" on Apr 12, 2018.
Hi. I trained the att2in2 model with the default settings, but I got a lower CIDEr score than the one reported in https://github.com/ruotianluo/ImageCaptioning.pytorch/issues/10.
Here is my result compared with the one in https://github.com/ruotianluo/ImageCaptioning.pytorch/issues/10:

| Metric  | My result | Issue #10 |
|---------|-----------|-----------|
| Bleu_1  | 0.796     | 0.777     |
| Bleu_2  | 0.622     | 0.613     |
| Bleu_3  | 0.471     | 0.465     |
| Bleu_4  | 0.351     | 0.347     |
| ROUGE_L | 0.561     | 0.560     |
| CIDEr   | 1.118     | 1.156     |
And this is how I trained the model:
(1) I pretrained att2in2 for 25 epochs with the same settings (the same spatial image features, the same batch size, scheduled sampling starting from 0, the same learning rate decay, and so on), and I obtained results comparable to yours.
(2) I then trained it with SCST for another 35 epochs. The learning rate was fixed at 5e-5. The cache used for computing CIDEr is coco-train-idxs.
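For reference, the self-critical update in step (2) reduces to a REINFORCE loss with the greedy caption's reward as the baseline. Below is a minimal, self-contained sketch of that loss; the function name and the scalar-reward interface are illustrative placeholders, not the repo's actual code (in the repo, the rewards would come from the CIDEr scorer backed by the coco-train-idxs cache):

```python
# Hedged sketch of the SCST (self-critical) policy-gradient loss for one
# caption. Names and the reward interface are assumptions for illustration.

def scst_loss(sample_logprobs, sample_reward, greedy_reward):
    """Self-critical loss for a single sampled caption.

    sample_logprobs: per-token log-probabilities of the sampled caption
    sample_reward:   scalar reward (e.g. CIDEr) of the sampled caption
    greedy_reward:   scalar reward of the greedy-decoded caption,
                     used as the baseline
    """
    advantage = sample_reward - greedy_reward
    # REINFORCE with a greedy baseline: tokens of samples that beat
    # greedy decoding are pushed up, the others are pushed down.
    return -advantage * sum(sample_logprobs)
```

When the sampled caption scores above the greedy one the advantage is positive and the loss rewards its tokens; when it scores below, the sign flips. This is why the choice of reward metric (CIDEr only, in the default setup) can trade off against the other metrics.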
Compared with your result, the CIDEr is worse but the other metrics are better. This bothers me a little and makes me doubt my experiment settings.
Is there any trivial detail I might have missed?
I wonder whether the scheduled sampling used in the pretrained model affects the exploration during the RL stage, but I have not tried disabling it yet. Any advice would be appreciated. Thanks a lot.
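To make the scheduled-sampling question above concrete, the probability of feeding the model its own previous prediction (instead of the ground-truth token) typically increases linearly with the epoch up to a cap. The sketch below shows that schedule; the specific increment, interval, and cap are placeholder assumptions, not necessarily the repo's defaults:

```python
# Hedged sketch of a linearly increasing scheduled-sampling probability.
# start/increase_every/increase_prob/max_prob are illustrative values.

def scheduled_sampling_prob(epoch, start=0.0, increase_every=5,
                            increase_prob=0.05, max_prob=0.25):
    """Probability of replacing the ground-truth token with the model's
    own previous prediction at each decoding step during pretraining."""
    return min(start + (epoch // increase_every) * increase_prob, max_prob)
```

If this probability is already high by the end of pretraining, the model has been partially trained on its own samples, which could plausibly change how much the SCST sampling stage still explores.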