what is the learning rate when start self-critical training? #26

siyilingting · 2017-12-19T13:53:46Z

Hi, ruotian. Thank you for your job. I am a bit confused about the learning rate (5e-5) for self-critical training.
First, I train without self-critical, set --learning_rate 5e-4 --learning_rate_decay_start 0 --scheduled_sampling_start 0 --max_epochs 30 as you write. Then after 30 epochs I train using self-critical with the setting --learning_rate 5e-5 --start_from log_fc_rl, no learning_rate_decay_start and scheduled_sampling_start.
In the train.py, I find that

self-critical.pytorch/train.py

Line 82 in 275e22c

    
           if vars(opt).get('start_from', None) is not None and os.path.isfile(os.path.join(opt.start_from,"optimizer.pth")):

if vars(opt).get('start_from', None) is not None and os.path.isfile(os.path.join(opt.start_from,"optimizer.pth")): optimizer.load_state_dict(torch.load(os.path.join(opt.start_from, 'optimizer.pth')))

.
So does this mean that the learning_rate 5e-5 are discarded, and it's unnecessary for us to set this parameter when we train using self-critical?

The text was updated successfully, but these errors were encountered:

ruotianluo · 2017-12-19T14:54:57Z

The learning rate will be updated according to the argument.

siyilingting · 2017-12-19T15:08:42Z

Thank you. But I am still puzzled which learning rate are used in self-critical, the 5e-5 or the one stored in optimizer.pth.

ruotianluo · 2017-12-19T16:15:17Z

I take it back. After reviewing my code, you are correct. This is definitely a bug. The learning rate will be what is saved in the optimizer.

ruotianluo · 2017-12-19T16:18:28Z

I fix it in the latest commit. Thank you for pointing this out.

…tical_bottom_up * commit '510b0e02d1fcf43a5281bebb147ca1bce5db45f1': Fix a typo in FCModel _sample Remove att2in dependency and fix typo Fix #26. fix bug when lang_stats not set to 1 Only initialize cider_score at the first time.

The code was using the learning_rate from optimizer.pth after starting self critical training.

…tical_bottom_up * commit '510b0e02d1fcf43a5281bebb147ca1bce5db45f1': Fix a typo in FCModel _sample Remove att2in dependency and fix typo Fix ruotianluo#26. fix bug when lang_stats not set to 1 Only initialize cider_score at the first time.

ruotianluo closed this as completed Dec 19, 2017

ruotianluo reopened this Dec 19, 2017

ruotianluo closed this as completed in 3601e9c Dec 19, 2017

fearless77 mentioned this issue Jul 31, 2018

Got an error during training #51

Closed

linzhlalala pushed a commit to linzhlalala/self-critical.pytorch that referenced this issue Feb 23, 2021

Fix ruotianluo#26.

486dca0

The code was using the learning_rate from optimizer.pth after starting self critical training.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

what is the learning rate when start self-critical training? #26

what is the learning rate when start self-critical training? #26

siyilingting commented Dec 19, 2017 •

edited

Loading

ruotianluo commented Dec 19, 2017

siyilingting commented Dec 19, 2017

ruotianluo commented Dec 19, 2017

ruotianluo commented Dec 19, 2017

what is the learning rate when start self-critical training? #26

what is the learning rate when start self-critical training? #26

Comments

siyilingting commented Dec 19, 2017 • edited Loading

ruotianluo commented Dec 19, 2017

siyilingting commented Dec 19, 2017

ruotianluo commented Dec 19, 2017

ruotianluo commented Dec 19, 2017

siyilingting commented Dec 19, 2017 •

edited

Loading