DeepNet in Pytorch #23

Goutam-Kelam · 2018-09-07T05:43:54Z

Hi, I am trying to implement the DeepNet architecture in pytorch. The code seems to work fine but the result are not as expected. I have done as per the protext files which are provided in the issue 3 and 9. You can find my implementation in https://github.com/Goutam-Kelam/Visual-Saliency/tree/master/Deep_Net. It would be helpful if you can tell me where my mistake lies.
Thankyou in advance

kevinmcguinness · 2018-09-11T14:42:32Z

Hi @Goutam-Kelam. Are you initializing the first three layers from VGG-M? You might also need to adjust the learning rates or train for longer -- losses tend to be implemented differently in different frameworks.

Goutam-Kelam · 2018-09-11T16:05:14Z

In one of my trials i had initialized the weights with VGG-M as mentioned and happened to train the network for about 15 epochs. The avg loss didn't change much and it was approximately 4. I had reduced the learning rate by half every 2000 iterations as mentioned in your protext file. When the results were not as expected I manually initialized the weights of the layers as it was displayed in the protext file. I am not sure where I am messing up. Could be it the prediction stage itself but there also I did the exact post processing as described by you in one of the earlier issues.

Goutam-Kelam · 2018-09-11T16:11:14Z

kevinmcguinness · 2018-09-11T16:13:47Z

You need to transfer weights from VGG M to get good results if I recall correctly. You might need to play around with the initial learning rate too. Try setting it close to as high as you can initially without causing divergence, then only reduce when train loss flattens off for a few epochs.

Goutam-Kelam · 2018-09-11T16:19:47Z

I had kept the LR at 1.3e-7. So are you suggesting i should keep it to somewhere near E-5 and train it for few epochs say 5 and then reduce it every 2000 iterations.

prashnani · 2019-10-29T11:23:09Z

Hi @Goutam-Kelam : is this pytorch implementation stable now? Would like to try it.
Also, @kevinmcguinness : would you please provide the training prototxt that shows the loss function and the prototxt with hyperparameters (learning rate) as starting points? I am interested in reproducing the training results as well.

Cony-Atalas · 2020-11-19T14:23:56Z

Hi @Goutam-Kelam : is this pytorch implementation stable now? Would like to try it.
Also, @kevinmcguinness : would you please provide the training prototxt that shows the loss function and the prototxt with hyperparameters (learning rate) as starting points? I am interested in reproducing the training results as well.

Hi , i would like to try this pytorch implementation . Is it stable now? Thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DeepNet in Pytorch #23

DeepNet in Pytorch #23

Goutam-Kelam commented Sep 7, 2018

kevinmcguinness commented Sep 11, 2018

Goutam-Kelam commented Sep 11, 2018

Goutam-Kelam commented Sep 11, 2018

kevinmcguinness commented Sep 11, 2018

Goutam-Kelam commented Sep 11, 2018

prashnani commented Oct 29, 2019

Cony-Atalas commented Nov 19, 2020

DeepNet in Pytorch #23

DeepNet in Pytorch #23

Comments

Goutam-Kelam commented Sep 7, 2018

kevinmcguinness commented Sep 11, 2018

Goutam-Kelam commented Sep 11, 2018

Goutam-Kelam commented Sep 11, 2018

kevinmcguinness commented Sep 11, 2018

Goutam-Kelam commented Sep 11, 2018

prashnani commented Oct 29, 2019

Cony-Atalas commented Nov 19, 2020