Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add code and models #1

Open
impredicative opened this issue Oct 23, 2018 · 14 comments
Open

Add code and models #1

impredicative opened this issue Oct 23, 2018 · 14 comments

Comments

@impredicative
Copy link

impredicative commented Oct 23, 2018

This issue merely serves as a tracker. Please update it when the code and models are added for https://arxiv.org/abs/1810.08425

@zimenglan-sysu-512
Copy link

hi @KimSoybean
do you still have the plan to release the code and model?
thanks.

@hust-kevin
Copy link

@KimSoybean in your paper, batchsize=128 means 128 imgs per gpu or total batch size on 4 gpu?

@KimSoybean
Copy link
Owner

@zimenglan-sysu-512 Hi, we will release the codes and models. But we will release them after the announcement of CVPR2019. I recommend you to reproduce the code on mmdetection, which is much faster than SSD-caffe. (Our method is so easy)

@KimSoybean
Copy link
Owner

@hust-kevin Hi, the accumulate batchsize is 128, in our code for Root-ResNet34, we put 16 images on each GPU (totally 4 GPUs), and the gradient is updated by the average of double iterations. You can see the source code in SSD-caffe.

@hust-kevin
Copy link

@KimSoybean thanks, I will try it on mmdetection, I have only 1 GPU. if I set 16 images per GPU,should I change lr to 0.05/8, what's more, how to updated gradient by the average of double iterations in pytorch?

@KimSoybean
Copy link
Owner

@hust-kevin I think lr=0.03 is better, but it depends on your loss curves. If you have only one GPU, I recommend you to use GN instead of BN and train more epochs (I set totally 44 epochs on ssd of mmdetection ).

@hust-kevin
Copy link

thanks

@dby2017
Copy link

dby2017 commented Mar 18, 2019

Do you still have the plan to release the code and model?
thanks.

@KimSoybean
Copy link
Owner

@dby2017 Recently we have trained the 512x512 models on mmdetection which took a long time. We will release codes and models when we finish it.

@liuchang8am
Copy link

Can the idea generalise to other base networks? E.g., resnet 18 layers. I tried the paper's idea by adding BN after each conv layer, and train from scratch with lr=0.01 and lr=0.05, however I'm getting worse results.

@pyupcgithub
Copy link

@KimSoybean where can i get the pretrained model VGG-16-BN ?

@liuchang8am
Copy link

any updates?

@E-Dreamer-LQ
Copy link

Do you still have the plan to release the code and model?
thanks.

@KimSoybean
Copy link
Owner

@Forgetmyfairy I have just finished my graduation thesis for undergraduates. This weekend I will upload them.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants