Allow arbitrary number of input channels in ResNet encoder (not only RGB) #56

Open
4 tasks
daniel-j-h opened this issue Jun 29, 2018 · 2 comments

Comments

@daniel-j-h
Collaborator

With #46 we are changing our model architecture from training the encoder and decoder from scratch to using a pre-trained ResNet for the encoder. The pre-trained ResNet's input layer only accepts three channels (RGB), though.

We need to be able to add arbitrary channels, say, RGB + water mask + elevation + lidar. To do this we need to construct a wrapper module that extends the ResNet architecture, copies the pre-trained weights over, and initializes the weights for the new channels with zero. In addition, the channel-wise mean and std dev need to be adapted.
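
A minimal sketch of the weight-copying idea in PyTorch, not a finished design. The helper name `extend_input_channels` is hypothetical, and `rgb_conv` is a randomly initialized stand-in shaped like a ResNet stem convolution (3 input channels, 64 filters, 7x7 kernel) rather than real pre-trained weights. The sketch assumes the RGB channels come first and that the convolution is not grouped.

```python
import torch
import torch.nn as nn


def extend_input_channels(conv: nn.Conv2d, num_channels: int) -> nn.Conv2d:
    """Return a new conv accepting `num_channels` inputs (hypothetical helper).

    Weights for the original input channels are copied over; weights for
    the additional channels are set to zero.
    """
    if conv.groups != 1:
        raise ValueError("this sketch only handles non-grouped convolutions")
    if num_channels < conv.in_channels:
        raise ValueError("num_channels must be >= the original in_channels")

    extended = nn.Conv2d(
        num_channels,
        conv.out_channels,
        kernel_size=conv.kernel_size,
        stride=conv.stride,
        padding=conv.padding,
        dilation=conv.dilation,
        bias=conv.bias is not None,
    )

    with torch.no_grad():
        # Start from all zeros, then copy the existing filters into place.
        extended.weight.zero_()
        extended.weight[:, : conv.in_channels] = conv.weight
        if conv.bias is not None:
            extended.bias.copy_(conv.bias)

    return extended


# Stand-in for the first convolution of a pre-trained ResNet (assumption:
# in a real model this would be the encoder's existing first conv layer).
rgb_conv = nn.Conv2d(3, 64, kernel_size=7, stride=2, padding=3, bias=False)

# RGB + water mask + elevation + lidar = 6 input channels.
wide_conv = extend_input_channels(rgb_conv, 6)
```

With a torchvision ResNet the first convolution is the `conv1` attribute, so the wrapper could presumably swap it out with `model.conv1 = extend_input_channels(model.conv1, 6)`. Because the new weights are zero, the extended layer initially produces the same output as the RGB-only layer.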

Tasks

  • Figure out how to extend the ResNet input channels
  • Figure out how to copy over the pre-trained ResNet parts
  • Let users construct a model with arbitrary channels
  • Adapt mean and std dev
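
For the last task, one possible approach (an assumption, not something decided in this issue) is to keep the ImageNet statistics the pre-trained ResNet was trained with for the RGB channels and to compute mean and std dev from our own data for the additional channels. The function names `channel_stats` and `normalize` are hypothetical, and the sketch assumes float images shaped `(N, C, H, W)` with RGB first and values scaled to `[0, 1]`.

```python
import torch

# Commonly used ImageNet statistics for torchvision's pre-trained models.
IMAGENET_MEAN = torch.tensor([0.485, 0.456, 0.406])
IMAGENET_STD = torch.tensor([0.229, 0.224, 0.225])


def channel_stats(images: torch.Tensor, keep_rgb: bool = True):
    """Compute channel-wise mean and std dev over a batch of images.

    If `keep_rgb` is set, the first three entries are replaced by the
    ImageNet statistics so the pre-trained RGB filters see familiar inputs.
    """
    num_channels = images.shape[1]
    # Flatten everything except the channel dimension: (C, N * H * W).
    flat = images.transpose(0, 1).reshape(num_channels, -1)
    mean = flat.mean(dim=1)
    std = flat.std(dim=1)

    if keep_rgb:
        mean[:3] = IMAGENET_MEAN
        std[:3] = IMAGENET_STD

    return mean, std


def normalize(images: torch.Tensor, mean: torch.Tensor, std: torch.Tensor,
              eps: float = 1e-6) -> torch.Tensor:
    """Normalize each channel; `eps` guards against constant channels."""
    return (images - mean[None, :, None, None]) / (std[None, :, None, None] + eps)


# Example: 4 random images with 5 channels (RGB + 2 extra).
images = torch.rand(4, 5, 8, 8)
mean, std = channel_stats(images)
normalized = normalize(images, mean, std)
```

In practice the statistics for the extra channels would be computed once over the whole training set rather than per batch.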
@mikoontz

mikoontz commented Sep 23, 2018

Is there any concern with initializing channels with 0 instead of a value that implies 'missing'? Or is this just me misunderstanding the process?

Looking forward to seeing this develop, regardless!

@daniel-j-h
Collaborator Author

If you look at Kaggle competitions with multi-spectral data, or more channels in general, most solutions either initialize the additional channels with zero, initialize them randomly, or copy the RGB channels over. I've seen zero initialization work best in most winning solutions.
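
To make the three options concrete, here is a rough sketch operating directly on a first-layer weight tensor of shape `(out_channels, in_channels, kH, kW)`. Note that the zeros are filter weights, not data values: a zero weight means the network ignores the new channel at the start of training and learns to use it from there. The function name `init_extra_channels`, the mode names, and the `0.01` std dev for the random case are all illustrative assumptions.

```python
import torch


def init_extra_channels(weight: torch.Tensor, num_rgb: int = 3,
                        mode: str = "zero") -> torch.Tensor:
    """Initialize, in place, the filters for channels after the RGB ones.

    The first `num_rgb` input channels are assumed to already hold the
    pre-trained filters and are left untouched.
    """
    with torch.no_grad():
        extra = weight[:, num_rgb:]  # view on the additional input channels
        if mode == "zero":
            extra.zero_()
        elif mode == "random":
            # Small Gaussian noise; the std dev is an arbitrary choice here.
            extra.normal_(mean=0.0, std=0.01)
        elif mode == "copy_rgb":
            # Reuse the RGB filters cyclically: R, G, B, R, ...
            for i in range(extra.shape[1]):
                extra[:, i] = weight[:, i % num_rgb]
        else:
            raise ValueError(f"unknown mode: {mode}")
    return weight


# Example: 8 filters, 5 input channels (RGB + 2 extra), 3x3 kernels.
weight = torch.randn(8, 5, 3, 3)
init_extra_channels(weight, mode="zero")
```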
