@ziniuwan
1: You mentioned "Use the last checkpoint of stage 1 to initialize the model and start training stage 2."
However, the code contains the comment "# We empirically choose not to load the pretrained decoder weights from stage1 as it yields better performance.", so the pretrained decoder weights from stage 1 are not used. Moreover, MODEL.ENCODER.BACKBONE is "cnn" in stage 1 but "ste" in stage 2, so the pretrained encoder weights from stage 1 are not used either. If neither the encoder nor the decoder weights are reused, why is stage 1 pre-training still necessary?
2: What does the following operation in vision_transformer.py mean? I think it is similar to `x = x + self.pos_embed`:
`x = x.reshape(-1, seqlen, N, C) + self.temp_embed[:,:seqlen,:,:]`
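For context, here is a minimal NumPy sketch of what that line appears to do (all shapes and the `max_len` name are assumptions, not taken from the repo): the flattened batch of frames is reshaped to expose a time axis, and a learned temporal embedding, sliced to the current sequence length, is broadcast-added across the batch and spatial-token dimensions — analogous to how `self.pos_embed` is added over the spatial token dimension.

```python
import numpy as np

# Assumed shapes (hypothetical): B=2 clips, seqlen=4 frames,
# N=49 spatial tokens per frame, C=8 channels.
B, seqlen, N, C = 2, 4, 49, 8
max_len = 16  # assumed max sequence length the embedding was allocated for

# Frames arrive flattened as (B*seqlen, N, C), i.e. a batch of images.
x = np.random.randn(B * seqlen, N, C)

# Learned temporal embedding, one vector per frame index: (1, max_len, 1, C).
temp_embed = np.random.randn(1, max_len, 1, C)

# Reshape to (B, seqlen, N, C), then broadcast-add (1, seqlen, 1, C):
# every frame at time index t, in every clip and at every spatial token,
# receives the same temporal offset temp_embed[0, t, 0, :].
out = x.reshape(-1, seqlen, N, C) + temp_embed[:, :seqlen, :, :]

print(out.shape)  # (2, 4, 49, 8)
```

So where `pos_embed` tells the transformer *where* a token is within a frame, this adds a per-frame embedding telling it *when* the frame occurs in the sequence.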