Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Visualization results are confusing. 🤔 #13

Open
LMD0311 opened this issue Jan 4, 2024 · 1 comment
Open

Visualization results are confusing. 🤔 #13

LMD0311 opened this issue Jan 4, 2024 · 1 comment

Comments

@LMD0311
Copy link

LMD0311 commented Jan 4, 2024

Thank you for your inspiring work. I try to reproduce the results, the avg val IoU is 26.15, and the avg val mIoU is 16.72, which are similar with the results from paper.

However, my visualization results make me confused. During the training, the Transformer get 1st-15th frames as inputs and predict 2nd-16th frames. Here are some visualization results.

  • 2nd GT:
    image
  • 3rd GT:
    image
  • 16th GT:
image
  • 2nd Predict:
image
  • 3rd Predict:
image
  • 16th Predict:
image

The results are confusing. Even the reconstruction of 2nd and 3rd frames are not satisfying, and I cannot find any connection between them. @wzzheng Could authors provide any help?

@LMD0311 LMD0311 changed the title Visualizations Visualization results are confusing. 🤔 Jan 7, 2024
@LMD0311
Copy link
Author

LMD0311 commented Jan 7, 2024

Thank you for your inspiring work. I try to reproduce the results, the avg val IoU is 26.15, and the avg val mIoU is 16.72, which are similar with the results from paper.

However, my visualization results make me confused. During the training, the Transformer get 1st-15th frames as inputs and predict 2nd-16th frames. Here are some visualization results.

  • 2nd GT:
    image
  • 3rd GT:
    image
  • 16th GT:
image * 2nd Predict: image * 3rd Predict: image * 16th Predict: image The results are confusing. Even the reconstruction of 2nd and 3rd frames are not satisfying, and I cannot find any connection between them. @wzzheng Could authors provide any help?

Visualization code comes from https://github.com/wzzheng/TPVFormer/blob/main/visualization/vis_frame.py.

Since the visualized GT is quite reasonable, I guess my visualization code works fine.

BTW, the predict Occ I chose comes from pred of

pred = z_q_predict.argmax(dim=-1).detach().cuda()

the GT Occ I chose comes from output_dict['target_occs'] of
output_dict['target_occs'] = x[:, self.offset:]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant