-
Notifications
You must be signed in to change notification settings - Fork 277
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why newfc extend AttModel? #128
Comments
replace l770 in AttModel.py with return fc_feats, att_feats, att_feats, att_masks. I fixed it locally. |
Thanks for your reply. It works. |
Basically treat fc model as a special case of attmodel where there is no attention. So that the fc model can reuse all the functions of attmodel. Basically to reduce redundant code. |
@ruotianluo You mean the newfc is like show tell and attmodel like show attend tell? I want to train show tell and show attend tell model as experiment result compare. Any suggestions? |
FC is what's in self. Critical sequence training paper. It's not exactly show and tell but quite similar. Same for att2in which can be viewed as a variant for show attend and tell |
Got it. That‘s different and will be treat as different compare methods. So, how to run train the original show tell and show attend tell ? (I think it ’s important as a baseline to compare.) |
If you really want to compare, you can try running it and fix all the errors. |
Thank you for your repo, Here is my question
when I eval newfc, there is some error
and I found that newfc extends AttModel, but in initialization, we do not read att feats for newfc as in misc.utils.py
So could you please tell me why newfc extend AttModel ?
The text was updated successfully, but these errors were encountered: