-
Notifications
You must be signed in to change notification settings - Fork 88
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can't get 100% accuracy in Sub-Goal evaluation with ground-truth actions and masks. #19
Comments
Hi, thanks for replying. This is the index errors I got.
I've found that an agent takes more actions for a subgoal than the validation set has. + I've validated a model with GT actions and masks in the same way above, and I didn't get 100% SR nor PC in both seen and unseen validation sets. In seen validation set, In unseen validation set, I think in #7 three trajectories are missed in validation, but I include the whole validation set (820 for seen, and 821 for unseen). |
@bhkim94 something seems strange. If there are 19 low-level actions in the expert trajectory, I don't know why the agent takes 23 actions when you are simply replaying the expert actions. Could you share your fork so I can get a better understanding of what's happening? |
This is the fork, https://github.com/bhkim94/alfred There are only changes in eval_task.py and eval_subgoals.py with marked with # in the codes.
but the index error itself is not the main point because the incompletion of a subgoal as stated in a validation trajectory causes the index errors. |
@bhkim94 sorry about the delay. Will take a look at this soon. Meanwhile, you can just ignore these 5 trajectories, while I try to figure out what's happening. Since the SR is 99.8% anyway, this issue shouldn't hinder your progress. |
@bhkim94, it seems like the issue is being caused by some non-deterministic behavior in AI2THOR. For instance, when We will take a look at fixing this, but it needs to be done on the simulator side. Fortunately, it's a rare occurrence, so you still get 99.8% SR. This shouldn't affect your modeling progress for now. Thanks again for pointing this out! |
I'm trying to produce Sub-Goal evaluation results with ground-truth actions and masks.
But I got index-out-of-bound errors and couldn't get 100% PLW for some trajectories in seen and unseen validation sets. (both SR and PLW should be 100% as it's evaluated with ground truths.)
This is the changes I made only in eval_subgoals.py, (line 69 and 128)
If the changes are correct to implement ground-truth actions and masks, is there any idea why I can't get 100% PLW?
And I don't understand why I got index-out-of-bound errors with ground-truth trajectories.
Thanks for replying!
The text was updated successfully, but these errors were encountered: