-
Notifications
You must be signed in to change notification settings - Fork 165
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Episode length, ActionHead, Data collect, Next state as action #120
Comments
Non-author here but hopefully I can help.
Should work fine! Have a look at
I have also found that after fine tuning the policy behaves weirdly in response to irrelevant changes in the background of the image. Please share any additional findings if you figure this out! I am not sure how to advise on your action space.
Not quite sure what you mean by "trend in changes", but the TFDS pipeline does normalization of the action dimensions (unless you explicitly turn it off), so that should take care of different magnitudes of different action dimensions.
This is a subtle question I think, because it really depends on how your robot's controller behaves and how your teleoperation works. Can you be more specific about "teleoperation isn't very good? what does that mean? For most robot teleoperation cases, the robot doesn't stop and wait at each action because that would be very jerky. Instead, the robot is continuously trying but never succeeding to accomplish the latest action. In this case, using the next state as the action could result in different motion/path than using the original actions recorded during teleop. Personally I would say you should not use next state, but rather use the action recorded during teleop. But I think more details are before I can say anything confidently. A video showing the teleop at 1x speed (not sped up) would be very useful. Hope this is helpful! |
Some issues while trying to replicate your impressive work.
The text was updated successfully, but these errors were encountered: