Inquiry Regarding Discrepancies in Results from Your Published LA3P Code #3

Open
Franklin-19 opened this issue Oct 8, 2024 · 0 comments

Hello, thank you for sharing your work and the code for LA3P!

Our team is working on improving the TD3 reinforcement learning algorithm and is using your implementation as a baseline for comparison. However, we have been unable to replicate the results presented in your paper. Specifically, with both the v2 and v4 versions of the environments, our results are significantly lower than those reported.

For instance, in the HalfCheetah environment, the agent's maximum cumulative return is only around 3,000, and in the Ant environment, performance stays around 2,000, sometimes even falling behind TD3. We're puzzled by this and wonder whether any minor code modifications that are not reflected in the current repository could account for the higher performance in the published results.

Any insights or clarifications would be greatly appreciated, as we plan to cite your approach in our upcoming paper. Thank you in advance for your help!
