-
Notifications
You must be signed in to change notification settings - Fork 313
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Algorithm] Td3 #684
[Algorithm] Td3 #684
Conversation
Hi @BY571! Thank you for your pull request and welcome to our community. Action RequiredIn order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you. ProcessIn order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA. Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with If you have received this in error or have any questions, please contact us at [email protected]. Thanks! |
Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks! |
For the formatting, refer to instructions :) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think you have committed a log file :)
Let me know if there's anything i can do to help -- I can do a review even if the code is still a draft if you'd like
Sure, you can review it already! Just trying to fix the last formatting issue. |
Codecov Report
@@ Coverage Diff @@
## main #684 +/- ##
==========================================
+ Coverage 85.15% 88.83% +3.68%
==========================================
Files 123 124 +1
Lines 21167 21364 +197
==========================================
+ Hits 18024 18978 +954
+ Misses 3143 2386 -757
Flags with carried forward coverage won't be shown. Click here to find out more.
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM thanks for this!
Description
Adding Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm.
Creating:
Performance looks good at the beginning of the training but becomes very unstable at the end. Did try some other parameter settings but could not resolve the instabilities until now.
Motivation and Context
This PR closes an open issue:
close #18
.Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an
x
in all the boxes that apply.If you are unsure about any of these, don't hesitate to ask. We are here to help!