Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimization methods specifically for RL #1273

Closed
wants to merge 12 commits into from
Closed

Optimization methods specifically for RL #1273

wants to merge 12 commits into from

Conversation

teytaud
Copy link
Contributor

@teytaud teytaud commented Oct 22, 2021

Types of changes

  • Docs change / refactoring / dependency upgrade
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)

Motivation and Context / Related issue

Just because it looks like an excellent tool specifically for RL

How Has This Been Tested (if it applies)

Experiments on Open AI Gym and neurocontrol.

Checklist

  • The documentation is up-to-date with the changes I made.
  • I have read the CONTRIBUTING document and completed the CLA (see CLA).
  • All tests passed, and additional code has been covered with new tests.

@facebook-github-bot facebook-github-bot added the CLA Signed Do not delete this pull request or issue due to inactivity. label Oct 22, 2021
@teytaud teytaud requested a review from jrapin October 30, 2021 06:57
Comment on lines +2093 to +2096
MixDeterministicRL = ConfPortfolio(optimizers=[DiagonalCMA, PSO, GeneticDE]).set_name(
"MixDeterministicRL", register=True
)
SpecialRL = Chaining([MixDeterministicRL, TBPSA], ["half"]).set_name("SpecialRL", register=True)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need both?

Suggested change
MixDeterministicRL = ConfPortfolio(optimizers=[DiagonalCMA, PSO, GeneticDE]).set_name(
"MixDeterministicRL", register=True
)
SpecialRL = Chaining([MixDeterministicRL, TBPSA], ["half"]).set_name("SpecialRL", register=True)
SpecialRL = Chaining([ConfPortfolio(optimizers=[DiagonalCMA, PSO, GeneticDE]), TBPSA], ["half"]).set_name("SpecialRL", register=True)

@@ -2760,3 +2767,6 @@ def __init__(
base_optimizer: base.OptCls = NGOpt,
) -> None:
super().__init__(_MSR, locals())


NGOptRL = SpecialRL
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd rather we keep one name only if possible, why have two names for now?
As usual, adding more names means more breaking changes down the roads, so I prefer being careful.

@teytaud
Copy link
Contributor Author

teytaud commented Nov 24, 2021

deprecated by #1303

@teytaud teytaud closed this Nov 24, 2021
@teytaud teytaud deleted the for_rl branch November 24, 2021 10:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed Do not delete this pull request or issue due to inactivity.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants