-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Questions about the number in the paper #5
Comments
Also, which version of D4RL were you using (also in COMBO)? |
@HYDesmondLiu The config file in this repo says they used '-v0' dataset for MOPO. But I'm still curious about the dataset version used in COMBO, is COMBO's source code even released? |
@typoverflow |
Hi, I really appreciate your open source code. My question is how is your performance number reported in the paper.
For example, in Table 1, do you use the max evaluation return during the learning process or use the last evaluation return. The return of the policy has large variance in different iteration.
Thanks,
Yue
The text was updated successfully, but these errors were encountered: