COMBO performance #183
Comments
Same as you.
We confirmed that there is an issue with rollout termination. For now, COMBO is still an experimental feature.
Hello, I'm quite skeptical about the reproducibility of some model-based offline RL algorithms (see tianheyu927/mopo#5). The source code of COMBO is not even released (as far as I know).
Sorry for the inconvenience. Currently, COMBO support is a low priority because I'm working on Transformer architecture support in
Hi, I ran COMBO on the v2 D4RL MuJoCo Gym datasets and found that the performance is surprisingly poor. For example, the normalized score after training COMBO for 1M steps on walker2d-medium-expert-v2 is -0.18, and on halfcheetah-medium-replay-v2 it is 35.98 (the results fluctuated between 30 and 60 over the last few steps). Did I do something wrong?
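For context on how to read these numbers, here is a minimal sketch of the D4RL-style normalization, assuming the usual convention that 0 corresponds to a random policy and 100 to an expert policy. The reference returns below are hypothetical placeholders for illustration, not the actual per-environment values, and `normalized_score` is a name chosen here rather than a library function.

```python
def normalized_score(raw_return: float, random_return: float, expert_return: float) -> float:
    """Scale a raw episode return so that random ~ 0 and expert ~ 100."""
    return 100.0 * (raw_return - random_return) / (expert_return - random_return)


# Hypothetical reference returns (placeholders, not the real D4RL values).
RANDOM_RETURN = 0.0
EXPERT_RETURN = 5000.0

# A raw return below the random reference gives a negative normalized score.
print(normalized_score(-9.0, RANDOM_RETURN, EXPERT_RETURN))
```

Under this convention, a score such as -0.18 would mean the learned policy's average return is slightly below the random-policy reference for that environment.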