Questions about the number in the paper #5

MSRA-COLT · 2020-11-30T09:13:46Z

Hi, I really appreciate your open source code. My question is how is your performance number reported in the paper.

For example, in Table 1, do you use the max evaluation return during the learning process or use the last evaluation return. The return of the policy has large variance in different iteration.

Thanks,
Yue

weihongwei0586 · 2020-12-08T05:34:54Z

Hi, I really appreciate your open source code. My question is how is your performance number reported in the paper.

For example, in Table 1, do you use the max evaluation return during the learning process or use the last evaluation return. The return of the policy has large variance in different iteration.

Thanks,
Yue

I have the same problem, When i run the demo not in mixed, the results has large variance.

HYDesmondLiu · 2022-03-25T18:19:57Z

Also, which version of D4RL were you using (also in COMBO)?
The reason why I ask is that the buffer quality is quite different in v0~v2. (you could refer to the TD3BC paper for details).

typoverflow · 2022-07-29T15:31:00Z

@HYDesmondLiu The config file in this repo says they used '-v0' dataset for MOPO. But I'm still curious about the dataset version used in COMBO, is COMBO's source code even released?
I am also having trouble stabilizing MOPO's performance. The variance of performance across epochs is quite huge.

HYDesmondLiu · 2022-07-29T16:46:49Z

@typoverflow
AFAIK, COMBO source code is not shared. As I recall they use D4RL v2 buffers since the performance between v0 and v2 is quite different. You could easily spot the difference.
"Some" DRL methods are notorious for being unreproducible.
You could refer to this paper and other related research for more information.

typoverflow mentioned this issue Jan 7, 2023

COMBO performance takuseno/d3rlpy#183

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions about the number in the paper #5

Questions about the number in the paper #5

MSRA-COLT commented Nov 30, 2020

weihongwei0586 commented Dec 8, 2020

HYDesmondLiu commented Mar 25, 2022

typoverflow commented Jul 29, 2022

HYDesmondLiu commented Jul 29, 2022

Questions about the number in the paper #5

Questions about the number in the paper #5

Comments

MSRA-COLT commented Nov 30, 2020

weihongwei0586 commented Dec 8, 2020

HYDesmondLiu commented Mar 25, 2022

typoverflow commented Jul 29, 2022

HYDesmondLiu commented Jul 29, 2022