Vd sampler behaves strangely #9
nikohansen opened this issue on May 15, 2017:

In maybe 20% of the runs, we see plots like this:

[plot not preserved in this transcript]

@youheiakimoto
youheiakimoto replied:

It isn't a bug; it is more like a feature.

I suppose the function in question is a rotated cigar. I confirmed that the same behavior is observed with my vdcma code. The problem is observed when the cigar axis is nearly parallel to a coordinate axis, or nearly contained in a subspace spanned by a few basis vectors. If I run vdcma on a diagonally oriented cigar, the strange behavior is less likely to happen. In Figure 2 of reference [1], you find a relatively large standard deviation on cigrot and ellcig.

[1] Y. Akimoto, A. Auger, and N. Hansen. Comparison-Based Natural Gradient Optimization in High Dimension. In Proceedings of GECCO 2014, pp. 373–380, 2014.
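As context, a minimal sketch of the two problem classes discussed above; the function names, dimension, and condition number are illustrative assumptions, not taken from the repository:

```python
import numpy as np

def cigar(x, cond=1e6):
    """Cigar with its long (insensitive) axis along the first coordinate axis."""
    x = np.asarray(x)
    return x[0]**2 + cond * np.sum(x[1:]**2)

def rotated_cigar(x, R, cond=1e6):
    """The same cigar with its long axis rotated away from the coordinate basis."""
    return cigar(R @ np.asarray(x), cond)

n = 10
rng = np.random.default_rng(0)
R, _ = np.linalg.qr(rng.standard_normal((n, n)))  # fixed random rotation matrix
```

With `cigar`, the long axis coincides with a coordinate axis, which is the case where the strange behavior is reported; `rotated_cigar` with a random rotation gives the diagonally oriented case where it is reported to be less likely.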
Thanks, I see. Then I would consider this something of a defect in the algorithm, which might be intrinsic. I guess one way to look at the underlying reason is that unlearning V takes much longer than learning it?
Right. No active unlearning mechanism for V (and D) is implemented, while learning one long axis is very quick thanks to the cumulation.

One major defect of VD-CMA is that once it has learned a wrong long axis (V), it first has to make the vector short, and only then rotate it and make it long again to learn the right axis. This is problematic when the initial step-size is very small: the evolution path first tends to become long in the negative gradient direction, which is orthogonal to the long axis of the function, and so does V. Therefore VD-CMA needs to wait until V becomes sufficiently short.

The same happens for CMA (it learns a wrong axis at the beginning), but CMA doesn't need to wait until this axis becomes short: it learns the right long axis while making the wrong axis short. If we had two vectors V in the VD covariance model, I guess the situation would be better.
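To make the single-long-axis limitation concrete, here is a small numerical sketch assuming the restricted covariance model of VD-CMA from [1], C = D(I + vvᵀ)D; D is set to the identity for simplicity:

```python
import numpy as np

n = 10
D = np.eye(n)                  # diagonal scaling, identity here for simplicity
v = np.zeros(n)
v[0] = 3.0                     # the single learned long direction

C = D @ (np.eye(n) + np.outer(v, v)) @ D
w, Q = np.linalg.eigh(C)       # eigenvalues in ascending order
print(np.round(w, 2))          # one eigenvalue 1 + |v|^2 = 10, all others 1
print(np.round(Q[:, -1], 2))   # the long eigenvector is aligned with v
```

Because I + vvᵀ has exactly one eigenvalue above 1, the model can never hold the old and the new long axis at the same time, which is why v must first shrink before it can be regrown in the right direction.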
I am not so sure about that, because we can observe a very similar effect with a small initial step-size. The effect is prevented with the […]
Given that VD-CMA is largely succeeded by VkD-CMA, I am closing this issue.
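For comparison, the same construction with k direction vectors, roughly the model underlying VkD-CMA (the exact parametrization in VkD-CMA may differ; this is a sketch of the idea):

```python
import numpy as np

n, k = 10, 2
D = np.eye(n)
V = np.zeros((k, n))
V[0, 0], V[1, 1] = 3.0, 2.0    # two independently adaptable long directions

C = D @ (np.eye(n) + V.T @ V) @ D          # V.T @ V == sum_i outer(v_i, v_i)
print(np.round(np.linalg.eigvalsh(C), 2))  # two eigenvalues (10 and 5) stand out
```

With k ≥ 2, a second long axis can be learned while the wrong one is being unlearned, which addresses the waiting problem described above.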