Test results are non-deterministic #42

traversaro · 2023-05-15T08:08:54Z

For example, see this output of two tests runs on the same commit:

The reason for this is that we call np.random, but we do not set the seed, so the test results are different at every run (see https://adamj.eu/tech/2018/01/08/pytest-randomly-history/ and https://towardsdatascience.com/random-seeds-and-reproducibility-933da79446e3). The long term plan may be to implement some kind of way of controlling randomness (for example via https://github.com/pytest-dev/pytest-randomly), but in the short term perhaps the easy fix is to increase the test threshold.

The text was updated successfully, but these errors were encountered:

DanielePucci · 2023-05-15T14:34:53Z

CC @ami-iit/artificial-mechanical-intelligence

Giulero · 2023-05-16T09:02:13Z

Thanks @traversaro! :) Agreed!
I'll open a PR for increasing the tolerance, but I'll plan to address this issue in a more structured way.
P.S. I do not really understand why Jax and PyTorch tests are the only ones failing. Is there some strange interaction between these two frameworks and numpy?

P.P.S. Maybe it's a stupid solution. What if I set np.random.seed(0)? I guess it will always give me the same sequence of random numbers.

traversaro · 2023-05-16T11:27:38Z

P.P.S. Maybe it's a stupid solution. What if I set np.random.seed(0)? I guess it will always give me the same sequence of random numbers.

That for sure should work fine. Reading https://adamj.eu/tech/2018/01/08/pytest-randomly-history/ and similar tests, it seems that people do not like it as you perturb the global state and so you could influence other tests, but for our specific case it should work fine (that is what we do in iDynTree, for example: https://github.com/robotology/idyntree/blob/35b0f76a9db3809384e8ebcbdb7cfb11d2cb7a7b/bindings/python/tests/joints.py#L31 and https://github.com/robotology/idyntree/blob/35b0f76a9db3809384e8ebcbdb7cfb11d2cb7a7b/src/estimation/tests/KalmanFilterUnitTest.cpp#L84).

P.S. I do not really understand why Jax and PyTorch tests are the only ones failing. Is there some strange interaction between these two frameworks and numpy?

I guess that for some reason on some joint configuration the numeric error induced by how these frameworks make the computation is bigger, but it is just an intuition.

Giulero · 2023-05-16T11:44:35Z

That for sure should work fine. Reading https://adamj.eu/tech/2018/01/08/pytest-randomly-history/ and similar tests, it seems that people do not like it as you perturb the global state, and so you could influence other tests, but for our specific case it should work fine

So I could start with this approach and then proceed with a more refined solution.

I guess that for some reason on some joint configuration the numeric error induced by how these frameworks make the computation is bigger, but it is just an intuition.

I suspect the same. I did some tests and it seems that setting torch.set_default_dtype(torch.float64) and config.update("jax_enable_x64", True) (as in #39) along with np.random.seed(42) does the job, and tests don't fail.

Giulero · 2023-05-16T12:20:36Z

Just for a log, the proposed solution in #42 (comment) is implemented in #39.

Giulero mentioned this issue May 16, 2023

Set float64 #39

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test results are non-deterministic #42

Test results are non-deterministic #42

traversaro commented May 15, 2023

DanielePucci commented May 15, 2023

Giulero commented May 16, 2023 •

edited

Loading

traversaro commented May 16, 2023

Giulero commented May 16, 2023 •

edited

Loading

Giulero commented May 16, 2023 •

edited

Loading

Test results are non-deterministic #42

Test results are non-deterministic #42

Comments

traversaro commented May 15, 2023

DanielePucci commented May 15, 2023

Giulero commented May 16, 2023 • edited Loading

traversaro commented May 16, 2023

Giulero commented May 16, 2023 • edited Loading

Giulero commented May 16, 2023 • edited Loading

Giulero commented May 16, 2023 •

edited

Loading

Giulero commented May 16, 2023 •

edited

Loading

Giulero commented May 16, 2023 •

edited

Loading