
TestCorrelation::test_corr fails on some platforms #195

Closed
Tracked by #236
musicinmybrain opened this issue Sep 13, 2021 · 3 comments
Labels: docs/testing:book: Documentation and unit testing

@musicinmybrain (Contributor)

I’m helping to update the python-pingouin package in Fedora Linux. One test, TestCorrelation::test_corr, is failing on some platforms because what should theoretically be a perfect unity correlation is coming out two or three ulps short:

        # Compare BF10 with JASP
        df = read_dataset('pairwise_corr')
        stats = corr(df['Neuroticism'], df['Extraversion'])
        assert np.isclose(1 / float(stats['BF10'].to_numpy()), 1.478e-13)
        # Perfect correlation, CI and power should be 1, BF should be Inf
        stats = corr(x, x)
>       assert stats.at['pearson', 'r'] == 1
E       AssertionError: assert 0.9999999999999997 == 1

This happens when the tests are executed on an aarch64, ppc64le, or s390x platform. (On ppc64le and s390x, the correlation is 0.9999999999999998.)
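To illustrate the magnitude of the discrepancy, here is a small sketch (assuming only NumPy, which the test already uses) showing that the reported values are exactly two and three ulps below 1.0:

```python
import numpy as np

# One ulp (unit in the last place) just below 1.0 is about 1.1e-16,
# so a result "two or three ulps short" matches the values above.
one_ulp_short = np.nextafter(1.0, 0.0)
two_ulps_short = np.nextafter(one_ulp_short, 0.0)
three_ulps_short = np.nextafter(two_ulps_short, 0.0)

print(two_ulps_short)    # 0.9999999999999998 (ppc64le / s390x result)
print(three_ulps_short)  # 0.9999999999999997 (aarch64 result)
print(three_ulps_short == 1.0)            # False: strict equality fails
print(np.isclose(three_ulps_short, 1.0))  # True
```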

What do you think? Does this look like a bug to be fixed somewhere, or should the assertion be loosened to permit a value slightly less than one due to platform-dependent rounding?

@raphaelvallat raphaelvallat added the docs/testing:book: Documentation and unit testing label Sep 15, 2021
@raphaelvallat raphaelvallat self-assigned this Sep 15, 2021
@raphaelvallat (Owner)

Hi @musicinmybrain,

Interesting — thanks for opening the issue and helping with this! I think it is a platform-dependent rounding error, so we should use np.isclose(stats.at['pearson', 'r'], 1) instead of the strict ==.
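A sketch of the loosened check, using the value from the traceback above in place of the DataFrame lookup:

```python
import numpy as np

# Value observed on aarch64 for stats.at['pearson', 'r'] in the failing test.
r = 0.9999999999999997

# Strict equality is too brittle across platforms:
assert not (r == 1)

# np.isclose tolerates platform-dependent rounding
# (default rtol=1e-05, atol=1e-08):
assert np.isclose(r, 1)
```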

Raphael

@musicinmybrain (Contributor, Author)

Makes sense to me. If you like, you could also add a test that stats.at['pearson', 'r'] <= 1, since a high-quality implementation shouldn’t produce correlations outside [-1, 1] no matter how the rounding happens to work out.
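The two checks could be combined into one helper; this is a hypothetical sketch (check_perfect_corr is not part of pingouin), exercised with the values reported above:

```python
import numpy as np

def check_perfect_corr(r):
    """Hypothetical helper: r must be ~1 but may never exceed 1."""
    assert np.isclose(r, 1), "self-correlation should be ~1"
    assert r <= 1, "a well-behaved implementation stays within [-1, 1]"

check_perfect_corr(1.0)                 # exact value: passes
check_perfect_corr(0.9999999999999997)  # aarch64 value: passes
check_perfect_corr(0.9999999999999998)  # ppc64le/s390x value: passes
```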

@raphaelvallat raphaelvallat mentioned this issue Feb 12, 2022
@raphaelvallat (Owner)

Done in 7f5f0cc

See PR v0.5.1: #236

raphaelvallat added a commit that referenced this issue Feb 20, 2022
* Flake8
* Explicit error when y is an empty list in pg.ttest (#222)
* Add keyword arguments in homoscedasticity function (#218)
* Bugfix: rm_anova and mixed_anova changed the dtypes of categorical columns + added observed=True to all groupby (#224)
* Update version number in init and setup
* Use np.isclose for test_pearson == 1 (#195)
* Coverage for try..except scipy fallback
* Fix set_option for pandas 1.4
* Upgraded dependencies for seaborn and statsmodels
* Added Jarque-Bera test in pg.normality (#216)
* Coverage scipy import error
* Use pd.concat instead of frame.append to avoid FutureWarning
* Remove add_categories(inplace=True) to avoid FutureWarning
* GH Discussions instead of Gitter
* Minor doc fix