Speedup logcdf tests #4734

ricardoV94 · 2021-06-03T18:40:41Z

The logcdf tests were running very slowly because they constantly rebuilt the logcdf function.
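A minimal sketch of the build-once idea, not the actual test code from this PR. Here `build_logcdf_fn` is a hypothetical stand-in for the expensive compilation step, and the exponential distribution is used only because its log-CDF has a simple closed form.

```python
import math
from functools import lru_cache

build_count = 0  # counts how often the expensive "build" step actually runs


@lru_cache(maxsize=None)
def build_logcdf_fn(dist_name):
    """Hypothetical stand-in for compiling a logcdf function (assumed expensive)."""
    global build_count
    build_count += 1
    if dist_name != "exponential":
        raise ValueError(f"unsupported distribution: {dist_name}")

    def logcdf(value, rate):
        # log CDF of Exponential(rate): log(1 - exp(-rate * value))
        return math.log1p(-math.exp(-rate * value))

    return logcdf


def check_logcdf(dist_name, points):
    # The function is built on the first call only; later calls hit the cache.
    fn = build_logcdf_fn(dist_name)
    return [fn(value, rate) for value, rate in points]


results = check_logcdf("exponential", [(0.5, 1.0), (1.0, 2.0), (2.0, 0.5)])
check_logcdf("exponential", [(3.0, 1.0)])
print(build_count)  # the build step ran once for both checks
```

The point of the design is that parameter values are passed as inputs to an already-built function, so evaluating many test points never triggers a rebuild.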

Also fixed a couple of failing tests on float32, which were difficult to identify before because setting n_samples=-1 took ages to run.

ricardoV94 · 2021-06-04T08:33:36Z

I am getting a strange issue with some of the logcdf methods which seems to be behind the failing tests:

Setting the parameters to constants gives the correct value:

with pm.Model() as model:
    value = pm.NegativeBinomial('value', mu=0.9, alpha=0.1)

logcdf = logpt(model['value'], cdf=True)
fn1 = model.fastfn(logcdf)

print(fn1({'value': 1.0}))  # -0.14408081390569186

But using shared variables does not

with pm.Model() as model:
    mu = aesara.shared(np.asarray(0.9))
    alpha = aesara.shared(np.asarray(0.1))
    value = pm.NegativeBinomial('value', mu=mu, alpha=alpha)

logcdf = logpt(model['value'], cdf=True)
fn2 = model.fastfn(logcdf)

mu.set_value(0.9)
alpha.set_value(0.1)
print(fn2({'value': 1.0}))  # -0.08777554398474162

~~This doesn't happen with all distributions, and unfortunately those that do have a crazy long graph to be able to compare easily side by side :/~~

Found the culprit: it's the incomplete_beta function:

incomplete_beta(0.1, 2.0, 0.1).eval()   # array(0.86581778)

alpha = aesara.shared(0.1, 'alpha')
beta = aesara.shared(2.0, 'beta')
incomplete_beta(alpha, beta, 0.1).eval()   # array(0.91596645)

import scipy.special
scipy.special.betainc(0.1, 2.0, 0.1)   # 0.8658177758494668
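As an independent cross-check that needs neither Aesara nor SciPy: for b = 2 the regularized incomplete beta function reduces to the closed form I_x(a, 2) = x^a * ((a + 1) - a*x), which follows from integrating t^(a-1) * (1 - t) from 0 to x and dividing by B(a, 2) = 1 / (a * (a + 1)). The helper name below is made up for illustration.

```python
def betainc_b2(a, x):
    """Regularized incomplete beta I_x(a, b) for the special case b = 2.

    Illustrative helper only; it is not part of PyMC3, Aesara or SciPy.
    """
    return x**a * ((a + 1.0) - a * x)


value = betainc_b2(0.1, 0.1)
print(value)  # approximately 0.86581778
```

This agrees with the SciPy value and with the constant-input result, which suggests that the 0.91596645 returned with shared variables is the incorrect one.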

All failing tests (except for the HyperGeometric, which is a different issue) rely on incomplete_beta.

Maybe it's time I finish #4519

ricardoV94 · 2021-06-04T08:54:51Z

Also found some issues with the recent initval changes, as they don't respect the parents' initvals.

This snippet often leads to a ValueError:

with pm.Model() as model:
    mu = pm.Normal('mu', initval=100)
    alpha = pm.HalfNormal('alpha', initval=100, transform=None)
    value = pm.NegativeBinomial('value', mu=mu, alpha=alpha)

model.initial_values
# {mu: array(100., dtype=float32),
#  alpha: array(100., dtype=float32),
#  value: array(1)}

When it doesn't fail, the initval of value (array(1) above) is still far from the expected ~100:

pm.NegativeBinomial.dist(mu=100, alpha=100).eval()
# array(98)

ricardoV94 · 2021-06-04T10:10:45Z

The HyperGeometric test is failing because of this issue in Aesara: pymc-devs/pytensor#461

ricardoV94 · 2021-06-04T16:47:07Z

Closing in favor of #4736

ricardoV94 marked this pull request as draft June 3, 2021 18:41

ricardoV94 added 4 commits June 3, 2021 20:43

Speedup check_logcdf test

6159d72

Speedup check_selfconsistency_discrete_logcdf test

fe67c98

Revert reduced test n_samples due to speed issues

7e8e838

Fix float32 Beta and StudentT logcdf failing tests

5de3f2d

ricardoV94 force-pushed the speedup_logcdf_tests branch from 4e706ac to 5de3f2d Compare June 3, 2021 18:44

This was referenced Jun 4, 2021

Replace incomplete_beta with betainc and speedup logcdf tests #4736

Closed

Issue with recent initval changes #4737

Closed

ricardoV94 closed this Jun 4, 2021

ricardoV94 mentioned this pull request Jul 12, 2021

Replace incomplete_beta with at.betainc and speedup/clean logcdf tests #4857

Merged

ricardoV94 deleted the speedup_logcdf_tests branch September 23, 2021 08:44
