Implement betaincinv and gammainc[c]inv functions #502
Conversation
I was finally able to get my hands back on this, apologies for resuming it so late. I have only implemented the `impl` function so far; gradients are not implemented yet, and I am not sure whether they are needed for these inverse regularized functions. Let me know what can be improved here.
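For context, this is roughly what the `impl`-only stage of a scalar Op looks like in PyTensor; the class and helper names below are assumptions for illustration, not the PR's actual code:

```python
import scipy.special

from pytensor.scalar.basic import ScalarOp, upgrade_to_float


class BetaIncInv(ScalarOp):
    """Inverse of the regularized incomplete beta function (illustrative sketch)."""

    nin = 3  # a, b, y
    nfunc_spec = ("scipy.special.betaincinv", 3, 1)

    def impl(self, a, b, y):
        # Python-level evaluation delegates to SciPy; gradients and C/JAX
        # implementations would be added separately.
        return scipy.special.betaincinv(a, b, y)


betaincinv = BetaIncInv(upgrade_to_float, name="betaincinv")
```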
Codecov Report
Additional details and impacted files:

```diff
@@           Coverage Diff            @@
##             main     #502     +/-  ##
=========================================
  Coverage   80.77%   80.77%
=========================================
  Files         162      162
  Lines       46105    46122     +17
  Branches    11266    11267      +1
=========================================
+ Hits        37240    37255     +15
- Misses       6638     6640      +2
  Partials     2227     2227
```
Thanks for picking it up!
@amyoshino here is a more typical PR to add new scalar/elemwise Ops that you can use as a template: 2dc7591
For JAX dispatching, it seems like these exist in tensorflow-probability: https://www.tensorflow.org/probability/api_docs/python/tfp/substrates/jax/math/betaincinv. We can do something like https://github.com/pymc-devs/pytensor/pull/403/files to provide them in the JAX backend.
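A rough sketch of how the JAX dispatch could look, assuming tensorflow-probability is installed and using a hypothetical `BetaIncInv` scalar Op name and location (the actual registration in the PR may differ):

```python
import tensorflow_probability.substrates.jax as tfp

from pytensor.link.jax.dispatch import jax_funcify
from pytensor.scalar.math import BetaIncInv  # hypothetical Op name/location


@jax_funcify.register(BetaIncInv)
def jax_funcify_BetaIncInv(op, **kwargs):
    def betaincinv(a, b, x):
        # tfp's JAX substrate provides the numerical kernel.
        return tfp.math.betaincinv(a, b, x)

    return betaincinv
```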
Co-authored-by: Ricardo Vieira <[email protected]>
I see I am missing quite a few things, thanks for the guidance!
I have added the first derivatives for the new functions. I have not set up the JAX dispatching yet, but plan to do so in the next push. Thanks for all your guidance, and I hope to improve it soon!
@ricardoV94 Phew, what a journey to get all the tests passing for this PR. I believe it is now all good and ready for review. Let me use this comment to summarize the steps and assumptions used in this PR; I hope it helps the reviewing process.

Overall, I implemented wrappers around the corresponding `scipy.special` functions, and wrote the first derivatives based on references for the derivatives of the inverse regularized incomplete beta and gamma functions.

I wasn't able to implement every derivative, though. This PR will help issue pymc-devs/pymc#6845, which will benefit from the implemented functions for the icdf formulas.
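For reference, the derivative of each inverse with respect to its last argument follows from the inverse function theorem; these are the standard identities, not necessarily the exact form used in the PR. With $w = I^{-1}(a, b; y)$ for the regularized incomplete beta function $I$, and $w = P^{-1}(a; y)$ for the regularized lower incomplete gamma function $P$:

$$\frac{\partial}{\partial y} I^{-1}(a, b; y) = \left(\frac{\partial I(a, b; w)}{\partial w}\right)^{-1} = \frac{B(a, b)}{w^{a-1}\,(1 - w)^{b-1}}$$

$$\frac{\partial}{\partial y} P^{-1}(a; y) = \Gamma(a)\, e^{w}\, w^{1-a}$$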
@amyoshino sorry I still didn't have time to review, just this comment here caught my attention:

Instead of implementing a new Op, let's try to use this form instead?

```python
def beta(a, b):
    return (gamma(a) * gamma(b)) / gamma(a + b)
```

If it proves too unstable, we can try to use the log-form with `gammaln` instead. This saves us from having to maintain another Op and its grads, plus missing C/JAX implementations. We can also add a user-facing helper in pytensor/pytensor/tensor/special.py (lines 739 to 744 in 39bda72).
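A minimal sketch of the two forms being discussed, written with `pytensor.tensor` functions (an illustration of the suggestion, not the PR's final code):

```python
import pytensor.tensor as pt


def beta(a, b):
    # Naive form: fine for moderate arguments, overflows for large ones.
    return (pt.gamma(a) * pt.gamma(b)) / pt.gamma(a + b)


def betaln(a, b):
    # Log-space form: stable for large positive arguments.
    return pt.gammaln(a) + pt.gammaln(b) - pt.gammaln(a + b)


def beta_stable(a, b):
    # exp(betaln) is the form suggested for use inside the scalar gradients.
    return pt.exp(betaln(a, b))
```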
@ricardoV94 enjoy your holiday time, no worries about it. I delayed the progress of this PR for so long that it would make no sense on my end to expect a quick response here 🥲 Nice suggestions! I will make the changes in the code shortly, and for safety I'll try to verify that the proposed solution is not unstable in any way. I will keep you posted on the progress.
@ricardoV94, I investigated the stability of using the proposed form of the beta function. Since the connection between the beta and gamma functions is only valid when both arguments are positive, I split the findings into two cases.

When both arguments are positive:
- The approximation is great (only one tiny difference found over a relatively large range of arguments).
- It becomes unstable when the arguments have "high" values.

When x < 0 and/or y < 0:
- Things start to go a bit wrong if we want to keep consistency between this implementation and SciPy's beta function: results can change sign depending on the argument values.
- The difference can also be large when the values are high (only when one or more arguments are < 0).

Given this information, should we go ahead and use your suggestion? I have already worked on the changes, so whichever path you want to take is good for me. Let me know and I can push it at any time. 😄
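For illustration, a small NumPy/SciPy check of the kind described above (the values here are my own picks, not the ones from the original screenshots):

```python
import numpy as np
import scipy.special as sp


def beta_via_gamma(a, b):
    # Naive form suggested above, valid for positive arguments.
    return sp.gamma(a) * sp.gamma(b) / sp.gamma(a + b)


# Moderate arguments: the two agree closely.
print(beta_via_gamma(2.5, 3.5), sp.beta(2.5, 3.5))

# "High" arguments: gamma overflows to inf and the naive form returns nan,
# while scipy.special.beta stays finite.
print(beta_via_gamma(200.0, 5.0), sp.beta(200.0, 5.0))
```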
That first sign flip doesn't seem too bad, the answer is pretty much 0. I didn't look carefully at the examples that followed. We should think about the stability and range in the context that first motivated these Ops: use in the gradients of this PR. Do the gradients remain stable in a reasonable parameter space of the original functions? Also, the form of beta we use for the gradients need not have anything to do with the one we offer users in the tensor module. For the user-facing one I would probably offer the naive way with gamma and also offer betaln with gammaln.
Aren't negative values some special analytic extension? Or is it standard to support them? Also, for assessing closeness it's common to look not only at the absolute error but also at the relative error. Those 1e+16 values may not be so unreasonable if they correspond to a small relative error.
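To make that point concrete (illustrative numbers, not the ones from the comparison above):

```python
import numpy as np

approx, exact = 1.000001e16, 1.0e16
abs_err = abs(approx - exact)    # 1e10: looks alarming on its own
rel_err = abs_err / abs(exact)   # 1e-6: actually a very close match
print(np.isclose(approx, exact, rtol=1e-5))  # True
```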
My gut feeling is we should use exp(beta_ln) in the scalar grads and offer beta and betaln to users in tensor.math. Let's see if the gradients don't blow up for some reasonable values of the original functions. You can test locally with verify_grad.
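A minimal sketch of that local check, assuming the new Op is exposed as `pt.betaincinv` with SciPy's argument order (a, b, y); the values are arbitrary:

```python
import numpy as np
import pytensor.tensor as pt
from pytensor.gradient import verify_grad

rng = np.random.RandomState(42)

# Check the gradient with respect to the last argument only, with the
# shape parameters held fixed at reasonable values.
verify_grad(
    lambda y: pt.betaincinv(2.0, 3.0, y),
    [np.array(0.3)],
    rng=rng,
)
```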
Thanks for the suggestions and pointers on where to look! I will evaluate all the mentioned points.
Good call, I will check it and be mindful about the objective of the PR.
I will research more about it. Since the function is well defined only for positive values, I imagine it is kind of a special use, but let me find out in which circumstances negative values are used before stating anything.
Most of the errors are sign changes, so the error is close to zero. In a few cases the errors are large, but that only happens when the values are high as well, so I will evaluate whether this is really a concern or whether the large errors happen only where the functions are themselves unusable.
Thank you so much for the hint! I will need some time to check the gradients and become more familiar with how the gradient computation is traced; having a tip on where to start is very valuable 😄
@ricardoV94, after checking the review comments, here is my suggestion:
Given the context of the PR, using the beta function in the gradient computation for the inverse of the regularized incomplete beta function will not present us with negative arguments (they are the parameters of the Beta distribution, which are positive). Moreover, I could not find use cases for negative-valued arguments to the beta function. So, if any use case comes up in the future, we can revise the user-facing helper in the tensor module.
For safety and consistency, I followed the approach of the previously implemented functions.
I have done that as well! Thanks for suggesting it. I just pushed the latest code. I hope everything is good, but if not, do not hesitate to point out improvements; I never tire of improving things and keeping the codebase as neat as possible. 😄
@amyoshino this looks great, I left a tiny tiny comment above. Otherwise it seems good to merge!
Default pre-commit to multi-line
Co-authored-by: Ricardo Vieira <[email protected]>
@ricardoV94 As always, thanks for guiding me through this PR. I learned a lot about the codebase and tests with this one.
My pleasure, and thanks for taking this one on. It will be really useful for the icdf methods!
Motivation for these changes
Closes #413
Implement `betaincinv` and `gammaincinv` functions.

The implementation of these functions will allow us to implement quantile/ICDF functions for distributions like the Beta, Gamma, ChiSquared, and StudentT distributions, being fundamental to the issue pymc-devs/pymc#6845.
Implementation details
Following other `scalar.math` function implementations, the code is extended to call the functions from `scipy.special`. Tests were added to check that the results match the SciPy implementation.
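As a rough usage illustration (assuming the new functions are exposed under `pytensor.tensor` with the same argument order as SciPy):

```python
import numpy as np
import scipy.special as sp
import pytensor
import pytensor.tensor as pt

a, b, y = pt.dscalars("a", "b", "y")
f = pytensor.function([a, b, y], pt.betaincinv(a, b, y))

# The PyTensor Op should match scipy.special.betaincinv.
np.testing.assert_allclose(f(2.0, 3.0, 0.6), sp.betaincinv(2.0, 3.0, 0.6))
```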
Checklist
Major / Breaking Changes
New features
Bugfixes
Documentation
Maintenance