
feat: batched sampling for MCMC #1176

Merged 81 commits on Jul 30, 2024
Conversation

@manuelgloeckler (Contributor) commented Jun 18, 2024

What does this implement/fix? Explain your changes

This pull request aims to implement the sample_batched method for MCMC.

Current problem

  • BasePotential can either "allow_iid" or not. Hence, each batch dimension will be interpreted as IID samples.
    • Replace allow_iid with a mutable attribute (or optional input argument) interpret_as_iid.
    • Remove warning for batched x and default to batched evaluation
  • Refactor all MCMC initialization methods to work with batch dim.
    • resample should break
    • SIR should break
    • proposal should work
  • Add tests to check that the correct samples are returned in each dimension (currently, only shapes are checked)
    • The problem is currently not caught by tests...

The current implementation lets you sample the correct shape BUT outputs the wrong solution. This is because the potential function broadcasts, repeats, and finally sums over the first dimension, which is incorrect.
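The failure mode can be sketched with a toy Gaussian potential. Both functions below are hypothetical illustrations of the two interpretations, not sbi's actual API:

```python
import torch

# Toy Gaussian potential illustrating the IID-vs-batched mismatch.
# Both functions are hypothetical sketches, not sbi's actual API.

def potential_iid(x: torch.Tensor, theta: torch.Tensor) -> torch.Tensor:
    """Interprets the batch dim of x as IID trials: broadcasts, then sums
    over the x batch, coupling all observations into every potential."""
    # x: (batch_x, dim), theta: (batch_theta, dim)
    log_probs = -0.5 * (x.unsqueeze(0) - theta.unsqueeze(1)) ** 2
    return log_probs.sum(dim=(1, 2))  # (batch_theta,)

def potential_batched(x: torch.Tensor, theta: torch.Tensor) -> torch.Tensor:
    """Batched interpretation: the i-th theta is scored only against the
    i-th x; nothing is summed across the batch dimension."""
    return (-0.5 * (x - theta) ** 2).sum(dim=-1)  # (batch,)

x = torch.tensor([[0.0], [10.0]])
theta = torch.tensor([[0.0], [10.0]])
print(potential_iid(x, theta))      # tensor([-50., -50.]) -- both x coupled in
print(potential_batched(x, theta))  # tensor([0., 0.]) -- per-pair potentials
```

With the IID interpretation, each theta is scored against *both* observations, so MCMC chains that should target different posteriors all target the same (joint) one.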

manuelgloeckler and others added 30 commits April 29, 2024 09:04
@gmoss13 (Contributor) commented Jun 27, 2024

I've made some progress now towards this PR, and would like some feedback before I continue.

BasePotential can either "allow_iid" or not.

Given batch_dim_theta != batch_dim_x, we need to decide how to evaluate potential(x, theta). We could return (batch_dim_x, batch_dim_theta) potentials (i.e., every combination), but I am worried this can add a lot of computational overhead, especially when sampling. Instead, the current implementation assumes that batch_dim_theta is a multiple of batch_dim_x (i.e., for sampling, we have n chains in theta for each x). In this case we expand the batch dim of x to batch_dim_theta and match which x goes to which theta. If we are happy with this approach, I'll go ahead and apply it also to the MCMC init_strategy, etc., and make sure it is consistent with other calls.
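A minimal sketch of this matching rule (the helper name and the contiguous-block ordering are assumptions, not the PR's actual code):

```python
import torch

# Assumed helper illustrating the proposed rule: batch_theta must be a
# multiple of batch_x, and each x is paired with a contiguous block of
# batch_theta // batch_x chains. Name and ordering are assumptions.
def match_x_to_theta(x: torch.Tensor, theta: torch.Tensor) -> torch.Tensor:
    batch_x, batch_theta = x.shape[0], theta.shape[0]
    if batch_theta % batch_x != 0:
        raise ValueError("theta batch must be a multiple of x batch")
    chains_per_x = batch_theta // batch_x
    # (batch_x, x_dim) -> (batch_theta, x_dim): repeat each x for its chains
    return x.repeat_interleave(chains_per_x, dim=0)

x = torch.tensor([[1.0], [2.0]])  # 2 observations
theta = torch.zeros(6, 3)         # 3 chains per observation
x_expanded = match_x_to_theta(x, theta)
print(x_expanded.squeeze(-1))     # tensor([1., 1., 1., 2., 2., 2.])
```

After the expansion, potential(x_expanded, theta) reduces to a plain element-wise batched evaluation with matching batch dims.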

Remove warning for batched x and default to batched evaluation
Not sure if we want batched evaluation as the default. I think it's easier to do batched evaluation when sample_batched or log_prob_batched is called, and otherwise assume iid (and warn if batch dim >1 as before).

@gmoss13 gmoss13 requested a review from janfb June 27, 2024 16:04
@manuelgloeckler (Contributor, Author)
Great, it looks good. I like that the choice of IID or not can now be made via the set_x method, which makes a lot of sense.

I would also opt for your suggested option. The question arises because we squeeze the batch_shape into a single dimension, right? With PyTorch broadcasting, one would expect something like (1, batch_x_dim, x_dim) and (batch_theta_dim, batch_x_dim, theta_dim) -> (batch_theta_dim, batch_x_dim), so by squeezing the xs and thetas into 2d one always gets a dimension that is a multiple of batch_x_dim (otherwise it cannot be represented by a fixed-size tensor).

For (1,batch_x_dim,x_dim) and (batch_theta_dim, 1, theta_dim), PyTorch broadcasting semantics would compute all combinations. Unfortunately, after squeezing, these distinctions between cases can no longer be fully preserved.
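For illustration, the all-combinations case under standard PyTorch broadcasting, using a toy potential with assumed shapes:

```python
import torch

# Toy illustration of PyTorch broadcasting computing every (theta, x)
# combination when the batch dims are kept separate (not squeezed).
x = torch.randn(1, 4, 2)      # (1, batch_x_dim, x_dim)
theta = torch.randn(5, 1, 2)  # (batch_theta_dim, 1, theta_dim); dims match here
pairwise = -0.5 * ((x - theta) ** 2).sum(dim=-1)
print(pairwise.shape)         # torch.Size([5, 4]): all combinations
```

Once x and theta are squeezed to 2d, the singleton dims that drive this outer-product behavior are gone, which is exactly why the cases can no longer be distinguished.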

@janfb (Contributor) left a comment
Great effort, thanks a lot for tackling this 👏

I do have a couple of comments and questions. Happy to discuss in person if needed.

Review comments on:
sbi/inference/posteriors/mcmc_posterior.py
sbi/utils/conditional_density_utils.py
sbi/utils/potentialutils.py
sbi/utils/sbiutils.py
tests/posterior_nn_test.py
@gmoss13 (Contributor) commented Jul 19, 2024

Great effort, thanks a lot for tackling this 👏

I do have a couple of comments and questions. Happy to discuss in person if needed.

Thanks for the review! I implemented your suggestions.

An additional point: for posterior_based_potential, indeed we should not allow for iid_x, as this is handled by the PermutationInvariantNetwork. Instead, we now always treat x batches as not IID. If the user tries to set potential.set_x(x, x_is_iid=True) with a PosteriorBasedPotential, we raise an error stating this. I added a few test cases in embedding_net_test.py::test_embedding_api_with_multiple_trials to test whether batches of x are interpreted correctly when we use a PermutationInvariantNetwork.
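The guard described above might look roughly like this (the class name, method signature, and attribute are assumptions, not sbi's exact internals):

```python
# Hypothetical sketch of the guard described above; the class and
# attribute names are assumptions, not sbi's exact internals.
class PosteriorBasedPotential:
    def set_x(self, x, x_is_iid: bool = False) -> None:
        if x_is_iid:
            raise NotImplementedError(
                "Interpreting x as IID is not supported for posterior-based "
                "potentials; IID trials are handled by a permutation-"
                "invariant embedding net instead."
            )
        self._x = x  # always treated as a batch of (non-IID) observations

potential = PosteriorBasedPotential()
potential.set_x([0.5])  # fine: batched, non-IID interpretation
```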

@gmoss13 gmoss13 requested a review from janfb July 19, 2024 15:51
@janfb (Contributor) left a comment
Looks great! I added just a couple of last questions.

Review comments on:
sbi/inference/posteriors/mcmc_posterior.py
tests/embedding_net_test.py
@gmoss13 gmoss13 requested a review from janfb July 30, 2024 08:11
@janfb (Contributor) left a comment
Looks good! Thanks a lot, great effort!

@janfb janfb self-assigned this Jul 30, 2024
@janfb janfb added this to the Hackathon and release 2024 milestone Jul 30, 2024
@janfb janfb added the enhancement New feature or request label Jul 30, 2024
@janfb (Contributor) commented Jul 30, 2024

closes #990
closes #944

@janfb janfb merged commit 81fffcf into main Jul 30, 2024
5 of 6 checks passed
@janfb janfb deleted the amortized_sample_mcmc branch July 30, 2024 09:24