
Alternative tutorial on implementing samplers #468

Merged
merged 14 commits into master on Jun 11, 2024

Conversation

@torfjelde (Member) commented on May 31, 2024

I did a talk on Turing.jl for a statistical audience earlier today and described how it's possible to implement a sampler in AbstractMCMC.jl in such a way that it's very easy to use the resulting sampler to target both Turing.jl and Stan models (through BridgeStan).
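
The gist is that the sampler only needs to implement the `AbstractMCMC.step` interface against the LogDensityProblems.jl abstraction. A minimal sketch of the shape of this (the `MySampler` type here is illustrative, not the tutorial's actual MALA sampler):

```julia
using AbstractMCMC, LogDensityProblems, Random

# Hypothetical sampler type, just to show the shape of the interface.
struct MySampler <: AbstractMCMC.AbstractSampler
    stepsize::Float64
end

# Initial step. `model_wrapper.logdensity` can be anything implementing the
# LogDensityProblems.jl interface, which is why the same sampler can target
# Turing.jl models and Stan models wrapped via BridgeStan.
function AbstractMCMC.step(
    rng::Random.AbstractRNG,
    model_wrapper::AbstractMCMC.LogDensityModel,
    sampler::MySampler;
    kwargs...,
)
    model = model_wrapper.logdensity
    x = randn(rng, LogDensityProblems.dimension(model))
    return x, x  # (sample to report, state to carry forward)
end
```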

Afterwards, I was asked if there was a tutorial or something online outlining how to do this in detail. After having a look at the current docs on how to use `externalsampler`, I felt they were a bit overly complicated and didn't quite get all the points across, so I did a rewrite.

I think this version will be very, very useful for inference researchers who want their methods to become available to a larger user base :)

EDIT: Note that I haven't added an example with StanLogDensityProblems.jl, but maybe we should, as it would be quite nice.

I'm also thinking we should make the example a bit more complicated later in the tutorial, e.g. adding things like "how to test the sampler with MCMCTesting.jl" from @Red-Portal, adding a few more things to the `MALAState`, e.g. an `isaccept` field to indicate whether the proposal was accepted, maybe adding some adaptation, etc. But I think the current version should be merged first, and then we can build on it later 👍
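
For the `MALAState` extension, I mean something as simple as this (a sketch; field names are illustrative):

```julia
# Sketch: a state that also records whether the last proposal was accepted.
struct MALAState{T<:AbstractVector{<:Real}}
    x::T            # current position
    isaccept::Bool  # whether the most recent proposal was accepted
end
```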

Contributor

Preview the changes: https://turinglang.org/docs/pr-previews/468. Please avoid using the search feature and navigation bar in PR previews!

@yebai (Member) commented on May 31, 2024

Let's add this as a new doc instead of overriding an existing one. I am aware of users relying on the Pathfinder and external-sampler examples in the current docs.

@torfjelde (Member, Author) commented

But the current docs are somewhat out of date, and what they describe is too similar to this, IMO. Maybe it makes more sense to add another section at the end of this tutorial demonstrating some examples, where I'll add back the examples from the original?

@torfjelde (Member, Author) commented

> Maybe it makes more sense to add another section at the end of this tutorial demonstrating some examples, where I'll add back the examples from the original?

Happy with this, Hong?

@yebai (Member) commented on Jun 4, 2024

Let's keep both tutorials. This tutorial can be called something else, but it should go in the For developers (Inference) section.

The previous tutorial on external samplers covers the basic mechanism for calling AbstractMCMC-compatible external samplers. This PR can build on that and provide instructions on implementing a new AbstractMCMC sampler.

It's okay to reorganise the docs tutorials later, but I feel we should do that only after making a plan and reviewing all the existing docs.

@torfjelde (Member, Author) commented

Done 👍

@torfjelde (Member, Author) commented

@yebai, note that I've implemented a version of MALA which uses a single leapfrog step rather than the "standard" formulation of MALA. My original intention was to add a final section on a more "complex" version, namely autoMALA. But I'm uncertain whether this is worth it. Should I still pursue this, or just leave it for now?
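
For reference, the single-leapfrog-step formulation I mean is roughly the following (a sketch against the LogDensityProblems.jl interface, not the tutorial's exact code). With momentum `p` drawn fresh from `N(0, I)` and a Metropolis-Hastings correction on the joint `(x, p)` density, one leapfrog step reproduces the usual MALA proposal `x' = x + (ε²/2)∇logp(x) + εξ` with `ξ ~ N(0, I)`:

```julia
using LogDensityProblems

# One leapfrog step of Hamiltonian dynamics with stepsize ε.
function leapfrog_step(model, x, p, ε)
    _, ∇logp_x = LogDensityProblems.logdensity_and_gradient(model, x)
    p_half = p .+ (ε / 2) .* ∇logp_x        # half step for momentum
    x_new = x .+ ε .* p_half                # full step for position
    _, ∇logp_new = LogDensityProblems.logdensity_and_gradient(model, x_new)
    p_new = p_half .+ (ε / 2) .* ∇logp_new  # final half step for momentum
    return x_new, p_new
end
```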

@yebai (Member) commented on Jun 6, 2024

@mhauru can you take a look at this tutorial?

@mhauru (Member) left a comment

Great stuff, I learned a lot from reading it. I only have some localised comments and typo fixes.

In tutorials/docs-17-implementing-samplers/index.qmd:

```julia
model = model_wrapper.logdensity
# Let's just create the initial state by sampling using a Gaussian.
x = randn(rng, LogDensityProblems.dimension(model))
```
@mhauru (Member) commented

This is only valid for our particular model, right? E.g. this may give negative values, which might not be valid parameters for some models. I don't want to overcomplicate the example, but might it be worth mentioning that in general one should, e.g., sample from the prior?

EDIT: Noticed that you address this later. Maybe when you address it, mention that the fix would be to change this method of `step`?
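
E.g. something like this (a sketch; IIRC AbstractMCMC's `sample` forwards an `initial_params` keyword to the first `step` call):

```julia
# Sketch of an initial `step` that prefers user-supplied parameters (e.g. a
# draw from the prior) over the standard-Gaussian fallback.
function AbstractMCMC.step(
    rng::Random.AbstractRNG,
    model_wrapper::AbstractMCMC.LogDensityModel,
    sampler::MALA;
    initial_params=nothing,
    kwargs...,
)
    model = model_wrapper.logdensity
    x = if initial_params === nothing
        randn(rng, LogDensityProblems.dimension(model))
    else
        initial_params
    end
    return x, x  # placeholder; build the usual (sample, state) from `x` here
end
```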

@torfjelde (Member, Author) commented

Even if you use a different sampling method here which draws values inside the support of the distribution, you'd still run into the same bug (just further down the call stack), because a gradient-based sampler like MALA will inevitably end up exploring the space outside of the support (unless you have a tiny stepsize). So we can't really implement this sampler in a generic fashion, without a lot of glue code, in such a way that it also works with constrained distributions.

Hence I just avoid the issue entirely, and later demonstrate a failing case and how we can use Turing.jl to "address" it.
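
Roughly, the "fix" looks like this (a sketch; I'm assuming the tutorial's `MALA` type takes a stepsize, and that `unconstrained=true` is the default for `externalsampler`):

```julia
using Turing

@model function constrained_demo()
    s ~ InverseGamma(2, 3)   # support is (0, ∞); a Gaussian step can leave it
    x ~ Normal(0, sqrt(s))
end

# `externalsampler` hands the sampler an unconstrained version of the model
# (internally working with e.g. log(s)), so MALA's gradient-based steps can
# roam freely without ever producing an invalid `s`.
chain = sample(constrained_demo(), externalsampler(MALA(0.01)), 1_000)
```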

In tutorials/docs-17-implementing-samplers/index.qmd:

```julia
using Turing

# Overload the `getparams` method for our "sample" type, which is just a vector.
Turing.Inference.getparams(::Turing.Model, x::AbstractVector{<:Real}) = x
```
@mhauru (Member) commented

I'm a bit confused by this, which looks like type piracy to me. Should this rather be defined for `MALAState`?

EDIT: Also, I just noticed that the docstring of `getparams` says "Return a named tuple of parameters", which adds to my confusion.

@torfjelde (Member, Author) commented

Yeah, this was just a temporary hack.

And regarding the `getparams` docstring: it's unfortunately quite a mess. `getparams` is used in different ways depending on what inputs it receives. We decided to just overload it for the AbstractMCMC.jl purpose because we're going to move the implementations that relate to AbstractMCMC.jl samplers into AbstractMCMC.jl itself, in which case we do want to call it `getparams` (and then the current `Turing.Inference.getparams` should never be touched outside of Turing.jl).

But I agree that the type piracy should be avoided, so I'll define a `MALASample`. (Note that the "sample" returned from `step` is generally going to be different from the state: the "sample" should contain information "for public consumption", while the state should contain all the information needed by the MCMC sampler.)
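
Something like this (a sketch, mirroring the signature from the snippet above):

```julia
using Turing

# Wrap the raw vector in a dedicated sample type so the `getparams` overload
# is defined on our own type instead of pirating `AbstractVector`.
struct MALASample{T<:AbstractVector{<:Real}}
    x::T
end

Turing.Inference.getparams(::Turing.Model, sample::MALASample) = sample.x
```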

@torfjelde (Member, Author) commented

This should be ready now :)

@torfjelde merged commit 71e2303 into master on Jun 11, 2024. 3 checks passed.

github-actions bot added a commit that referenced this pull request on Jun 11, 2024.

@yebai deleted the torfjelde/alternative-sampler-docs branch on June 20, 2024.