Fix `logpdf_grad` for BroadcastedNormal. #433

MrVPlusOne · 2021-07-14T02:04:51Z

As requested by #432, this PR fixes the gradient of BroadcastedNormal for the non-scalar cases.

Now it should correctly handle non-scalar arguments.

ztangent · 2021-07-14T03:36:05Z

Thanks for making this PR! I'm not sure why the Travis build errored, but separately from that, it'd be great if you could adjust the implementation to be overflow-safe in line with #321!
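The details of #321 are not quoted in this thread. As a rough sketch only, assuming "overflow-safe" means working with the standardized residual `z = (x - mu) / std` instead of forming large powers of `std`, the scalar gradient could be written as below. The function names are hypothetical and this is not Gen's implementation.

```julia
# Hypothetical helper names; an illustration, not Gen's actual code.

# Naive derivative of the normal log density with respect to `std`.
# It forms resid^2 and std^3, both of which overflow for very large inputs.
function naive_grad_std(x::Real, mu::Real, std::Real)
    resid = x - mu
    return -1 / std + resid^2 / std^3
end

# Standardize first, so that no large power of `std` is ever formed.
function safe_normal_logpdf_grad(x::Real, mu::Real, std::Real)
    z = (x - mu) / std
    deriv_x = -z / std
    deriv_mu = z / std
    deriv_std = (z * z - 1) / std
    return (deriv_x, deriv_mu, deriv_std)
end

# Both forms agree on ordinary inputs; only the second stays finite on huge ones.
ordinary = safe_normal_logpdf_grad(1.0, 0.0, 2.0)
huge = safe_normal_logpdf_grad(2e200, 0.0, 1e200)
```

The two forms are algebraically identical, since `-1/std + (x - mu)^2 / std^3 == (z^2 - 1) / std`; the difference is only in which intermediate values get computed.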

bzinberg

Thanks @MrVPlusOne! A couple of initial comments now; I'll come back later to suggest unit / regression tests.

src/modeling_library/distributions/normal.jl

bzinberg · 2021-07-14T16:44:08Z

test/modeling_library/distributions.jl

-    @test isapprox(actual[1], finite_diff(f, args, 1, dx; broadcast=true))
-    @test isapprox(actual[2], finite_diff(f, args, 2, dx; broadcast=true))
-    @test isapprox(actual[3], finite_diff(f, args, 3, dx; broadcast=true))


Why this change?

MrVPlusOne · 2021-07-14

I'm actually a little unsure how to fix these tests. It seems that the first argument of isapprox is a zero-dimensional array (which I believe is the expected behavior) but the second argument is a Float64, which causes Julia to complain that no method of isapprox matches the given argument types. Any suggestions on how to fix this?

bzinberg · 2021-07-14

After investigating this a bit, I think it boils down to JuliaLang/julia#28866, which I consider to be a bug in Julia. Yes -- in the version of the code before this PR, both the LHS and the RHS should be 0-dimensional arrays. For the RHS, the issue appears to be that these lines (Gen.jl/test/runtests.jl, lines 22 to 23 in d79aded)

    pos_args[i] = copy(args[i]) .+ dx
    neg_args[i] = copy(args[i]) .- dx

do ::Array{T,0} .+ ::T and ::Array{T,0} .- ::T, which should give an ::Array{T,0} but, due to JuliaLang/julia#28866, give a scalar. The LHS of the ./ on the following line should therefore pass an Array{T,0} as the ith argument to f and so (since broadcast = true) produce an Array{T,0}. The division would then be ::Array{T,0} ./ ::T, which should also output an Array{T,0}, but that is not what happens.
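A small sketch of the behavior described above, plus one possible workaround. The helper `shift` is hypothetical and is not the fix that was committed to `finite_diff`.

```julia
# A zero-dimensional array holding a single Float64.
a = fill(1.0)
dx = 1e-6

# Broadcasting a 0-dim array against a scalar collapses to a plain scalar
# (the behavior discussed in JuliaLang/julia#28866).
collapsed = a .+ dx

# Hypothetical workaround: rebuild the 0-dim container explicitly,
# and fall back to ordinary broadcasting for everything else.
shift(arg::AbstractArray{<:Any,0}, dx) = fill(arg[] + dx)
shift(arg, dx) = arg .+ dx

kept = shift(a, dx)
```

With this dispatch, `shift` preserves the container type for zero-dimensional inputs while leaving scalars and higher-rank arrays on the usual broadcasting path.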

bzinberg · 2021-07-14

I think the cleanest way to fix this is to add a workaround to the (test-only) function finite_diff so that it handles zero-dimensional array valued arguments correctly.

(I'm currently drafting this.)

bzinberg · 2021-07-14

Aha, in addition to the above, the function f being supplied to finite_diff does not satisfy the requirements in its docstring: it takes array-valued arguments, but it is not the broadcast of a function that takes scalar-valued arguments (i.e., logpdf_grad applied to vector-valued distribution parameters is not a broadcast of a bunch of logpdf_grad calls on scalar-valued distribution parameters). So I guess we should add a variant of finite_diff that has the semantics we need.

MrVPlusOne · 2021-07-14

Yeah, I think to properly test this, we will need a more general finite_diff function, or to use an autodiff library's implementation as a reference.
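For illustration, a more general finite-difference helper could perturb one element of an array-valued argument at a time. This is a sketch under assumptions: `finite_diff_elementwise`, `normal_logpdf`, and `f` are hypothetical names, `f` is assumed to return a scalar, and the perturbed argument is assumed to be a floating-point array. It is not the variant that was added to Gen's tests.

```julia
# Hypothetical helper: central finite differences of a scalar-valued `f`
# with respect to every element of its i-th (array-valued) argument.
function finite_diff_elementwise(f, args::Tuple, i::Int, dx::Float64)
    grad = similar(args[i], Float64)
    for idx in eachindex(args[i])
        # Perturb a single element up and down, leaving the rest untouched.
        pos = copy(args[i]); pos[idx] += dx
        neg = copy(args[i]); neg[idx] -= dx
        pos_args = (args[1:i-1]..., pos, args[i+1:end]...)
        neg_args = (args[1:i-1]..., neg, args[i+1:end]...)
        grad[idx] = (f(pos_args...) - f(neg_args...)) / (2 * dx)
    end
    return grad
end

# Total log density of elementwise normals, with parameters broadcast against x.
normal_logpdf(x, mu, sigma) = -log(sigma) - 0.5 * log(2π) - 0.5 * ((x - mu) / sigma)^2
f(x, mu, sigma) = sum(normal_logpdf.(x, mu, sigma))

x = [0.5 1.0 1.5; 2.0 2.5 3.0]   # 2×3 observations
mu = [0.0, 1.0]                  # length-2 vector, broadcast along columns
sigma = fill(2.0)                # zero-dimensional array

g_mu = finite_diff_elementwise(f, (x, mu, sigma), 2, 1e-6)
g_sigma = finite_diff_elementwise(f, (x, mu, sigma), 3, 1e-6)
```

Because the result is allocated with `similar`, each gradient has the shape of the argument it differentiates, which is the property the shape checks in this PR are after.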

bzinberg · 2021-07-14

you clearly trust autodiff more than I do 🙂

bzinberg · 2021-07-14T22:57:04Z

@MrVPlusOne, I've drafted a fix to the unit test breakage. Take a look and LMKWYT?

MrVPlusOne · 2021-07-15T00:02:13Z

@bzinberg Thanks! The fix looks good to me!

bzinberg · 2021-07-17T05:28:34Z

@MrVPlusOne, I've made a few more changes that are mostly by way of tidying.

- Rename the vars in the unbroadcast helper functions to generically describe what they do, rather than coupling them to how they're used in logpdf_grad; e.g., unbroadcast_for_arg(arg, grad) becomes _unbroadcast_like(a, full_arr)
- Add some documentation, mainly for what "unbroadcast" means, since I can imagine that being ambiguous or confusing to a reader who doesn't have the same context we have
- A bit of code tidying without changing the logic
- Line length and indentation
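For readers without that context, "unbroadcast" here can be read as: sum a full-shape array over the dimensions along which a smaller argument was stretched by broadcasting, so that the result has the argument's own shape. The sketch below is an assumption about those semantics. The names echo the helpers mentioned above, but the bodies are illustrative and are not Gen's code.

```julia
# Hypothetical sketch of "unbroadcasting" a full-shape array to a target shape.
function unbroadcast_to_shape(target_shape::Tuple, full_arr::AbstractArray)
    nd = ndims(full_arr)
    length(target_shape) <= nd ||
        throw(DimensionMismatch("target has more dimensions than full_arr"))
    # Size of the target along dimension d; missing trailing dims count as 1.
    target_size(d) = d <= length(target_shape) ? target_shape[d] : 1
    # Dimensions along which broadcasting stretched the target.
    stretched = Tuple(d for d in 1:nd
                      if target_size(d) == 1 && size(full_arr, d) > 1)
    summed = isempty(stretched) ? full_arr : sum(full_arr; dims=stretched)
    return reshape(summed, target_shape)
end

# A scalar argument was stretched along every dimension, so sum everything.
unbroadcast_like(a::Real, full_arr::AbstractArray) = sum(full_arr)
unbroadcast_like(a::AbstractArray, full_arr::AbstractArray) =
    unbroadcast_to_shape(size(a), full_arr)

# Arguments of different ranks broadcast against a 2×3 array.
full = ones(2, 3)
g_vec = unbroadcast_like(ones(2), full)      # rank-1 argument
g_row = unbroadcast_like(ones(1, 3), full)   # rank-2 argument with a singleton dim
g_zero = unbroadcast_like(fill(1.0), full)   # zero-dimensional argument
```

Treating missing trailing dimensions as size 1 is what lets arguments of different ranks share one code path.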

The one functional change I made was to add to a couple of the unit tests an explicit check on the shape of the returned arrays.

I think regression testing is covered by the shape checks that were fixed in e712c71. They passed before, but that is because the unit tests were incorrect.

@MrVPlusOne, any comments you'd like to make on the above? If it looks good to you, I think this is ready to merge.

MrVPlusOne · 2021-07-18T16:26:42Z

Thanks for the cleanup; it all looks good to me! One more nice thing to have would be a unit test for the case where the two arguments of broadcasted_normal have different numbers of dimensions. We currently don't have a test for that, right?

bzinberg · 2021-07-22T03:20:24Z

Great idea @MrVPlusOne, how's this?

MrVPlusOne · 2021-07-22T03:28:17Z

Thanks, looks good to me!

bzinberg · 2021-08-02T18:41:38Z

Thanks @MrVPlusOne for using and improving Gen!

Update logpdf_grad for BroadcastedNormal.

56cdef4

Now it should correctly handle non-scalar arguments.

MrVPlusOne added 2 commits July 14, 2021 00:20

Make broadcasted normal's gradient overflow-proof.

76e7f98

Fix keyword syntax for earlier julia versions.

Update the test for zero-dimensional arguments.

a7ff2d7

bzinberg reviewed Jul 14, 2021

Update src/modeling_library/distributions/normal.jl

8bc4d89

Co-authored-by: bzinberg <[email protected]>

bzinberg changed the title ~~Update logpdf_grad for BroadcastedNormal.~~ Fix logpdf_grad for BroadcastedNormal. Jul 14, 2021

bzinberg added 3 commits July 14, 2021 18:49

Add workaround for the zero-dimensional array issue in probcomp#433 (comment)

8d79f46

Implement and use the right variant of finite_diff for this more general case See probcomp#433 (comment)

0d37660

fix incorrect unit tests 😬

e712c71

bzinberg and others added 12 commits July 17, 2021 00:00

indentation

b35f326

when testing BroadcastedNormal, directly verify shapes of the return values of `logpdf_grad`

6dc4492

clarify that the equivalence is for logpdf, not logpdf_grad

127e07c

add ref link in docstring

677282a

add another ref link in docstring

36e8c1f

rename unbroadcast_for_arg -> _unbroadcast_for_arg, connoting that it's an implementation detail

8e50e66

indentation

f7cc6d9

Rename unbroadcast functions to say what they generically do, rather than being coupled to a specific usage where the target shape is an "arg" and the full array is a "grad"

ab66a2b

ternary expression is long, use if instead

7159b11

tidy up the body of _unbroadcast_to_shape a bit

f9ac944

The term "unbroadcast" is non-obvious enough that it deserves a docstring

e08dd07

arg -> a

c4573fc

Add tests and modify an existing test to exercise the case where the parameters `broadcasted_normal` have different ranks See probcomp#433 (comment)

5d3f192

bzinberg merged commit acd7005 into probcomp:master Aug 2, 2021

bzinberg linked an issue Aug 4, 2021 that may be closed by this pull request

logpdf_grad(::BroadcastedNormal, ...) sums over the wrong dims #432

Closed
