use a functor for projection #385

oxinabox · 2021-06-29T16:20:05Z

Alternative to #380, #382 and #383. See usage in JuliaDiff/ChainRules.jl#459

use a ProjectTo functor, instead of passing to a project function a target type + info keyword arg.
In the place of #383 it uses the constructor for the ProjectTo rather than a preproject function returning an info namedtuple.

It's kinda a curried functor. One would do ProjectTo(2.5)(1.0 + 2.0im), or more realistically project = ProjectTo(2.5); ...; project(1.0 + 2.0im).

An additional change over #383 is it only allows you to pass the type-alone into constructing ProjectTo for subtypes of Number.
Since other things might have structure that we need to be capturing; so recursion breaks.
(See e.g. additional tests over #383 for Arrays of Arrays, and Arrays of Any)
One reason not to allow this at all is that it may be easy to misread: ProjectTo(Float64)(1.0 + 2.0im), is not ProjectTo{Float64}(1.0 + 2.0im)
but it is probably worth keeping for arrays of numbers, since it makes it easy to write the optimized case for that which doesn't store 1 ProjectTo per element.

Another minor additional change is it makes use of some of the tooling for working with structs like construct and backing which should be faster, those were optimized fairly carefully.

Some observations (comments welcome):

This PR is cool because similarly to WIP: projector implementation (returning a closure) #382, and WIP: preproject and project implementation #383 it allows us to not close over primal value (x)
This PR is cool because (roughly similar to WIP: preproject and project implementation #383) unlike WIP: projector implementation (returning a closure) #382, where the returned closure can not be extended, the project function can be extended by someone developing a Quaternions package. They would simply define (::ProjectTo{Real})(dx::Quaternion) = ...
It's also clean (only one argument functions), compared to WIP: project implementation #380 where there are 3-arg functions, and unlike WIP: preproject and project implementation #383 calling project doesn't needs an info kwarg, as everything needed in stored in the functor (information about the primal x, such as size or element type.)
These also seem to compose relatively naturally. I.e. ProjectTo contains a NamedTuple, one element of which can be ProjectTo for an inner projection
It solves the issue in WIP: use preproject implementation ChainRules.jl#457 that closing over the primal type values doesn't infer, because it puts the primal type into the type-parameters of ProjectTo (I think this is a big one)
unlike WIP: preproject and project implementation #383 it doesn't have unadorned NamedTuples floating around, so debugging can be easier.

Something I am not certain on is if out type arg is the type of the primal or not.
I think it is not, right?
It the type of the destination differential?
Which for most types we want this for will be a natural tangent type for the primal, and probably even equal to the primal.
We might want to document this somewhere there the destination type-arg must be for a valid tangent type.
(And so a NamedTuple is not valid)

I am not 100% sold on the name ProjectTo

IMO this is the best way forward of the four PRs. Thoughts?

I am putting this up here, but probably @mzgubic, will take it back over tomorrow.
It is is very heavily based on #383, and all the learnings that went into that.

src/projection.jl

mzgubic

Nice, thanks, that's such an elegant solution, will take it over from here if that's ok?

src/projection.jl

oxinabox · 2021-06-30T09:56:33Z

Please do

mzgubic · 2021-06-30T10:45:32Z

Something I am not certain on is if out type arg is the type of the primal or not.
I think it is not, right?
It the type of the destination differential?
Which for most types we want this for will be a natural tangent type for the primal, and probably even equal to the primal.
We might want to document this somewhere there the destination type-arg must be for a valid tangent type.
(And so a NamedTuple is not valid)

I am not sure I understand what you are saying here. Is it that in

struct ProjectTo{P, D<:NamedTuple}
    info::D
end

P is not necessarily the primal type? The only two cases I can think of are Period and DateTimes, and the projection to a Tangent type. Are there any other cases you can think of, where P is not equal to the primal type?

oxinabox · 2021-06-30T11:08:26Z

It is hard to construct good examples because many of the LinearAlgebra structured array types, only accept vector-space elements.
Something like if the primal type was a Diagonal of Tuples.
then when doing sum(prod, Diagonal([(1.0, 2.0), (3.0, 4.0)]))
you would want to project not onto a Diagonal of Tuples but on to a Diagonal of Tangent{<:Tuple}s.
Except that sum errors in the primal as calling sum on a sparse array that has elements that don't define zero is not allowed.

I think this is very rare, and I think the thing we need to do is be clear in the docs.
We do not promise to project onto arbitrary types, only on to valid tangent types.
And the cases where the is most useful is where the primal is also a valid natural tangent type for itself.

test/projection.jl

Co-authored-by: Lyndon White <[email protected]>

src/projection.jl

docs/src/writing_good_rules.md

oxinabox

Some last comments,
other than that LGTM.
I do approve this, I can't actually click approve as I am the original author.
But you can approve yourself and merge.
We have both looked closely enough at this

docs/src/writing_good_rules.md

src/projection.jl

docs/src/writing_good_rules.md

Co-authored-by: Lyndon White <[email protected]>

docs/Manifest.toml

mcabbott · 2021-07-06T12:06:46Z

Something I am not certain on is if out type arg is the type of the primal or not.
I think it is not, right?
It the type of the destination differential?

Is this clarified and written up somewhere? There has been a blizzard of implementation but I am not sure I've seen clarity on the basic goal here, what types, how they are chosen, and where this gets applied. (FluxML/Zygote.jl#965 was one take on these questions, but I'm not sure anyone read it.)

mcabbott · 2021-07-06T12:13:47Z

src/projection.jl

+        function (project::ProjectTo{<:$SymHerm})(dx::AbstractMatrix)
+            return $SymHerm(project.parent(dx), project.uplo)


This looks a lot like it just applies a Symmetric wrapper, rather than projecting onto the space of symmetric matrices. I think that's wrong, and raised this point on one of the other implementations. Maybe you disagree and can point me to where this was discussed?

Earlier comment was #382 (comment)

Oh, sorry, I meant to comment on this but forgot in the middle of the many small things that came up in the PR. I don't think we are projecting onto the space of symmetric matrices, but rather on the space of Symmetric matrices. Which can indeed hold an asymmetric data field.

Is there an example that gives an unexpected result? I stared at the finite differencing a little bit and that does seem odd, but not long enough to figure out whether it is a project or finite differencing issue

Yes, my Zygote PR has examples. The finite-differencing code was giving bizarre answers and should for now be ignored. The mathematical question seems pretty clear.

"Many small things" seems accurate, I worry that in rushing to sort them all out, we've lost sight of big-picture questions.

mzgubic · 2021-07-06T13:11:52Z

Something I am not certain on is if out type arg is the type of the primal or not.
I think it is not, right?
It the type of the destination differential?

Is this clarified and written up somewhere? There has been a blizzard of implementation but I am not sure I've seen clarity on the basic goal here, what types, how they are chosen, and where this gets applied. (FluxML/Zygote.jl#965 was one take on these questions, but I'm not sure anyone read it.)

My understanding is that we have decided to only project onto valid tangent spaces (and not on arbitrary primals, so we don't accept Tuple's or NamedTuples). @oxinabox will expand the write up in the docs on this. See JuliaDiff/ChainRules.jl#467 (I will add your other concerns there as well)

willtebbutt and others added 22 commits June 22, 2021 11:37

Sketch project implementation

914bd92

change Composite to Tangent

06678a4

export project

c58f974

make T optional

00020e3

add tests and Complex

37f9253

workout the edge cases

4e1b79d

rename dummy struct

7dc58ee

rename project to projector

3345ba9

move to projector

31d81ed

do not close over x (other than in the general case)

2ea4845

update docstring

465e1d7

fix getproperty

0a06dce

add to Tangent and to Symmetric

d822b02

remove debug strings

25a7cee

separate out the projector

7801e19

implement preproject

9147fad

remove getproperty for thunks

cc2f199

remove to Tangent

2aa3859

fix docstrings

44ef266

project nested structs

d8848f5

Change from preproject to ProjectTo functor

88da9c6

Make sure Arrays of Arrays etc work

e0318b3

oxinabox mentioned this pull request Jun 29, 2021

use ProjectTo functor everywhere, and update to v1 of CRC and CRTU JuliaDiff/ChainRules.jl#459

Merged

oxinabox commented Jun 29, 2021

View reviewed changes

src/projection.jl Show resolved Hide resolved

oxinabox commented Jun 29, 2021

View reviewed changes

src/projection.jl Outdated Show resolved Hide resolved

mzgubic reviewed Jun 30, 2021

View reviewed changes

src/projection.jl Outdated Show resolved Hide resolved

src/projection.jl Outdated Show resolved Hide resolved

src/projection.jl Outdated Show resolved Hide resolved

src/projection.jl Outdated Show resolved Hide resolved

remove the special case ProjectTo(::Type{<:Number})

ce5d646

oxinabox commented Jul 2, 2021

View reviewed changes

test/projection.jl Outdated Show resolved Hide resolved

Miha Zgubic and others added 2 commits July 2, 2021 19:49

PermutedDimsArray

9d665c0

Update test/projection.jl

030d636

Co-authored-by: Lyndon White <[email protected]>

oxinabox commented Jul 2, 2021

View reviewed changes

src/projection.jl Outdated Show resolved Hide resolved

oxinabox commented Jul 5, 2021

View reviewed changes

src/projection.jl Outdated Show resolved Hide resolved

Miha Zgubic added 4 commits July 5, 2021 12:51

fix docs

b87368f

JuliaFormatter

0f09ab9

simplify one of the PermutedDimsArray

3a47f6f

document when to use ProjectTo

ce022d5

mzgubic mentioned this pull request Jul 5, 2021

Iron out ProjectTo JuliaDiff/ChainRules.jl#467

Closed

4 tasks

oxinabox commented Jul 5, 2021

View reviewed changes

docs/src/writing_good_rules.md Outdated Show resolved Hide resolved

oxinabox commented Jul 5, 2021

View reviewed changes

mzgubic approved these changes Jul 6, 2021

View reviewed changes

docs/src/writing_good_rules.md Show resolved Hide resolved

Apply suggestions from code review

4106232

Co-authored-by: Lyndon White <[email protected]>

mzgubic reviewed Jul 6, 2021

View reviewed changes

docs/Manifest.toml Outdated Show resolved Hide resolved

Update docs/Manifest.toml

04a4e87

mzgubic merged commit 3acd962 into master Jul 6, 2021

mzgubic deleted the ox/project_info branch July 6, 2021 10:18

mcabbott reviewed Jul 6, 2021

View reviewed changes

mzgubic mentioned this pull request Jul 7, 2021

mul/ewise rules for basic arithmetic semiring JuliaSparse/SuiteSparseGraphBLAS.jl#26

Merged

mcabbott mentioned this pull request Jul 7, 2021

Use abstract types for projection #391

Merged

IvanYashchuk mentioned this pull request Nov 4, 2022

Triangular solver for sparse matrices pytorch/pytorch#87358

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use a functor for projection #385

use a functor for projection #385

oxinabox commented Jun 29, 2021 •

edited

Loading

mzgubic left a comment

oxinabox commented Jun 30, 2021

mzgubic commented Jun 30, 2021

oxinabox commented Jun 30, 2021 •

edited

Loading

oxinabox left a comment

mcabbott commented Jul 6, 2021

mcabbott Jul 6, 2021

mcabbott Jul 6, 2021

mzgubic Jul 6, 2021

mcabbott Jul 6, 2021 •

edited

Loading

mzgubic commented Jul 6, 2021

		function (project::ProjectTo{<:$SymHerm})(dx::AbstractMatrix)
		return $SymHerm(project.parent(dx), project.uplo)

use a functor for projection #385

use a functor for projection #385

Conversation

oxinabox commented Jun 29, 2021 • edited Loading

mzgubic left a comment

Choose a reason for hiding this comment

oxinabox commented Jun 30, 2021

mzgubic commented Jun 30, 2021

oxinabox commented Jun 30, 2021 • edited Loading

oxinabox left a comment

Choose a reason for hiding this comment

mcabbott commented Jul 6, 2021

mcabbott Jul 6, 2021

Choose a reason for hiding this comment

mcabbott Jul 6, 2021

Choose a reason for hiding this comment

mzgubic Jul 6, 2021

Choose a reason for hiding this comment

mcabbott Jul 6, 2021 • edited Loading

Choose a reason for hiding this comment

mzgubic commented Jul 6, 2021

oxinabox commented Jun 29, 2021 •

edited

Loading

oxinabox commented Jun 30, 2021 •

edited

Loading

mcabbott Jul 6, 2021 •

edited

Loading