[Numpy] Add qr backward part 2 for wide matrices with m < n #18197
Conversation
Hey @D-Roberts , Thanks for submitting the PR
CI supported jobs: [miscellaneous, centos-cpu, sanity, windows-gpu, website, centos-gpu, unix-cpu, edge, clang, windows-cpu, unix-gpu] |
@haojin2 PR ready for review, tnx. |
@D-Roberts Thanks for your contribution! From now on, please ping @yzhliu for reviews of NumPy-related contributions, since I'm shifting my focus away from this project. |
Hi @yzhliu, can you take a look? thanks |
Could you elaborate a bit on how the backward works when m < n? It seems https://arxiv.org/pdf/1710.08717.pdf does not cover this case.
The code follows the idea in the reference Differential Programming Tensor Networks. At a high level: partition the input A into two matrices X and Y, and partition R (from the decomposition A = QR) into two matrices U and V, so that X = QU. X_grad is then obtained by applying the gradient derivation from the square-input case (m = n) with an adjusted Q_grad, and Y_grad is computed separately. A_grad is the concatenation of X_grad and Y_grad. |
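The partition scheme described above can be sketched in NumPy. This is a minimal illustration of the approach, not MXNet's actual C++ implementation; the function names are hypothetical, and the square-case formula is the one given in the Differentiable Programming Tensor Networks reference:

```python
import numpy as np

def copyltu(m):
    # Copy the lower triangle onto the upper: keep the diagonal,
    # mirror the strictly-lower part.
    return np.tril(m) + np.tril(m, -1).T

def qr_backward_square(q, r, dq, dr):
    # Square/deep case (m >= n), per the reference:
    #   A_grad = (dQ + Q @ copyltu(M)) @ R^{-T},  M = R @ dR^T - dQ^T @ Q
    m_mat = r @ dr.T - dq.T @ q
    b = dq + q @ copyltu(m_mat)
    # b @ inv(r).T computed via a linear solve instead of an explicit inverse
    return np.linalg.solve(r, b.T).T

def qr_backward_wide(q, r, dq, dr):
    # Wide case (m < n): split A = [X | Y] and R = [U | V] so that
    # X = Q @ U and Y = Q @ V.
    m = q.shape[0]
    u, v = r[:, :m], r[:, m:]
    du, dv = dr[:, :m], dr[:, m:]
    y = q @ v
    dy = q @ dv                                        # Y_grad
    dx = qr_backward_square(q, u, dq + y @ dv.T, du)   # adjusted Q_grad
    return np.concatenate([dx, dy], axis=1)            # A_grad = [X_grad | Y_grad]
```

The adjusted Q_grad term `dq + y @ dv.T` accounts for the fact that Q also appears in the reconstruction Y = QV, so the gradient flowing through Y contributes to Q before the square-case formula is applied.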
@mxnet-bot run ci [centos-cpu, unix-cpu, windows-gpu] |
Jenkins CI successfully triggered : [unix-cpu, windows-gpu, centos-cpu] |
@mxnet-bot run ci [centos-cpu, unix-cpu] |
Jenkins CI successfully triggered : [centos-cpu, unix-cpu] |
Great work! I added one more comment, trying to be more memory-efficient.
LGTM
@mxnet-bot run ci [unix-cpu] |
Jenkins CI successfully triggered : [unix-cpu] |
@mxnet-bot run ci [unix-cpu] |
Jenkins CI successfully triggered : [unix-cpu] |
@mxnet-bot run ci [unix-cpu] |
Jenkins CI successfully triggered : [unix-cpu] |
Hi @yzhliu - is there anything else you'd like me to do on this? tnx |
Any updates on this? |
Hello @yzhliu - are we planning to merge this soon? This particular case of differentiable QR can be useful on batches, in place of LQ or SVD, in recent computer vision research for solving least squares. |
Merged into master. Thanks @D-Roberts , @haojin2 |
This PR broke master CPU pipelines and blocks PRs (test_np_linalg_qr fails), see e.g. http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Fcentos-cpu/detail/master/2093/pipeline |
@D-Roberts @hzfan could you look into the issue that @ptrendx mentioned? If it can't be fixed in a couple of hours let's revert the change first. |
@hzfan Thank you for your prompt assistance, I appreciate it. @leezu @szha @DickJC123 I will resubmit a separate PR. For my future reference - what are your recommendations to avoid the "stale PR" situation? CI passed when first submitted about 3 months ago and I rebased and CI passed about 2 months ago when the PR was reviewed. All along I followed up on the PR every 2-3 weeks or so. |
@D-Roberts we will likely need to automate it so that stale CI checks are invalidated. In the meantime, if the PR sits for a long time, feel free to ping me or any other committer to get more attention on it. |
@D-Roberts the recommendation is to comment with `@mxnet-bot run ci [all]` to re-trigger all CI checks.
Description
This is the 2nd part of the QR backward implementation. The 1st part (merged) covered square and deep matrix shapes (nrows >= ncols) and part 2 now covers the remaining wide matrix shapes (ncols > nrows).
References:
Differential Programming Tensor Networks
The added test includes a numerical check (via central differences) of the analytical gradient, since this is a novel implementation. The tests were run offline 1K times to guard against flakiness.
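A central-difference check of this kind can be sketched as follows. This is an illustrative helper under assumed names, not the PR's actual test code: it approximates the gradient of a scalar-valued function entrywise and can be compared against an analytical gradient with `np.allclose`.

```python
import numpy as np

def numerical_grad(f, x, eps=1e-6):
    # Approximate d f / d x entrywise with central differences:
    #   (f(x + eps*e_ij) - f(x - eps*e_ij)) / (2 * eps)
    g = np.zeros_like(x)
    it = np.nditer(x, flags=['multi_index'])
    while not it.finished:
        idx = it.multi_index
        orig = x[idx]
        x[idx] = orig + eps
        fp = f(x)
        x[idx] = orig - eps
        fm = f(x)
        x[idx] = orig  # restore the entry before moving on
        g[idx] = (fp - fm) / (2 * eps)
        it.iternext()
    return g
```

Central differences have O(eps^2) truncation error, versus O(eps) for a one-sided difference, which makes the comparison tolerance less sensitive to the choice of eps.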
Checklist
Changes