[FEATURE] Add interleaved batch_dot oneDNN fuses for new GluonNLP models #20312
Conversation
Hey @bgawrych, thanks for submitting the PR.
CI supported jobs: [windows-gpu, clang, unix-gpu, edge, centos-gpu, website, sanity, unix-cpu, centos-cpu, miscellaneous, windows-cpu]
Hi @leezu @szha, is something wrong with master CI? I always get failures on windows-gpu, both in this PR and in #20227.
If the tests fail consistently, let's disable these tests and open an issue for tracking them.
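For reference, a minimal sketch (assuming pytest, which MXNet's test suite uses) of how a consistently failing test could be disabled while a tracking issue stays open; the test name and skip reason below are hypothetical placeholders:

```python
import pytest

# Hypothetical test name; skip it with a reason that points at the tracking
# issue so it can be re-enabled once the windows-gpu failure is resolved.
@pytest.mark.skip(reason="Consistently failing on windows-gpu; tracked in a separate issue")
def test_interleaved_batch_dot_fuse():
    ...
```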
Force-pushed from 906a4c3 to 336e442
@akarbown Can you review and merge if it's OK?
Float self attention fuse WIP
Add transpose - reshape
Add self attention fuse with oneDNN support
Force-pushed from 336e442 to 88401a0
@mxnet-bot run ci [unix-cpu, windows-gpu]
Jenkins CI successfully triggered: [unix-cpu, windows-gpu]
Description
This change utilizes oneDNN matmul primitive capabilities and fuses the following sequences of operators:
Tested with the run_squad.py script from the GluonNLP repository (weights downloaded from the QA README.md file).
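For context, a minimal sketch (not taken from this PR, assuming MXNet 2.x/master with numpy arrays) of how the oneDNN subgraph backend is typically enabled on a hybridized Gluon model so that fused operators like these can take effect; the tiny Dense block is a hypothetical stand-in for a real GluonNLP transformer, and the backend name may be "MKLDNN" on the 1.x branch:

```python
import mxnet as mx
from mxnet.gluon import nn

# Hypothetical stand-in for a GluonNLP self-attention block; the actual fuses
# target interleaved batch_dot patterns inside transformer attention layers.
net = nn.Dense(16)
net.initialize()
x = mx.np.ones((2, 8))

net.hybridize()
# Partition the graph with the oneDNN subgraph backend ("ONEDNN" on master,
# "MKLDNN" on 1.x); supported operator sequences are replaced with fused
# subgraph operators by the pattern matcher.
net.optimize_for(x, backend="ONEDNN")
out = net(x)
```

With the backend enabled this way, running the QA script should route the attention batch_dot sequences through the fused oneDNN primitives.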
Checklist
Essentials