[Relay, OpFusion] Better tuple fusion implementation #3092
Conversation
@tqchen I need to fix the TensorFlow test, but can you have a look at the patch and let me know if I am on the right track?
I think we can still simplify it. We do not need to record the inputs of the group. We can still just do the traversal in the node relation group. If there is a relation
Thanks, I was also able to remove kTupleFields, one of the two new op patterns I added. I think it is much simpler now.
@MarisaKirisame @vinx13 please help review this PR
Ready for review @tqchen @jroesch @MarisaKirisame @zhiics @vinx13
Given that there is a lot of recent interest in the fusor, I opened a new issue (#3109) for better docs.
include/tvm/relay/op_attr_types.h
@@ -41,14 +41,17 @@ enum OpPatternKind {
   // for example :code:`out[i, ax1, j, ax2] = input[i, j]`.
   // Note that the axis need to be in order so transpose is not a bcast operator.
   kBroadcast = 1,
+  // The pattern for tuple nodes. Can fuse into subsequent injective ops.
+  kTuple = 2,
I think we need to justify a bit why kTuple is put at 2. Because tuple is special, I would rather put it at, say, 7, and use
pattern <= kInjective || pattern == kTuple
to indicate that the pattern is tuple-aware.
I thought the pattern of tuple needed to be smaller than kInjective, because when we fuse the tuple into subsequent injective ops, we want the pattern of the fused group to be kInjective (CombinePattern returns the larger of the two patterns being combined).
But I realized that CombinePattern is only called when the child group's master ref is non-null, so it doesn't work the way I expected. kTuple doesn't have to be smaller than kInjective.
It also means even if injective ops are fused into a broadcast op, the combined pattern is still kBroadcast. Is this intended?
I see; never mind, we can keep this as it is then. Please add a comment explaining why we chose this order.
I've just changed kTuple to 7; no change to fuse_ops.cc was needed. I also prefer this since it makes the diff smaller.
Let us update CombinePattern to specially handle tuple + injective and tuple + broadcast
Do you mean that both injective and broadcast win against tuple, even though their op patterns are smaller? (kTuple is now 7.)
@tqchen do you want to update both CombinePattern and its call site? (It seems CombinePattern is only called when one of its args is kOutEWiseFusable.)
LGTM
LGTM
@vinx13 @tqchen looks like CI is broken; I'm getting an error from the topi group conv2d test (verify_group_conv2d_NCHWc_int8). Maybe #3070 is related? http://ci.tvm.ai:8080/blue/organizations/jenkins/tvm/detail/PR-3092/5/pipeline/235
See #3039 for the context and discussion.
This is my second cut at fixing tuple fusion, which I hope is a better approach than the rather ad hoc one in #3049.