Fix vmlal.s16 code generation for int8 x int8 -> int32 #2748

ajtulloch · 2019-03-08T01:11:19Z

The IntrinInjecter::SwapBroadcastCast function was limited to cases where the result bitwidth was exactly 2x the input bitwidth. This fails for relevant cases such as a vectorized int8 x int8 -> int32 GEMM. This change improves code generation somewhat for those cases by emitting VMLAL.S16 instructions instead of a MOVL + VMLA.S32 sequence.
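To make the limitation concrete, here is a minimal Python model of the swap condition. It is only a sketch: `DType`, `old_should_swap`, and `relaxed_should_swap` are hypothetical names, and the relaxed rule (accept any integer-to-integer widening cast) is an assumption about the intent of the patch, not a copy of the C++ in `lower_intrin.cc`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DType:
    """Hypothetical stand-in for a scalar type: a kind plus a bit width."""
    kind: str  # "int", "uint", or "float"
    bits: int


def old_should_swap(src: DType, dst: DType) -> bool:
    # Condition shown on the removed line of the diff:
    # the result must be exactly twice as wide as the input.
    return dst.bits == src.bits * 2


def relaxed_should_swap(src: DType, dst: DType) -> bool:
    # Assumed relaxation (illustrative only): keep the 2x case, and
    # additionally accept any integer-to-integer widening cast.
    if dst.bits == src.bits * 2:
        return True
    integer_kinds = ("int", "uint")
    if src.kind not in integer_kinds or dst.kind not in integer_kinds:
        return False
    return dst.bits > src.bits


int8, int16, int32 = DType("int", 8), DType("int", 16), DType("int", 32)
fp16, fp32 = DType("float", 16), DType("float", 32)

for src, dst in [(int8, int16), (int8, int32), (fp16, fp32)]:
    print(src, "->", dst,
          "old:", old_should_swap(src, dst),
          "relaxed:", relaxed_should_swap(src, dst))
```

Under this model the int8 -> int32 cast used by the GEMM in the description is rejected by the old rule (32 is not 2 x 8) and accepted by the relaxed one, while the existing int8 -> int16 and fp16 -> fp32 cases behave the same under both.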

tqchen · 2019-03-08T01:51:48Z

src/pass/lower_intrin.cc

@@ -50,7 +50,23 @@ class IntrinInjecter : public IRMutator {
    // on ARM.
    if (const Broadcast* bcast = e.as<Broadcast>()) {
      if (const Cast* cast = bcast->value.as<Cast>()) {
-        if (cast->type.bits() == cast->value.type().bits() * 2) {
+        auto shouldSwap = [&]() {


nit: code style, the local variable should be named should_swap?

Done, my bad.

tqchen

lgtm modulo nit

FrozenGene · 2019-03-08T10:13:49Z

tests/python/unittest/test_codegen_arm.py

+            A[k, n].astype("int32") * B[k, n].astype("int32"), axis=[k]), name='C')
+        s = tvm.create_schedule(C.op)
+        s[C].vectorize(s[C].op.axis[0])
+        print(tvm.lower(s, [A, B, C], simple_mode=True))


Maybe we should remove this print.

Done, my bad.

ajtulloch · 2019-03-08T21:40:42Z

Thanks for the comments, I've updated the patch with the review comments.

tqchen · 2019-03-09T01:44:13Z

@FrozenGene please https://docs.tvm.ai/contribute/code_review.html#approve-and-request-changes-explicitly

tqchen · 2019-03-09T03:44:18Z

Thanks @ajtulloch @FrozenGene , this is now merged!

ajtulloch · 2019-03-11T22:00:31Z

Thanks for merging @tqchen.

tqchen requested changes Mar 8, 2019

tqchen approved these changes Mar 8, 2019

FrozenGene reviewed Mar 8, 2019

Fix vmlal.s16 code generation for int8 x int8 -> int32

17112b5

ajtulloch force-pushed the widening-int8-int32-arm-codegen branch from 98b54d4 to 17112b5 Compare March 8, 2019 21:37

tqchen approved these changes Mar 9, 2019

tqchen added the status: need review label Mar 9, 2019

FrozenGene approved these changes Mar 9, 2019

tqchen merged commit a7e35fc into apache:master Mar 9, 2019

tqchen added status: accepted and removed status: need review labels Mar 9, 2019

wweic pushed a commit to neo-ai/tvm that referenced this pull request Mar 9, 2019

Fix vmlal.s16 code generation for int8 x int8 -> int32 (apache#2748)

f04400e

ajtulloch deleted the widening-int8-int32-arm-codegen branch March 11, 2019 19:55

wweic pushed a commit to neo-ai/tvm that referenced this pull request Mar 12, 2019

Fix vmlal.s16 code generation for int8 x int8 -> int32 (apache#2748)

be89cc1

wweic pushed a commit to neo-ai/tvm that referenced this pull request Mar 12, 2019

Fix vmlal.s16 code generation for int8 x int8 -> int32 (apache#2748)

f01c50d

tqchen mentioned this pull request Nov 8, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed
