[QNN] Making scale/zero_points as expr instead of attrs. #4611
Conversation
Please review @zhiics @FrozenGene @jackwish @shoubhik @u99127. You might already be using QNN; this change will affect you if you read QNN graphs directly. Please review! @vinx13, FixedPointMultiply is shared between QNN and Automatic Quantization, but I deliberately kept FixedPointMultiply unchanged, so this PR does not affect automatic quantization. Please review!
LGTM. Thank you for the impressive work @anijain2305, that must be a lot of hard work :)
python/tvm/relay/frontend/tflite.py (outdated)
@@ -224,9 +226,21 @@ def get_tensor_type_str(self, tensor_type):
        raise NotImplementedError("Tensor type {} is currently not supported"
                                  .format(str(tensor_type)))

    def is_scalar(self, expr):
Is anyone using this?
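For context, a minimal sketch of what such a helper could look like; the body below is hypothetical (the diff above only shows the signature) and assumes a "scalar" means a Relay Constant backed by a single-element array:

import numpy as np
from tvm.relay import expr as _expr

def is_scalar(expr):
    # Hypothetical body, written as a free function for a self-contained
    # sketch: treat a Relay Constant whose backing NDArray holds exactly
    # one element (including rank-0) as a scalar.
    if not isinstance(expr, _expr.Constant):
        return False
    return np.prod(expr.data.asnumpy().shape) == 1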
import tvm
from tvm import relay
from .. import op as reg
from ... import expr as _expr


def get_scalar_from_constant(expr):
Duplicated with the one in frontend/common.py?
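As a sketch, the duplicated helper plausibly looks like the following; the body is an assumption based on the name and the surrounding discussion (extract the single value from a constant scalar expr), not copied from the PR. It reuses the _expr alias imported in the snippet above:

def get_scalar_from_constant(expr):
    # Assumed behavior: expr must be a Relay Constant holding exactly
    # one value; return that value as a plain Python scalar.
    assert isinstance(expr, _expr.Constant), "expected a Constant"
    value = expr.data.asnumpy()
    assert value.size == 1, "expected a constant scalar"
    return value.item()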
src/relay/qnn/op/concatenate.cc (outdated)
RELAY_ERROR("qnn concatenate requires a tuple of scales as the second argument, found "
            << PrettyPrint(types[1])));
}
for (auto input_scale : input_scales_tuple->fields) {
Could be auto&?
src/relay/qnn/op/concatenate.cc (outdated)
RELAY_ERROR("qnn concatenate requires a tuple of zero_points as the third argument, found "
            << PrettyPrint(types[2])));
}
for (auto input_zero_point : input_zero_points_tuple->fields) {
Could be auto&?
const auto& is maybe better. We don't modify the value and just do a check.
6cbf86f to 9b5ddf2
On behalf of @u99127, lgtm.
Thanks @anijain2305 @FrozenGene @Leo-arm @jackwish, this is merged.
Currently, the QNN dialect only deals with uniform (per-tensor) quantization, where each tensor has just one scale and one zero point. Because of this restriction, the QNN design kept scale and zero point as op attributes. However, as we move towards channel quantization, the scale will become a vector (and behave similarly to the bias input of bias_add in terms of op inputs).
Before moving to channel quantization, this PR makes the necessary changes to pass scale and zero points as input exprs to operators (instead of making them op attrs). So, this PR does not bring any functional or performance change to QNN graphs: the new type checks still require that scale and zero points are constant scalar values. Once this PR is merged and we start moving towards channel-wise quantization, we can start relaxing the checks and modifying the lowering.
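As a rough illustration of the API shape after this change, here is a minimal sketch using qnn.requantize; the specific op and concrete values are illustrative rather than taken from the PR, but the point it shows is that quantization parameters are now passed as Relay expressions (constant scalars today, potentially 1-D vectors once per-channel support lands) instead of op attributes:

import tvm
from tvm import relay

data = relay.var("data", shape=(1, 8), dtype="int8")

# Quantization params are now inputs (Relay exprs), not op attributes.
# The type checker still requires constant scalars at this stage; with
# per-channel quantization the scale can later become a vector.
input_scale = relay.const(0.5, "float32")
input_zero_point = relay.const(0, "int32")
output_scale = relay.const(0.25, "float32")
output_zero_point = relay.const(0, "int32")

out = relay.qnn.op.requantize(data,
                              input_scale, input_zero_point,
                              output_scale, output_zero_point,
                              out_dtype="int8")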