[WIP][QNN] Quantized fully connected #3597

shoubhik · 2019-07-22T22:42:59Z

This PR is the implementation of pre-quantized fully connected op.

Requantize converts one quantized tensor representation to another quantized representation. The PR has following implementation features - Requantize operator defined in qnn namespace - relay.qnn.requantize - Lowering of the requantize to exisiting Relay operators - Integer fixed point implementation of requantize - Two rounding modes - FE_UPWARDS (round towards infinity) and FE_AWAY_FROM_ZERO (std::round behavior) - Floating point implementation as well, that can act as reference or can be used for devices when FP32 computation is not used. - Unit test cases Relevant Issue - apache#2351 Credit to TFLite and GemmLowp to provide reference implementations.

…s uint8.

# Conflicts: # include/tvm/relay/qnn/attrs.h # python/tvm/relay/qnn/op/qnn.py # src/relay/qnn/pass/qnn_lower.cc # src/relay/qnn/util.h

# Conflicts: # python/tvm/relay/qnn/op/qnn.py

# Conflicts: # src/relay/qnn/pass/qnn_lower.cc # src/relay/qnn/util.h

shoubhik · 2019-08-15T05:28:54Z

There is dependency on #3779. Once it is merged the test cases will pass. will wait for it,

shoubhik · 2019-10-01T16:48:48Z

Another PR has been merged with these changes.

[email protected] and others added 30 commits July 8, 2019 12:12

[Relay] [Quantization] WIP - Common files for the qauntization work.

8d9e317

[Relay] [Quantization] WIP - Prototyping requantize op.

5485b58

Typo and lint fixes.

705b796

Lint fix.

6cd1328

Doc fix.

ac4349b

Uncommenting the lint script (fixing mistake).

a9fef75

Modifying the unit tests.

d9eff68

Moving C++ files into src/relay/qnn

abc7c4e

Moving python files to python/tvm/relay/qnn. Some minor fixes.

275ddd0

Moving the attrs.h inside the include directory.

a0ad8ca

Pushing files that I forgot earlier. Changing util location.

ff8936c

[Relay] [Quantization] WIP - Common files for the qauntization work.

bdca4c6

[Relay] [Quantization] WIP - Prototyping requantize op.

755f934

Typo and lint fixes.

6016b2a

Lint fix.

d54cea8

Doc fix.

ca954e0

Uncommenting the lint script (fixing mistake).

db24f1e

Modifying the unit tests.

523e16a

Moving C++ files into src/relay/qnn

18bff76

Moving python files to python/tvm/relay/qnn. Some minor fixes.

32b69df

Moving the attrs.h inside the include directory.

21168ae

Pushing files that I forgot earlier. Changing util location.

4a4beec

Incorporating comments. API change. Lint fixes.

120c050

Modifying the GetFixedPointMultiplierShift API as per comments.

989bbea

Forgot the dialect change.

8df0ddb

Retriggering Jenkins.

8d0af86

Changing rewrite to qnn_lower.

ff1b9e3

Renaming Quantize to Qnn for clarity.

362869f

[email protected] and others added 24 commits July 17, 2019 16:38

Merge branch 'requantize' into qfullyconnected

419dee0

Incorportaing review comments.

4958495

Adding API doc for QNN dialect.

f858a83

Move the qnn_lower pass to transform namespace.

823cc94

Moving from expr to module. Adding namespace in C++.

28a9587

Working test case for int/uint with bias_add

76476dc

Minor sentence rewrites. Added qnn namespace.

732d6ce

Added the API doc.

fadc573

Chanding default out_dtype to int8. Adding a test with in/out_dtype a…

956d3de

…s uint8.

merge from upstream/requantize

7a63597

Merge branch 'requantize' into qfullyconnected

3ffdbf8

# Conflicts: # include/tvm/relay/qnn/attrs.h # python/tvm/relay/qnn/op/qnn.py # src/relay/qnn/pass/qnn_lower.cc # src/relay/qnn/util.h

Style fixes. Better error messages.

d700945

Removing extra code.

21963dc

Merge branch 'requantize' into qfullyconnected

29c9e06

# Conflicts: # python/tvm/relay/qnn/op/qnn.py

Adding documentation.

d0fdd1c

More documentation fixes.

33cc075

Adding out dtype check for requantize.

bb38855

Adding corner case for FP32 to fixed point conversion.

7aac28d

Adding extra line.

635b053

Documentation fix.

222e189

quantized fully connected working with requantize.

6c833d5

Adding static inline.

a115c96

Merge branch 'master' into requantize

572a8f3

Merge branch 'requantize' into qfullyconnected

dd213b6

# Conflicts: # src/relay/qnn/pass/qnn_lower.cc # src/relay/qnn/util.h

shoubhik closed this Oct 1, 2019

shoubhik reopened this Oct 1, 2019

shoubhik closed this Oct 1, 2019

anijain2305 deleted the qfullyconnected branch November 13, 2019 00:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP][QNN] Quantized fully connected #3597

[WIP][QNN] Quantized fully connected #3597

shoubhik commented Jul 22, 2019

shoubhik commented Aug 15, 2019

shoubhik commented Oct 1, 2019

[WIP][QNN] Quantized fully connected #3597

[WIP][QNN] Quantized fully connected #3597

Conversation

shoubhik commented Jul 22, 2019

shoubhik commented Aug 15, 2019

shoubhik commented Oct 1, 2019