Expose llvm.nearbyint intrinsic. This is a faster alternate to rounding. #4001

kimishpatel · 2019-09-25T02:05:05Z

In the quantized gemm implementation we are working on, we need to quantize input data. During this step we first apply scale and zero point to the input data. Then we do rounding and casting to int8.
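The quantization step described above can be sketched in plain Python. This is only an illustration, not the code from this PR: the helper name `quantize_to_int8`, the affine form `v / scale + zero_point`, and the saturating cast are all assumptions, and Python's built-in `round()` stands in for whichever rounding primitive the kernel uses.

```python
def quantize_to_int8(values, scale, zero_point):
    """Hypothetical helper: quantize floats into the int8 range."""
    quantized = []
    for v in values:
        # Step 1: apply scale and zero point (assumed affine form).
        shifted = v / scale + zero_point
        # Step 2: round to the nearest integer.
        # Python's round() resolves exact ties to the even neighbour.
        rounded = round(shifted)
        # Step 3: "cast" to int8, assumed here to saturate at [-128, 127].
        quantized.append(max(-128, min(127, rounded)))
    return quantized


print(quantize_to_int8([0.0, 1.25, -1.0], scale=0.5, zero_point=0))
```

The rounding in step 2 runs once per input element, which is why the cost of the rounding primitive shows up in the overall op time.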

tvm::round gets lowered by LLVM into a roundf function call, which makes the op slower. I instead exposed llvm.nearbyint via TVM and was able to recover the lost performance.
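One point worth keeping in mind when swapping the two: C's roundf rounds halfway cases away from zero, while llvm.nearbyint follows the current floating-point rounding mode, which by default is round-to-nearest with ties to even. The results therefore differ only on exact ties. The sketch below mimics both behaviours in plain Python; the function names are hypothetical and nothing here calls into TVM or LLVM.

```python
import math


def round_half_away(x: float) -> float:
    """Mimics roundf: exact ties go away from zero."""
    magnitude = abs(x)
    whole = math.floor(magnitude)
    # The fractional part is computed by subtraction to avoid the
    # precision loss of the common floor(x + 0.5) trick.
    if magnitude - whole >= 0.5:
        whole += 1
    return math.copysign(float(whole), x)


def round_half_even(x: float) -> float:
    """Mimics nearbyint under the default rounding mode: ties go to even."""
    # Python's built-in round() on a float uses round-half-to-even.
    return float(round(x))


for value in (0.5, 1.5, 2.5, -0.5, 2.4):
    print(value, round_half_away(value), round_half_even(value))
```

For quantization this difference is usually acceptable, but a test that compares against a reference implementation needs to use the matching tie-breaking rule.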

So this PR is just upstreaming that change.

kimishpatel · 2019-09-25T02:07:53Z

@tqchen, sorry to tag you again on this PR, but it seems that you may be the right person to review it. Please feel free to suggest anyone else; this is quite a small PR.

tqchen

Please add a Python wrapper, a reference to it in the docs at https://github.com/dmlc/tvm/tree/master/docs/api/python, and a test case.

tqchen · 2019-09-25T02:34:37Z

As per https://docs.tvm.ai/contribute/code_review.html#ensure-test-coverage

Thanks @kimishpatel

kimishpatel · 2019-09-25T15:33:45Z

Thanks @tqchen, I have added a test case and a Python binding. Thanks for your feedback.

anijain2305 · 2019-09-25T16:37:27Z

@kimishpatel Thanks for the PR. You can also look at qnn.op.quantize. This is a wrapper that internally lowers to the sequence of relay ops you mentioned. This PR will benefit that wrapper as well.

kimishpatel · 2019-09-25T16:42:59Z

@anijain2305, thanks for the pointer. Wasn't quite aware of that.

anijain2305 · 2019-09-25T16:52:52Z

> @anijain2305, thanks for the pointer. Wasn't quite aware of that.

If you are looking at running pre-quantized models, you might want to have a look at the QNN dialect. We have added a number of QNN ops there that deal with scales and zero points.

kimishpatel · 2019-09-25T18:21:30Z

@tqchen, sorry to bug again :). Seems like all checks have passed, so just nudging for the merge. Thanks a bunch.

tqchen · 2019-09-25T18:23:04Z

Thanks @anijain2305 @kimishpatel

Expose llvm.nearbyint intrinsic. This is a faster alternate to rounding.

77f7ecd

tqchen approved these changes Sep 25, 2019

tqchen requested changes Sep 25, 2019

tqchen added the "status: need test case" and "status: need update" labels Sep 25, 2019

Added python binding. Added test.

20967ee

tqchen merged commit 17c2c0a into apache:master Sep 25, 2019

tqchen added the "status: accepted" label and removed the "status: need test case" and "status: need update" labels Sep 25, 2019

tqchen approved these changes Sep 25, 2019

tqchen mentioned this pull request Nov 8, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed
