Add FP requantize flow. Set float32 flow by default for llvm x86 targets with sse4.1 support. #9637

Icemist · 2021-12-02T11:57:13Z

Added a new calculation_flow_type parament to the relay.qnn.op.requantize. This parameter is controlling the implementation flow of this function. Valid values: "int64", "float32", "float64".
The basic idea is that for some targets implementations other than "int64" (the only one at the moment) will be more productive.
Below some measurements were made on AMD Ryzen 7 5800H with TVM_NUM_THREADS=1

Performance with "llvm -mcpu=core-avx2" target:

Performance with "llvm" target:

Accuracy with "llvm -mcpu=core-avx2" target:

requantize flow	accuracy
UPWARD_float64	75,39%
UPWARD_float32	75,32%
UPWARD_int64	75,39%
TONEAREST_float64	75,39%
TONEAREST_float32	75,32%
TONEAREST_int64	75,39%

Accuracy with "llvm" target:

requantize flow	accuracy
UPWARD_float64	75,38%
UPWARD_float32	75,30%
UPWARD_int64	75,38%
TONEAREST_float64	75,38%
TONEAREST_float32	75,30%
TONEAREST_int64	75,38%

Additional changes:

Added relay.qnn.op.requantize_config to use it at python "with" statement. This allows users to control the behavior of the requantize function not directly. It accepts two parameters: rounding and compute_dtype. It has a lower priority than these parameters passed directly to the requantize function. Note: compute_dtype will be "float32" for llvm x86 targets. For example:

mod, params = relay.frontend.from_pytorch(scripted_model, shape_list)
with tvm.transform.PassContext(opt_level=3):
	with relay.qnn.op.requantize_config(rounding="UPWARD", compute_dtype="float64"):
		lib = relay.build_module.build(mod, target=target, params=params)

Added target_has_sse41 and target_is_x86 functions to tvm.topi.x86.utils python namespace
Registered target_has.* functions from tvm.topi.x86.utils to call them from C++ code
Added Floor, LogicalOr, Equal, Less and IsFinite relay operations in C++ tvm::relay namespace
Added requantize_config validation tests

python/tvm/relay/qnn/op/qnn.py

src/relay/qnn/op/requantize.cc

Icemist · 2022-01-10T03:00:34Z

Have you benchmarked on ARM? I think we should enable this only for x86 for now.
I turned it off for not x86. It is also now possible to manage the implementation of requantize that will be used.

masahi · 2022-01-13T12:26:21Z

cc @jwfromm

include/tvm/relay/qnn/attrs.h

masahi

Looks good, only minor comments

python/tvm/relay/qnn/op/qnn.py

src/relay/qnn/op/requantize.cc

masahi · 2022-01-24T20:08:21Z

Please go through your change and remove all uses of the term calculation flow

include/tvm/relay/qnn/attrs.h

python/tvm/relay/qnn/op/qnn.py

masahi · 2022-01-25T00:48:16Z

python/tvm/topi/x86/utils.py

+            "amdfam10",
+            "athlon-4",
+            "athlon-xp",
+            "c3-2",


Do we need this level of details? I prefer dropping them. I don't think people would ever specify these targets...

I think sse4.1 - vnni are enough.

I agree, sse4.1 looks good. Users can always use requantize_config to change the default behavior.
Done.

src/relay/qnn/op/req_config.cc

masahi

Very nice, just more minor comments and I'll merge this.

…ets with sse4.1 support

masahi · 2022-01-26T06:47:48Z

Please kick another CI job.

Addressed

…ets with (apache#9637) sse4.1 support

Icemist requested review from anijain2305, areusch, comaniac, Huyuwei, icemelon, jcf94, jroesch, junrushao, jwfromm, kevinthesun, Laurawly, MarisaKirisame, masahi, mbrookhart, merrymercy, slyubomirsky, tqchen, vinx13, wweic, yzhliu, zhiics and ZihengJiang as code owners December 2, 2021 11:57

Icemist force-pushed the avoronov/float_requantize branch 8 times, most recently from 7068e63 to 9050d50 Compare December 7, 2021 16:27

jwfromm previously requested changes Dec 23, 2021

View reviewed changes

python/tvm/relay/qnn/op/qnn.py Outdated Show resolved Hide resolved

src/relay/qnn/op/requantize.cc Outdated Show resolved Hide resolved

masahi self-assigned this Jan 9, 2022

Icemist force-pushed the avoronov/float_requantize branch from 18824b4 to da09e5e Compare January 10, 2022 02:04

Icemist changed the title ~~Add FP requantize flow for llvm target~~ Add FP requantize flow. Set this by default for llvm x86 targets Jan 10, 2022

Icemist force-pushed the avoronov/float_requantize branch 2 times, most recently from 66e0220 to 5225f48 Compare January 10, 2022 02:29

Icemist force-pushed the avoronov/float_requantize branch from 5225f48 to 457711e Compare January 10, 2022 11:13

masahi reviewed Jan 23, 2022

View reviewed changes

include/tvm/relay/qnn/attrs.h Outdated Show resolved Hide resolved

masahi requested changes Jan 23, 2022

View reviewed changes

python/tvm/relay/qnn/op/qnn.py Outdated Show resolved Hide resolved

src/relay/qnn/op/requantize.cc Show resolved Hide resolved

Icemist force-pushed the avoronov/float_requantize branch 2 times, most recently from 2eb8658 to b958076 Compare January 24, 2022 15:25

Icemist force-pushed the avoronov/float_requantize branch from b958076 to 81458dc Compare January 24, 2022 22:03

masahi reviewed Jan 25, 2022

View reviewed changes

include/tvm/relay/qnn/attrs.h Outdated Show resolved Hide resolved

masahi reviewed Jan 25, 2022

View reviewed changes

python/tvm/relay/qnn/op/qnn.py Outdated Show resolved Hide resolved

masahi reviewed Jan 25, 2022

View reviewed changes

src/relay/qnn/op/req_config.cc Outdated Show resolved Hide resolved

masahi approved these changes Jan 25, 2022

View reviewed changes

Add FP requantize flow. Set float32 flow by default for llvm x86 targ…

5b07e4c

…ets with sse4.1 support

Icemist force-pushed the avoronov/float_requantize branch from 81458dc to 5b07e4c Compare January 25, 2022 11:54

Icemist changed the title ~~Add FP requantize flow. Set this by default for llvm x86 targets~~ Add FP requantize flow. Set float32 flow by default for llvm x86 targets with sse4.1 support Jan 25, 2022

Icemist changed the title ~~Add FP requantize flow. Set float32 flow by default for llvm x86 targets with sse4.1 support~~ Add FP requantize flow. Set float32 flow by default for llvm x86 targets with sse4.1 support. Jan 26, 2022

masahi merged commit ffff8dd into apache:main Jan 26, 2022

sunggg pushed a commit to sunggg/tvm that referenced this pull request Jan 29, 2022

Add FP requantize flow. Set float32 flow by default for llvm x86 targ…

b3fcb4b

…ets with (apache#9637) sse4.1 support

ylc pushed a commit to ylc/tvm that referenced this pull request Feb 16, 2022

Add FP requantize flow. Set float32 flow by default for llvm x86 targ…

c0ec226

…ets with (apache#9637) sse4.1 support

driazati mentioned this pull request Jul 14, 2022

TVM v0.9.0.rc0 Release Candidate Notes #12102

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add FP requantize flow. Set float32 flow by default for llvm x86 targets with sse4.1 support. #9637

Add FP requantize flow. Set float32 flow by default for llvm x86 targets with sse4.1 support. #9637

Icemist commented Dec 2, 2021 •

edited

Loading

Icemist commented Jan 10, 2022

masahi commented Jan 13, 2022

masahi left a comment

masahi commented Jan 24, 2022

masahi Jan 25, 2022 •

edited

Loading

Icemist Jan 25, 2022

masahi left a comment

masahi commented Jan 26, 2022

Add FP requantize flow. Set float32 flow by default for llvm x86 targets with sse4.1 support. #9637

Add FP requantize flow. Set float32 flow by default for llvm x86 targets with sse4.1 support. #9637

Conversation

Icemist commented Dec 2, 2021 • edited Loading

Icemist commented Jan 10, 2022

masahi commented Jan 13, 2022

masahi left a comment

Choose a reason for hiding this comment

masahi commented Jan 24, 2022

masahi Jan 25, 2022 • edited Loading

Choose a reason for hiding this comment

Icemist Jan 25, 2022

Choose a reason for hiding this comment

masahi left a comment

Choose a reason for hiding this comment

masahi commented Jan 26, 2022

Icemist commented Dec 2, 2021 •

edited

Loading

masahi Jan 25, 2022 •

edited

Loading