Fix intel conv2d auto tune #5200

kevinthesun · 2020-04-01T05:28:25Z

The debug_skip_region pragma causes measured execution times to be inaccurate on x86. This PR fixes the x86 conv2d and depthwise conv2d templates.

FrozenGene · 2020-04-01T06:32:56Z

I think this issue exists in all AutoTVM TOPI templates.

anijain2305 · 2020-04-01T16:23:08Z

@kevinthesun Do you also want to send the PR (or update this one) to change zero tensor to random tensor for AutoTVM for stable measurements?

kevinthesun · 2020-04-01T17:32:53Z

@FrozenGene If that's the case, would you mind opening an issue tracking all topi ops we might want to modify?

kevinthesun · 2020-04-01T17:42:00Z

@anijain2305 Added.

comaniac · 2020-04-01T17:50:36Z

Did a brief search, and here is a list of TOPI files that have the same use case:

arm_cpu/conv2d_spatial_pack.py
arm_cpu/conv2d.py
arm_cpu/depthwise_conv2d.py
bifrost/conv2d.py
cuda/conv2d_int8.py
cuda/conv2d_winograd.py
cuda/group_conv2d_nchw.py
mali/conv2d.py
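A search like the one that produced this list can be sketched with the standard library alone. This is a minimal sketch, not the command actually used: the function name `find_pragma_users` is hypothetical, and the root directory to scan depends on your checkout.

```python
from pathlib import Path


def find_pragma_users(root, needle="debug_skip_region"):
    """Return sorted, root-relative paths of .py files under root that mention needle."""
    root = Path(root)
    hits = []
    for path in root.rglob("*.py"):
        # Ignore undecodable bytes so one odd file does not abort the scan.
        text = path.read_text(encoding="utf-8", errors="ignore")
        if needle in text:
            hits.append(path.relative_to(root).as_posix())
    return sorted(hits)
```

Calling `find_pragma_users(<path to the TOPI Python sources>)` should return entries of the same shape as the list above, for example `arm_cpu/conv2d.py`. A plain substring match also reports files that only mention the pragma in a comment, so the hits still need a manual look.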

btw just curious, do you have an experimental result with an isolated case to illustrate the accuracy issue introduced by debug_skip_region?

kevinthesun · 2020-04-01T17:58:31Z

@comaniac One way to verify this is to directly build a tvm func involving debug_skip_region. I verified on x86 that debug_skip_region did cause inaccurate measurement. However, I didn't dig into why debug_skip_region causes this. @FrozenGene noticed that this issue also exists on other platforms. We might want to verify on those platforms and fix them.

merrymercy · 2020-04-01T22:51:09Z

python/tvm/autotvm/measure/measure_methods.py

            # This can avoid some memory issues that make the measurement results unreliable.
-            args = [nd.empty(x[0], dtype=x[1], ctx=ctx) for x in build_result.arg_info]
+            args = [nd.array(np.random.uniform(0.0, 255.0, size=x[0]).astype(dtype=x[1]), ctx=ctx)


This will introduce a data copy when using RPCRunner, which will bring some network overhead.
One way to solve this is by implementing a tvm.nd.non_empty or tvm.nd.random in the tvm runtime, then we can do the random fill on the target device without copying over the network.
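To get a rough feel for the size of that copy, the payload can be estimated from arg_info alone, since each entry is a (shape, dtype) pair as in the diff above. This is a minimal sketch under assumptions: `host_copy_bytes`, the dtype table, and the example shapes are hypothetical and not taken from this PR, and real RPC traffic also includes protocol overhead.

```python
from math import prod

# Bytes per element for a few common dtypes (illustrative subset only).
DTYPE_BYTES = {"float32": 4, "float16": 2, "int32": 4, "int8": 1, "uint8": 1}


def host_copy_bytes(arg_info):
    """Total bytes sent if every argument is filled on the host and copied over.

    arg_info is a list of (shape, dtype) pairs, matching x[0] and x[1] in the diff.
    """
    return sum(prod(shape) * DTYPE_BYTES[dtype] for shape, dtype in arg_info)


# Hypothetical arguments for one conv2d task: data, kernel, output.
example_arg_info = [
    ((1, 64, 56, 56), "float32"),
    ((64, 64, 3, 3), "float32"),
    ((1, 64, 56, 56), "float32"),
]

print(host_copy_bytes(example_arg_info), "bytes copied for one set of arguments")
```

For these assumed shapes the total is about 1.75 MB for each set of arguments created on the host. Filling the buffers on the target device would avoid sending that data over the network.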

@FrozenGene has implemented a version in our internal codebase. Maybe @FrozenGene can help on this?

@merrymercy Sure. I will port it to our upstream soon.

I opened #5216 to track this.

merrymercy · 2020-04-01T22:58:34Z

Good catch. In my experience, both tvm.nd.empty and debug_skip_region cause inaccurate measurements.

FrozenGene · 2020-04-02T02:13:59Z

Opened #5215 to track this issue.

kevinthesun · 2020-04-02T06:34:47Z

@merrymercy @FrozenGene Should we keep the empty array for now and wait for the non_empty array?

merrymercy · 2020-04-02T10:01:11Z

I am happy with keeping the empty array and merging this first.

FrozenGene · 2020-04-03T13:42:44Z

I am happy with keeping the empty array and merging this first.

+1

kevinthesun · 2020-04-03T23:00:19Z

Is this good to be merged?

* Fix x86 conv2d and depthwise conv2d auto tuning
* Fix depthwise conv2d infer layout
* Use random data instead of empty data for autotvm
* Fix pylint
* Keep empty array for now for autotvm

FrozenGene mentioned this pull request Apr 1, 2020

[Relay][Topi][AutoTVM] Winograd support for Conv3D #5186

Merged

merrymercy requested changes Apr 1, 2020

anijain2305 mentioned this pull request Apr 1, 2020

[TOPI x86] Adding unroll_kw config option for depthwise conv2d. #5197

Merged

tqchen assigned merrymercy Apr 2, 2020

FrozenGene mentioned this pull request Apr 2, 2020

[AutoTVM] AutoTVM incorrect measurement #5215

Closed

6 tasks

kevinthesun added 5 commits April 2, 2020 21:39

Fix x86 conv2d and depthwise conv2d auto tuning

3a765c7

Fix depthwise conv2d infer layout

d984151

Use random data instead of empty data for autotvm

13cdf61

Fix pylint

42dc0c9

Keep empty array for now for autotvm

a0e73c9

kevinthesun force-pushed the FixIntelConv2dAutoTune branch from 7f0a72a to a0e73c9 Compare April 2, 2020 21:39

merrymercy approved these changes Apr 4, 2020

merrymercy merged commit 0cfdecd into apache:master Apr 4, 2020

kevinthesun deleted the FixIntelConv2dAutoTune branch May 26, 2020 17:31

FrozenGene mentioned this pull request Jun 24, 2020

[random] support random fill #5913

Merged

ZihengJiang mentioned this pull request Sep 25, 2020

TVM v0.7 Release Note Candidate #6486

Closed
