[Fix] Fix get_valid_count flaky test for cuda #4901

Laurawly · 2020-02-17T23:40:03Z

Turned on get_valid_count test for cuda in topi. Used atomic operations in this fix to replace previous block sync method.
Tested on V100 and T4 GPUs.

@trevor-m @kevinthesun @yzhliu Could you review?

… all tests running together

trevor-m

Thanks for fixing this!

topi/python/topi/cuda/nms.py

Laurawly · 2020-02-21T21:44:33Z

ping @kevinthesun @yzhliu

yzhliu · 2020-02-21T22:31:20Z

Thanks @Laurawly @trevor-m

* get_valid_count accuracy issue fixed for individual tests but not for all tests running together * minor fix * initialize valid_count and PrefixSum buffers * test updated * udpate relay test as well * update document * fix lint * address comment * fix lint * correct atomicAdd identifier name

masahi · 2020-03-05T05:45:42Z

@Laurawly I still get a flaky failure from get_valid_count test in my PR
https://ci.tvm.ai/blue/organizations/jenkins/tvm/detail/PR-4964/8/pipeline/246

Laurawly · 2020-03-05T05:52:23Z

@masahi That’s weird. You can comment off the test for now and I’ll try to reproduce it on my end.

masahi · 2020-03-07T01:51:56Z

@Laurawly Also reported at a different PR #4931 (comment)

Laurawly · 2020-03-07T02:05:12Z

@Laurawly Also reported at a different PR #4931 (comment)

Yeah, pls feel free to comment out the test: https://github.com/apache/incubator-tvm/blob/master/topi/tests/python/test_topi_vision.py#L106

masahi · 2020-03-07T02:20:47Z

fortunately my PR is green now without commenting out :) If the next open PR by somebody else sees the same problem, I'll ask him/her to comment it out.

Laurawly added 7 commits February 14, 2020 20:51

get_valid_count accuracy issue fixed for individual tests but not for…

ed42e85

… all tests running together

minor fix

ffac02f

initialize valid_count and PrefixSum buffers

e1dfe59

test updated

e1e8d6f

udpate relay test as well

a0795d3

update document

48738bd

fix lint

94e87f5

tqchen added the status: need review label Feb 18, 2020

trevor-m requested changes Feb 18, 2020

View reviewed changes

topi/python/topi/cuda/nms.py Outdated Show resolved Hide resolved

address comment

03d4cc7

trevor-m approved these changes Feb 19, 2020

View reviewed changes

Laurawly added 2 commits February 19, 2020 19:20

fix lint

fd50a62

correct atomicAdd identifier name

02d2ead

yzhliu approved these changes Feb 21, 2020

View reviewed changes

yzhliu added status: accepted and removed status: need review labels Feb 21, 2020

yzhliu merged commit c4c61cb into apache:master Feb 21, 2020

w-zr mentioned this pull request Feb 29, 2020

Performance regression of quantization on CUDA after [Relay][AutoTVM] Relay op strategy (#4644) #4972

Closed

liangfu mentioned this pull request Mar 7, 2020

[VTA][Chisel] Change Scala Linter scalafmt => scalastyle #4998

Merged

masahi mentioned this pull request Apr 2, 2020

[Frontend][Torch] Fix up graph input handling #5204

Merged

ZihengJiang mentioned this pull request Sep 17, 2020

TVM v0.7 Release Note Candidate #6486

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fix] Fix get_valid_count flaky test for cuda #4901

[Fix] Fix get_valid_count flaky test for cuda #4901

Laurawly commented Feb 17, 2020 •

edited

Loading

trevor-m left a comment

Laurawly commented Feb 21, 2020

yzhliu commented Feb 21, 2020

masahi commented Mar 5, 2020

Laurawly commented Mar 5, 2020

masahi commented Mar 7, 2020 •

edited

Loading

Laurawly commented Mar 7, 2020

masahi commented Mar 7, 2020

[Fix] Fix get_valid_count flaky test for cuda #4901

[Fix] Fix get_valid_count flaky test for cuda #4901

Conversation

Laurawly commented Feb 17, 2020 • edited Loading

trevor-m left a comment

Choose a reason for hiding this comment

Laurawly commented Feb 21, 2020

yzhliu commented Feb 21, 2020

masahi commented Mar 5, 2020

Laurawly commented Mar 5, 2020

masahi commented Mar 7, 2020 • edited Loading

Laurawly commented Mar 7, 2020

masahi commented Mar 7, 2020

Laurawly commented Feb 17, 2020 •

edited

Loading

masahi commented Mar 7, 2020 •

edited

Loading