
[VTA] Support for batched inference #3661

Merged: 12 commits into apache:master on Jul 30, 2019

Conversation

tmoreau89 (Contributor) commented:

This PR addresses several bugs that occur when LOG_BATCH is set to a value greater than 0 (i.e. a batch size greater than 1):

  • IR pass bug fix for load2d pattern matching (a layout sketch follows this list)
  • Fast simulator bug fix in the ALU code
  • CI tests and tutorials now cover batched inference
  • AutoTVM autotuning works for batched VTA
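
For context on why the IR pass and padding fixes involve 6-d tensors: VTA tiles both the batch and channel dimensions of activations, so an NCHW tensor is laid out as a 6-d NCHWnc tensor whose inner tile sizes come from LOG_BATCH and LOG_BLOCK. A minimal NumPy sketch of that tiling (shapes chosen for illustration, not taken from this PR):

import numpy as np

# Inner tile sizes derived from the hardware config below:
# LOG_BATCH = 2 -> BATCH = 4, LOG_BLOCK = 3 -> BLOCK_IN = 8.
BATCH, BLOCK_IN = 1 << 2, 1 << 3
n, c, h, w = 4, 16, 14, 14                  # example NCHW activation, batch of 4

x = np.random.randn(n, c, h, w).astype("float32")

# Split N into (N // BATCH, BATCH) and C into (C // BLOCK_IN, BLOCK_IN),
# then move the inner tiles to the end: this is the 6-d tensor that the
# IR pass has to pad and pattern-match for 2-d loads.
x_tiled = x.reshape(n // BATCH, BATCH, c // BLOCK_IN, BLOCK_IN, h, w)
x_tiled = x_tiled.transpose(0, 2, 4, 5, 1, 3)
print(x_tiled.shape)                        # (1, 2, 14, 14, 4, 8)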

In addition, TOPHUB schedules were updated: tlc-pack/tophub@102b9b4

And a bitstream was added to test batched inference in hardware: uwsampl/vta-distro@e28d9d5

To reproduce in sim, set vta_config.json to:

{
  "TARGET" : "sim",
  "HW_VER" : "0.0.1",
  "LOG_INP_WIDTH" : 3,
  "LOG_WGT_WIDTH" : 3,
  "LOG_ACC_WIDTH" : 5,
  "LOG_BATCH" : 2,
  "LOG_BLOCK" : 3,
  "LOG_UOP_BUFF_SIZE" : 15,
  "LOG_INP_BUFF_SIZE" : 16,
  "LOG_WGT_BUFF_SIZE" : 16,
  "LOG_ACC_BUFF_SIZE" : 18
}

To run on a Pynq board, set "TARGET" to "pynq" instead.
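
As a quick sanity check that the batched configuration is in effect, something along these lines should work (a sketch assuming the standard vta Python package, where vta.get_env() parses the active vta_config.json and exposes the derived sizes):

import vta

env = vta.get_env()                  # reads the active vta_config.json
print(env.TARGET)                    # "sim" here, or "pynq" on the board
print(env.BATCH, env.BLOCK_IN)       # 4 and 8 with LOG_BATCH = 2, LOG_BLOCK = 3

# With this config a ResNet-style workload runs with batch size 4,
# e.g. an input of shape (4, 3, 224, 224) instead of (1, 3, 224, 224).
assert env.BATCH == 1 << 2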

@tmoreau89 tmoreau89 changed the title [VTA] Support for batching in inference [VTA] Support for batched inference Jul 30, 2019
@tmoreau89 tmoreau89 requested review from jroesch and tqchen and removed request for tqchen July 30, 2019 06:40
tmoreau89 (Contributor, Author) commented:

@vegaluisjose @liangfu @tqchen please review

liangfu (Member) left a comment:


LGTM

A review comment on vta/tutorials/frontend/deploy_resnet_on_vta.py was marked outdated and resolved.
@jroesch jroesch merged commit 6c7f0c4 into apache:master Jul 30, 2019
@tmoreau89 tmoreau89 deleted the hotfix-2 branch August 2, 2019 17:59
wweic pushed a commit to wweic/tvm that referenced this pull request Aug 9, 2019
* fix in IR pass to support padding on 6-d tensors

* support for both N>1 and N==1 for padding

* batch size > 1 tuning and base config

* output formatting

* batch conv2d

* print all category results

* revert to single-batch config

* pick record best

* fix conv test

* improving reporting

* address batching bug in fast simulator

* fix
wweic pushed a commit to neo-ai/tvm that referenced this pull request Sep 6, 2019
tqchen pushed a commit to tqchen/tvm that referenced this pull request Mar 29, 2020