OV Performance Hints (CPU and GPU logic for selecting the actual configs, while AUTO/MULTI are passing them thru) #6993
Conversation
Force-pushed from 296825c to 193375b
@mashoujiang fyi, this PR passes the perf hints thru the AUTO (and MULTI)
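For context, a minimal sketch of how an application could request a performance hint through AUTO with the 2021.4-era Inference Engine API. The `PERFORMANCE_HINT` and `PERFORMANCE_HINT_NUM_REQUESTS` key names come from this PR's description; the exact value strings and the model path below are assumptions for illustration, not a confirmed final API.

```cpp
#include <inference_engine.hpp>
#include <map>
#include <string>

int main() {
    InferenceEngine::Core ie;
    // Hypothetical model path, used only to make the example self-contained.
    auto network = ie.ReadNetwork("model.xml");

    // Pass the hint when loading on AUTO; per this PR, AUTO (and MULTI) only
    // forward the hint, and the actual device plugin (CPU/GPU) translates it
    // into its own configs (e.g. number of streams).
    std::map<std::string, std::string> config = {
        {"PERFORMANCE_HINT", "THROUGHPUT"},        // or "LATENCY" (assumed values)
        {"PERFORMANCE_HINT_NUM_REQUESTS", "4"}     // optional ceiling, per this PR
    };
    auto exeNetwork = ie.LoadNetwork(network, "AUTO", config);
    return 0;
}
```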
Force-pushed from ab27fff to 2f79fd4
Co-authored-by: Tatiana Savina <[email protected]>
The AUTO plugin only passes the hints instead of the full config; this needs to be handled in a follow-up PR.
Force-pushed from 1a7a8e2 to cdc0655
Force-pushed from df26364 to 6af6358
In general, LGTM. A minor comment was left.
No major objections
…igs), while AUTO/MULTI are passing them thru) (openvinotoolkit#6993)

* rebasing the perf-modes-2021.3 to the 2021.4. Caveats: the (explicit) setting of #streams is not disabled (as it was before for experiments with DLBenchmark), and the logic slightly differs (streamsSet) (cherry picked from commit 1ae1edc)
* overriding streams (to force the TPUT mode to the DLBenchmark) (cherry picked from commit 7f506cd)
* disabling the reducing of #streams to fully mimic baseline c4df94d of the 2021.3 (before experiments) (cherry picked from commit 85073dd)
* clang/indentation (cherry picked from commit 050a415)
* splitting the Transformation into general and CPU-specific parts. Now, hopefully, this fully mimics the baseline c4df94d of the 2021.3 (before experiments), as the reducing of the streams num (as well as the early exit on GRU/LSTM/TensorIterator) is disabled (cherry picked from commit e98b2c1)
* disabling GRU/LSTM/TI + reducing of streams + 5D considered compute-limited only for int8 (cherry picked from commit 32b8d80)
* refactored to avoid compute_limited_ratio, reverted the reducing of #streams, removed LSTM from limitations (cherry picked from commit f2b9721)
* isa-based threshold logic (cherry picked from commit b218457)
* mode->hint (cherry picked from commit ec20aa8)
* optional PERFORMANCE_HINT_NUM_REQUESTS (cherry picked from commit 5a3883e)
* moving the perfHints to the common OV config class + initial tests (CPU only, as the actual AUTO/MULTI should be accommodated on the master) (cherry picked from commit (then fixed) 45bafe7d527f466507dea0693aeed51be4ebf776)
* AUTO support for PerfHints
* MULTI support for PerfHints
* Enabling Perf hints for the GPU plugin
* brushing settings output a bit
* disabling the "throughput" perf hint being default (until OV 2.0)
* uncommenting the logic which was disabled to force the DLBenchmark to use the throughput mode by default
* removing dead and experimental code, and debug printfs
* clang/code-style
* code-review remarks
* Moved the output of the actual params that the hint produced to the right place
* aligning MULTI's GetConfig behavior to HETERO's, as captured in the preso (CVS-59960) ratified with the ArchForum
* clang
* benchmark_app brushing
* Update inference-engine/samples/benchmark_app/README.md
* propagating the perf hints thru one more scenario in the merged AUTO-MULTI
* fixed misprint
* Python benchmark_app update for perf hints
* addressing reviewers' comments on the python benchmark_app
* simplifying/brushing logic a bit
* refactor the heuristic to the separate file (to be shared with iGPU soon)
* refactor conversion of modes to the specific GPU config per feedback from Vladimir
For the GPU (until Auto-Batching), the logic is very simple (and mimics the benchmark_app); for the CPU, in contrast, a network-based heuristic selects the #streams.
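For illustration only, a hedged sketch (not the PR's actual code) of how a plugin could translate the THROUGHPUT hint into its pre-existing streams settings under the split described above: the GPU path simply requests the auto number of streams (as the benchmark_app does), while the real CPU path derives #streams from a network-based, isa-threshold heuristic that the placeholder below does not reproduce. The `GPU_THROUGHPUT_STREAMS`/`CPU_THROUGHPUT_STREAMS` keys and their `*_AUTO` values are the existing Inference Engine config names; the helper function itself is hypothetical.

```cpp
#include <map>
#include <string>

// Hypothetical helper: map a performance hint to a device-specific config.
std::map<std::string, std::string> HintToConfig(const std::string& device,
                                                const std::string& hint) {
    std::map<std::string, std::string> config;
    if (hint == "THROUGHPUT") {
        if (device == "GPU") {
            // benchmark_app-like default: let the GPU plugin pick the stream count
            config["GPU_THROUGHPUT_STREAMS"] = "GPU_THROUGHPUT_AUTO";
        } else if (device == "CPU") {
            // the actual PR selects #streams from a network-based heuristic;
            // the plugin's AUTO value is used here only as a stand-in
            config["CPU_THROUGHPUT_STREAMS"] = "CPU_THROUGHPUT_AUTO";
        }
    }
    return config;
}
```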
@ArtemySkrebkov-intel and @rzubarev fyi