
Fix the issue of cpu affinity mask changed on a NUMA machine with SEQ #25455

Merged

Conversation

sunxiaoxia2022
Contributor

@sunxiaoxia2022 sunxiaoxia2022 commented Jul 9, 2024

Details:

  • Fix the issue of cpu affinity mask changed on a NUMA machine with SEQ

Tickets:

  • CVS-143272
@sunxiaoxia2022 sunxiaoxia2022 requested a review from a team as a code owner July 9, 2024 06:25
@github-actions github-actions bot added the category: inference OpenVINO Runtime library - Inference label Jul 9, 2024
@dmitry-gorokhov dmitry-gorokhov added this to the 2024.4 milestone Jul 11, 2024
@dmitry-gorokhov dmitry-gorokhov added the category: CPU OpenVINO CPU plugin label Jul 11, 2024
@dmitry-gorokhov dmitry-gorokhov self-assigned this Jul 11, 2024
@dmitry-gorokhov
Contributor

dmitry-gorokhov commented Jul 11, 2024

General comment:
I don't think restoring the affinity mask at the compiled model destructor stage is a good enough solution. A user application usually interleaves additional computational logic with OV inference, so the pipeline looks like: [user code] -> [OV infer] -> [user code] -> [OV infer] -> ... -> [compiled model destructor]. So the current solution still affects user code behavior.
I could propose 2 solutions: a) restore the mask at the end of the Infer call; b) create a separate thread for compilation/inference inside OV if enable pinning is ON.

@wangleis What do you think?

@wangleis
Contributor

@dmitry-gorokhov Since the CPU mask affects the whole process, option b may not work. Xiaoxia will update the PR to change and restore the CPU mask during inference, as in option a.

@github-actions github-actions bot removed the category: CPU OpenVINO CPU plugin label Jul 15, 2024
@sunxiaoxia2022
Contributor Author

@dmitry-gorokhov I put CPU pinning before and after the task(). Please take a look again.

@dmitry-gorokhov
Contributor

As we discussed on the sync, let's clarify 2 things first:

  1. What overhead is created by [un]pin_stream_to_cpus() calls on each inference?
  2. Does OV create a separate thread for inference in SEQ mode? If yes, does it mean the application thread is not affected by OV pinning/affinity behavior?

@sunxiaoxia2022
Contributor Author

As we discussed on the sync, let's clarify 2 things first:

  1. What overhead is created by [un]pin_stream_to_cpus() calls on each inference?
  2. Does OV create a separate thread for inference in SEQ mode? If yes, does it mean the application thread is not affected by OV pinning/affinity behavior?
  1. I tested the performance of master and this PR with a small model, ebgan (ONNX, FP32), 10 times.
    Test machine: Intel(R) Xeon(R) Gold 6330 CPU @ 2.00GHz
    Test command: ./benchmark_app -m /home/openvino-ci-69/xiaoxia/models/cv_bench_cache/ww21_weekly_23.0.0-10926-b4452d56304-API2.0/ebgan/onnx/onnx/FP32/1/dldt/ebgan.xml -d CPU -hint none -nstreams 1 -nthreads 1
    Test result: the values are averages over the 10 runs.
    master: latency: 1.829 ms, throughput: 545.311 FPS
    This PR: latency: 1.833 ms, throughput: 543.561 FPS. Latency increased 0.21% compared with master; throughput dropped 0.32%.

  2. For async mode, OV creates a separate thread for inference in SEQ, the same as with TBB. So the application thread is not affected by OV pinning.
    But for sync mode, OV does not create a separate thread; it uses the app thread. So the app thread is affected by OV pinning.

@dmitry-gorokhov
Contributor

Thanks @sunxiaoxia2022 !
So to complete the task we need to fix the behavior for sync mode as well. I would propose to spawn a separate thread in the case of a SEQ build, the sync API, and enable_pinning==true.

@wangleis Do you have any objections?

@wangleis
Contributor

@dmitry-gorokhov In sync mode, the app is blocked during inference and the app thread is used for inference. Since we restore the CPU mask after inference, the current solution should be OK for sync mode, in my understanding.

@dmitry-gorokhov
Contributor

@dmitry-gorokhov In sync mode, the app is blocked during inference and the app thread is used for inference. Since we restore the CPU mask after inference, the current solution should be OK for sync mode, in my understanding.

Good point. Agree. Approved!

@wangleis wangleis added this pull request to the merge queue Jul 18, 2024
Merged via the queue into openvinotoolkit:master with commit 08c4195 Jul 18, 2024
122 checks passed
@wangleis wangleis deleted the xiaoxia/cpu_affinity_SEQ branch July 18, 2024 14:56
spran180 pushed a commit to spran180/openvino that referenced this pull request Jul 27, 2024
…openvinotoolkit#25455)

### Details:
- *Fix the issue of cpu affinity mask changed on a NUMA machine with
SEQ*

### Tickets:
 - *CVS-143272*