[Thread Backend]Fix CPU Thread Binding for Multiple Sockets #5918

kevinthesun · 2020-06-24T17:40:07Z

TVM randomly binds master thread onto available cores. If there are multiple CPU sockets, it is possible that thread binds can cross sockets which causes NUMA. This PR limits the ranged of cores in TVM_NUM_THREADS so that on multi-socket machines we can limit thread binding within single socket.

@yidawang @FrozenGene

yidawang

The change looks good to me to fix the NUMA issue. Conceptually, this is a better setting as you don't want the master thread to freely jump to the cores that you are not required. However, generally speaking, I think the current setting (beyond the scope of this PR) doesn't consider the case of multiple instances running simultaneously, which could be a potential issue.

tqchen · 2020-06-24T18:36:48Z

src/runtime/threading_backend.cc

-      int big_count = big_count_;
-      // Imagine our x86 has cores 0 - 7
-      // physical cores are 0 - 3, logical cores are 4 - 7, big_count_ is 8
-      // we wish we run on physical cores, not logical cores to avoid contention issue.


The big count might be necessary for the big.little ARM targets

Is this possible to detect big.LITTLE somewhere?

We need to look into the code. Perhaps do num_binds = std::min(MaxConcurrency(), big_count_); to keep backward compact for BIG.LITTLE

tqchen · 2020-06-24T18:37:10Z

cc @FrozenGene @junrushao1994

junrushao

LGTM. Learned something from the PR and the comments. Thanks!

) * Fix CPU Thread Binding for Multiple Sockets * Backward compatibility

Fix CPU Thread Binding for Multiple Sockets

2f75f2a

yidawang approved these changes Jun 24, 2020

View reviewed changes

tqchen added the status: need review label Jun 24, 2020

tqchen requested changes Jun 24, 2020

View reviewed changes

Backward compatibility

c03d46d

junrushao approved these changes Jun 25, 2020

View reviewed changes

tqchen approved these changes Jun 25, 2020

View reviewed changes

tqchen merged commit 524552a into apache:master Jun 25, 2020

trevor-m pushed a commit to trevor-m/tvm that referenced this pull request Jun 30, 2020

[Thread Backend]Fix CPU Thread Binding for Multiple Sockets (apache#5918

b5b8f76

) * Fix CPU Thread Binding for Multiple Sockets * Backward compatibility

zhiics pushed a commit to neo-ai/tvm that referenced this pull request Jul 2, 2020

[Thread Backend]Fix CPU Thread Binding for Multiple Sockets (apache#5918

6e4813f

) * Fix CPU Thread Binding for Multiple Sockets * Backward compatibility

ZihengJiang mentioned this pull request Sep 25, 2020

TVM v0.7 Release Note Candidate #6486

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Thread Backend]Fix CPU Thread Binding for Multiple Sockets #5918

[Thread Backend]Fix CPU Thread Binding for Multiple Sockets #5918

kevinthesun commented Jun 24, 2020 •

edited

Loading

yidawang left a comment •

edited

Loading

tqchen Jun 24, 2020

junrushao Jun 24, 2020

tqchen Jun 24, 2020 •

edited

Loading

tqchen commented Jun 24, 2020

junrushao left a comment

[Thread Backend]Fix CPU Thread Binding for Multiple Sockets #5918

[Thread Backend]Fix CPU Thread Binding for Multiple Sockets #5918

Conversation

kevinthesun commented Jun 24, 2020 • edited Loading

yidawang left a comment • edited Loading

Choose a reason for hiding this comment

tqchen Jun 24, 2020

Choose a reason for hiding this comment

junrushao Jun 24, 2020

Choose a reason for hiding this comment

tqchen Jun 24, 2020 • edited Loading

Choose a reason for hiding this comment

tqchen commented Jun 24, 2020

junrushao left a comment

Choose a reason for hiding this comment

kevinthesun commented Jun 24, 2020 •

edited

Loading

yidawang left a comment •

edited

Loading

tqchen Jun 24, 2020 •

edited

Loading