Extend cpu_count() API to get sockets and NUMA nodes count #1392

s-m-e · 2019-01-19T14:18:53Z

I could not find anything related in the documentation. I am currently using a "hack" that works on some Intel-based systems: messing around with temperature sensors. It really is not clean ...

import psutil

# Count the per-package "Physical id N" sensors reported by the coretemp driver.
len([
    None for sensor in psutil.sensors_temperatures()['coretemp']
    if 'Physical id' in sensor.label
    ])

EDIT: The above is working on Linux x86_64 for 4.4, 4.10 and 4.15 kernels.

giampaolo · 2019-02-17T18:38:42Z

Yes, check out psutil.cpu_count().

s-m-e · 2019-03-01T19:09:51Z

@giampaolo Maybe I did not make myself clear: psutil.cpu_count() gives the number of CPU cores. I am interested in the number of actual CPUs (not their cores), equivalent to the number of sockets. This is interesting when running on servers with for instance two (or more) CPUs (i.e. in two sockets). In this case, psutil.cpu_count() will return the combined core count of both, without any information on how many CPUs the system actually has and how many cores each individual CPU has.

giampaolo · 2019-03-02T14:14:12Z

Ah, you mean the number of physical sockets. Generally speaking, one wants cpu_count(logical=True), which is the same as os.cpu_count() and includes hyper-threaded CPUs. That is generally useful in a multiprocessing app. psutil goes a bit further and offers cpu_count(logical=False), which returns the number of physical cores. The number of physical sockets is not implemented, basically because it's a rare use case.

s-m-e · 2019-03-02T18:58:57Z

@giampaolo Sorry for the confusion and thanks for your reply. A lot of servers, cloud or otherwise, tend to have more than one socket - and the bandwidth between the sockets is usually significantly lower than between the cores of one individual CPU. This is why you want to avoid spreading certain parallel workloads across multiple sockets. Having this information from psutil would be incredibly helpful :)
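
Until something like this exists in psutil, the socket count can be approximated on Linux by reading the CPU topology files in sysfs. The following is only a sketch under that assumption: count_sockets() and its cpu_root parameter are made-up names, not psutil API, and the package id may not be meaningful on every architecture or inside virtual machines.

```python
import glob
import os


def count_sockets(cpu_root="/sys/devices/system/cpu"):
    """Return the number of distinct CPU packages (sockets), or None if unknown."""
    pattern = os.path.join(cpu_root, "cpu[0-9]*", "topology", "physical_package_id")
    package_ids = set()
    for path in glob.glob(pattern):
        try:
            with open(path) as f:
                package_ids.add(f.read().strip())
        except OSError:
            continue  # CPU went offline or the file is unreadable
    # An empty set means the topology files were not found (e.g. not Linux).
    return len(package_ids) or None


if __name__ == "__main__":
    print(count_sockets())
```

Grouping logical CPUs by the same package id would also give the per-socket CPU lists needed to keep a workload on one socket.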

giampaolo · 2020-10-17T03:12:59Z

Re-opening given the recent discussion at #1727. It turns out that knowing the number of CPU sockets is desirable after all. The point is how to expose this in terms of API. Alex @amanusk suggests: #1727 (comment)
Also, we have another possible API addition re. the number of NUMA nodes (#1610) which should probably be taken into account in terms of API design.

giampaolo · 2020-10-17T04:26:37Z

OK, here's a bit of brainstorming. Currently psutil is able to return logical (hyper-threading) and physical cores. The goal is to provide the CPU sockets count (and possibly others). IMO, the ideal API if we were to start from scratch today would be a single function accepting a kind parameter, similar to psutil.net_connections(kind='all'). That would be simple and extensible in terms of backward compatibility. It would look like this:

# number of logical / hyper-threading CPU cores, same as os.cpu_count()
psutil.cpu_count(kind="logical")  

# number of physical cores (currently supported on all platforms except OpenBSD and NetBSD)
psutil.cpu_count(kind="cores")

# number of sockets
psutil.cpu_count(kind="sockets")  

# number of usable CPUs, aka len(os.sched_getaffinity(0)) on Linux
# or len(psutil.Process().cpu_affinity())
psutil.cpu_count(kind="usable")

# number of NUMA nodes
psutil.cpu_count(kind="numa")

(similarly to os.cpu_count(), if the value can't be determined we'll just return None)
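
Of the kinds listed above, "logical" and "usable" can already be obtained from the standard library. A minimal sketch of the "usable" case follows; usable_cpu_count() is a hypothetical helper name, and the fallback to os.cpu_count() on platforms without os.sched_getaffinity() is an assumption, not something decided in this thread.

```python
import os


def usable_cpu_count():
    """Number of CPUs the current process may run on, or None if unknown."""
    if hasattr(os, "sched_getaffinity"):
        # Available on Linux: reflects affinity restrictions such as
        # those set with taskset.
        return len(os.sched_getaffinity(0))
    # No affinity API in the stdlib here: fall back to the logical count,
    # which itself may be None.
    return os.cpu_count()


if __name__ == "__main__":
    print(usable_cpu_count())
```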

The current function signature unfortunately is:

psutil.cpu_count(logical=True)

What we MAY do in order to keep supporting logical parameter and avoid code breakage is this:

psutil.cpu_count(kind='logical', logical=None)

If the function is invoked as such we will assume the user is asking for logical cores:

>>> psutil.cpu_count()
8
>>> psutil.cpu_count(True)
DeprecationWarning('use of boolean as first parameter is deprecated, use kind="logical"')
8
>>> psutil.cpu_count(logical=True)
DeprecationWarning('"logical" parameter is deprecated, use kind="logical"')
8

If the function is invoked as such we will assume the user is asking for physical cores:

>>> psutil.cpu_count(False)
DeprecationWarning('use of boolean as first parameter is deprecated; use kind="cores"')
4
>>> psutil.cpu_count(logical=False)
DeprecationWarning('"logical" parameter is deprecated; use kind="cores"')
4
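
The compatibility behaviour described above could be sketched roughly as follows. This is not psutil code: the _COUNTS table is a placeholder standing in for the real platform back ends, and raising ValueError for an unknown kind is an assumption of this sketch.

```python
import warnings

KINDS = ("logical", "cores", "sockets", "usable", "numa")

# Placeholder values matching the session above; a real back end would query the OS.
_COUNTS = {"logical": 8, "cores": 4}


def cpu_count(kind="logical", logical=None):
    """Sketch of the proposed signature; `kind` replaces the boolean `logical`."""
    if isinstance(kind, bool):
        # Old positional call: cpu_count(True) / cpu_count(False).
        kind = "logical" if kind else "cores"
        warnings.warn(
            'use of boolean as first parameter is deprecated; use kind="%s"' % kind,
            DeprecationWarning,
            stacklevel=2,
        )
    elif logical is not None:
        # Old keyword call: cpu_count(logical=True) / cpu_count(logical=False).
        kind = "logical" if logical else "cores"
        warnings.warn(
            '"logical" parameter is deprecated; use kind="%s"' % kind,
            DeprecationWarning,
            stacklevel=2,
        )
    if kind not in KINDS:
        raise ValueError("invalid kind %r" % (kind,))
    # Kinds that cannot be determined yield None, mirroring os.cpu_count().
    return _COUNTS.get(kind)
```

The bool check has to come first because True and False would otherwise be treated as (invalid) kind values.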

santagada · 2021-01-08T23:30:25Z

I think calling all that cpu just makes the code more confusing, why not: numa_count(), group_count(), socket_count(), cpu_count(group=, numa=) ?

Then you can get the number of logical/physical cpus in a group or numa node and also the numa and group count. For windows you still need to know in which group is each numa node so something like numa_group(numa=) is also needed.

A NUMA node has a group and an affinity mask within that group. (On CPUs with more than 64 logical cores in the same NUMA node, Windows actually creates virtual NUMA nodes for them.) The mask is needed because machines with fewer than 64 total logical CPUs but more than one NUMA node will sometimes get their CPUs as different affinity masks on the same group.

E.g. a 2-socket machine with two 20-logical-core dies will get 2 NUMA nodes, 1 group, and an affinity mask of the first 20 logical cores for NUMA node 1 and the rest for NUMA node 2.
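
To make the example concrete, here is the arithmetic for those two masks as a small sketch. range_mask() is a hypothetical helper, and the node numbers 1 and 2 simply follow the wording above.

```python
def range_mask(first_cpu, count):
    """Affinity mask with `count` consecutive bits set, starting at bit `first_cpu`."""
    return ((1 << count) - 1) << first_cpu


# NUMA node number -> (processor group, affinity mask within that group)
numa_nodes = {
    1: (0, range_mask(0, 20)),   # logical CPUs 0-19
    2: (0, range_mask(20, 20)),  # logical CPUs 20-39
}

for node, (group, mask) in numa_nodes.items():
    print("NUMA node %d: group %d, mask %#x" % (node, group, mask))
```

Both nodes share group 0, so the mask is the only thing that tells them apart: 0xFFFFF for the first node and 0xFFFFF00000 for the second.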

giampaolo · 2021-01-09T00:59:53Z

@santagada

> I think calling all that cpu just makes the code more confusing, why not: numa_count(), group_count(), socket_count(), cpu_count(group=, numa=)?

Barring a few exceptions, all APIs start with cpu_*, disk_*, net_*, sensors_*, ... prefix, so keeping the cpu_* part is convenient in that sense (+ we won't have to deprecate cpu_count()).

> Then you can get the number of logical/physical cpus in a group or numa node and also the numa and group count. For windows you still need to know in which group is each numa node so something like numa_group(numa=) is also needed.

Mmm, that complicates things quite a bit. I'm not sure how to express that in terms of API. To my understanding, and judging from lscpu output on Linux, we have 2 kinds of info: the number of NUMA nodes and which CPUs are in each node (what you call "groups" I suppose), e.g.:

$ lscpu
....
NUMA node0 CPU(s): 0,2,4,6,8,10,12,14
NUMA node1 CPU(s): 1,3,5,7,9,11,13,15

Is Windows different?
Even on Linux, though, I'm not sure how to express that in terms of API because we're dealing with 2 different types (int and list). That suggests a separate function would perhaps be more appropriate. Maybe:

>>> psutil.cpu_count(kind="numa_nodes")  # maybe not necessary at all?
2
>>> psutil.cpu_numa_nodes()
{0: [0, 2, 4, 6, 8, 10, 12, 14], 1: [1, 3, 5, 7, 9, 11, 13, 15]}
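
For illustration, on Linux a mapping of this shape could be built from the per-node cpulist files in sysfs. This is a rough sketch, not a proposed implementation: the function name mirrors the hypothetical one above, and parse_cpulist() and the node_root parameter are names made up for this example.

```python
import glob
import os
import re


def parse_cpulist(text):
    """Expand a kernel CPU list such as '0-3,8,10-11' into a list of ints."""
    cpus = []
    for part in text.strip().split(","):
        if not part:
            continue  # empty list, e.g. a memory-only node
        first, _, last = part.partition("-")
        cpus.extend(range(int(first), int(last or first) + 1))
    return cpus


def cpu_numa_nodes(node_root="/sys/devices/system/node"):
    """Map NUMA node number -> list of CPU numbers; empty dict if unavailable."""
    nodes = {}
    for path in glob.glob(os.path.join(node_root, "node[0-9]*", "cpulist")):
        match = re.search(r"node(\d+)$", os.path.dirname(path))
        if match is None:
            continue
        with open(path) as f:
            nodes[int(match.group(1))] = parse_cpulist(f.read())
    return nodes
```

With such a mapping available, the node count is just len(cpu_numa_nodes()), which is one argument for the separate kind="numa_nodes" value not being strictly necessary.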

CC-ing @amanusk just in case he wants to chime in.

dbwiddis · 2021-01-18T17:56:44Z

> Is Windows different?

Windows has the additional complication of processor groups, which complicate the node numbering. You can have processor numbers 0-63 on group 0, and 0-63 on group 1, for example. When combining with NUMA nodes, the numbering for OS lookup in the counters, etc. is tied to the NUMA node, not the processor number, so each of NUMA nodes 0 thru 3 would have processors 0-31, for example.

See oshi/oshi#1373 for some background.

The canonical (?) enumeration in Windows is GetLogicalProcessorInformationEx(), which returns an array of SYSTEM_LOGICAL_PROCESSOR_INFORMATION_EX structures connecting the processors, processor groups, and NUMA nodes. The processor "numbering" is via a 64-bit bitmask (per processor group) and is not guaranteed to be consecutive (e.g., a 96-core system would have 0-47 in group 0 and 0-47 in group 1), which may or may not match the NUMA node numbering.
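
A small sketch of what that means for an API consumer: decoding per-group masks gives processor numbers that repeat across groups. The (group, mask) pairs below are written out by hand for the 96-core example rather than obtained from the Windows API, and mask_to_cpus() is a hypothetical helper.

```python
def mask_to_cpus(mask):
    """Return the bit positions set in a 64-bit affinity mask, lowest first."""
    return [bit for bit in range(64) if (mask >> bit) & 1]


# (processor group, affinity mask) pairs for the 96-core example:
# 48 processors in each of two groups.
groups = [(0, (1 << 48) - 1), (1, (1 << 48) - 1)]

# A processor is only identified unambiguously by (group, number within group).
processors = [(group, cpu) for group, mask in groups for cpu in mask_to_cpus(mask)]

print(len(processors), processors[47], processors[48])
```

Numbers 0-47 appear once per group, so a flat list of ints like the Linux one cannot represent this layout without an extra group field.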

ReubenM · 2022-09-16T16:39:20Z

> Is Windows different? Even on Linux, though, I'm not sure how to express that in terms of API because we're dealing with 2 different types (int and list). That suggests a separate function would perhaps be more appropriate. Maybe:
>
>     >>> psutil.cpu_count(kind="numa_nodes")  # maybe not necessary at all?
>     2
>     >>> psutil.cpu_numa_nodes()
>     {0: [0, 2, 4, 6, 8, 10, 12, 14], 1: [1, 3, 5, 7, 9, 11, 13, 15]}

I think it would be more useful to have a more generic psutil.cpu_attributes that would allow more than just NUMA attributes to be associated with each processing unit. You could use the same "kind" argument for it as well, to specify the structural level at which the processing unit exists, since the attributes describing a physical socket will be different than those for a logical core, for example. This would allow exposing quite a bit of the useful info that one finds in lscpu output.

At that point psutil.cpu_count(kind='foo') basically turns into len(psutil.cpu_attributes(kind='foo')).
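
As a rough illustration of that relationship, with a hard-coded topology standing in for real detection (every name and attribute key here is hypothetical, not existing psutil API):

```python
# Hypothetical, hard-coded topology for a 1-socket, 2-core, 4-thread machine.
_TOPOLOGY = {
    "sockets": [{"id": 0, "numa_node": 0}],
    "cores": [{"id": 0, "socket": 0}, {"id": 1, "socket": 0}],
    "logical": [
        {"id": 0, "core": 0},
        {"id": 1, "core": 0},
        {"id": 2, "core": 1},
        {"id": 3, "core": 1},
    ],
}


def cpu_attributes(kind):
    """Return one attribute dict per unit at the given structural level."""
    return list(_TOPOLOGY[kind])


def cpu_count(kind="logical"):
    """The count falls out of the attribute list."""
    return len(cpu_attributes(kind))
```

Each level carries its own set of keys, which is the point of making the attributes depend on kind.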

I wanted to add that if NUMA support is added, please also include attributes for network interfaces to indicate which NUMA node they are attached to. On Linux this is found in /sys/class/net/${interface_name}/device/numa_node or /sys/devices/${pci_domain_bus_slot_path}/numa_node.
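
Reading the first of those paths takes only a few lines. The sketch below assumes Linux; nic_numa_node() and its net_root parameter are hypothetical names, and treating a value of -1 as "unknown" is a choice made for this example.

```python
import os


def nic_numa_node(interface, net_root="/sys/class/net"):
    """Return the NUMA node a network interface is attached to, or None."""
    path = os.path.join(net_root, interface, "device", "numa_node")
    try:
        with open(path) as f:
            node = int(f.read().strip())
    except (OSError, ValueError):
        # Virtual interface without a backing device, or not a Linux system.
        return None
    # The kernel reports -1 when the device has no NUMA affinity.
    return node if node >= 0 else None
```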

IritKoll · 2023-05-15T12:49:31Z

Hi, has there been any progress on this in recent psutil releases?

giampaolo · 2024-11-12T19:07:17Z

Use cases from Stackoverflow:

Knowing the number of sockets could potentially allow more efficient planning on NUMA vs. non-NUMA systems. Also, the licensing of many products such as Windows Server, MS SQL Server, Oracle, VMware and many more depends on the number of sockets. Hope this gives enough reasons? So maybe, if it is not hard to add, you can include this feature in some upcoming release.

A NUMA related issue is that sockets these days contain resources other than cpus, for example a data compression accelerator per socket. A software process may schedule its tasks on a particular socket for performance optimization

I am trying to develop a platform-independent performance monitoring framework. Getting the socket count will increase the accuracy of various parameters. It'd be appreciated if getting the number of sockets is added to psutil or a similar Python library.

giampaolo closed this as completed Feb 17, 2019

amanusk mentioned this issue Nov 13, 2019

[linux] cpu_count_physical is not correct #1620

Closed

giampaolo reopened this Oct 17, 2020

giampaolo added the new-api label Oct 17, 2020

giampaolo mentioned this issue Dec 19, 2020

[PROPOSAL] psutil.cpu_info() (extended CPU information) #1894

Open

giampaolo mentioned this issue Dec 29, 2020

Return number of numa nodes #1610

Closed

giampaolo added the enhancement label Dec 29, 2020

giampaolo changed the title ~~Is there a reliable way to get the number of physical CPUs (i.e. sockets)?~~ Extend cpu_count() API to get sockets and NUMA nodes count Dec 29, 2020

giampaolo mentioned this issue Oct 18, 2021

Document policy on backwards-incompatible changes? #2002

Open

giampaolo mentioned this issue Dec 16, 2021

psutil.cpu_count: Add argument(s) to allow differentiating "performance cores" from "efficiency cores" #2034

Open
