More testsystems #8

Merged
merged 5 commits into from
Aug 8, 2024

Conversation

oschulz
Contributor

@oschulz oschulz commented Aug 7, 2024

Adds

Raven and Viper login nodes seem to use the same hardware as the compute nodes. (Should I add them too, just in case?)

@carstenbauer
Owner

Raven and Viper login nodes seem to use the same hardware as the compute nodes. (Should I add them too, just in case?)

If they have the same hardware it doesn't make sense.

@carstenbauer
Owner

carstenbauer commented Aug 7, 2024

@oschulz Can you rename the Intel_i9-13900H folder → Intel_i9_13900H for consistency? Will merge afterwards.

oschulz added 3 commits August 7, 2024 16:02
Login nodes seem to use the same hardware configuration
Login nodes seem to use the same hardware configuration
@oschulz
Contributor Author

oschulz commented Aug 7, 2024

Can you rename the Intel_i9-13900H folder → Intel_i9_13900H

Whoops, that was a typo - fixed. :-)

@oschulz
Contributor Author

oschulz commented Aug 7, 2024

@carstenbauer Would you like another test system with 2x AMD EPYC 9654 96-Core, or is that CPU type covered already?

@carstenbauer
Owner

Would you like another test system with 2x AMD EPYC 9654 96-Core, or is that CPU type covered already?

Sure, why not. I don't think it's covered yet.

@oschulz
Contributor Author

oschulz commented Aug 7, 2024

Sure, why not. I don't think it's covered yet.

Ok, I'll add it to this PR.

@oschulz
Contributor Author

oschulz commented Aug 7, 2024

How should I name it (it's the node type of a "nameless" institute-internal cluster)? "Dual_AMD_EPYC-9654" or so?

@carstenbauer
Owner

How should I name it (it's the node type of a "nameless" institute-internal cluster)? "Dual_AMD_EPYC-9654" or so?

If you use an underscore instead of the hyphen I'm fine with this name 👍

@oschulz
Contributor Author

oschulz commented Aug 7, 2024

I can also add an "NVIDIA_Jetson_Orin_NX_16GB" (memory size included because the 8GB version has fewer cores)?

oschulz added 2 commits August 7, 2024 16:25
With NV Power Mode MAXN.

Note: The Jetson Orin NX 8GB version has fewer CPU cores.
@oschulz
Contributor Author

oschulz commented Aug 7, 2024

NB: The Jetson Orin NX is shown to have two sockets (lscpu says the same). I think there are two CPU "packages" on the SoC, but they (as well as the GPU cores) are definitely all connected to the same memory.

I found a block diagram for the Orin Nano and a block diagram for the AGX Orin - I didn't find one for the Orin NX, but it should be in-between, so to speak.

Hwloc.sysinfo() shows

Machine (15.03 GB)
    NUMANode (15.03 GB)
    Package L#0 P#0 
        L3 (2.0 MB)
        ...
    Package L#1 P#1 
        ...
    HostBridge 
    ...

and SysInfo.sysinfo() says

sysinfo()
Hostname: 	batlab-lxarm01
CPU(s): 	2 x Cortex-A78AE
CPU target: 	generic
Cores: 		8 (8 CPU-threads)
NUMA domains: 	1 (8 cores each)

∘ CPU 1: 
	→ 4 cores (4 CPU-threads)
	→ 1 NUMA domain
∘ CPU 2: 
	→ 4 cores (4 CPU-threads)
	→ 1 NUMA domain

So two "CPU packages" on a single NUMA domain - I guess that's kinda correct?

@carstenbauer
Owner

That's an interesting one. Please add it and I will check/debug later.

@oschulz
Contributor Author

oschulz commented Aug 7, 2024

Please add it and I will check/debug later.

Thanks - I've added it to this PR.

@carstenbauer carstenbauer merged commit 895530c into carstenbauer:main Aug 8, 2024
10 checks passed
@carstenbauer
Owner

carstenbauer commented Aug 8, 2024

FYI, now it will look like this:

julia> sysinfo()
Hostname:       NVIDIA_Jetson_Orin_NX_16GB
CPU(s):         2 x Cortex-A78AE
CPU target:     generic
Cores:          8 (8 CPU-threads)
NUMA domains:   1 (8 cores each)

∘ CPU 1: 
        → 4 cores (4 CPU-threads)
        → 1 NUMA domain (shared with CPU 2)
∘ CPU 2: 
        → 4 cores (4 CPU-threads)
        → 1 NUMA domain (shared with CPU 1)
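
The "shared with" annotation can be illustrated with a minimal sketch. This is not SysInfo.jl's actual implementation: the `CoreInfo` struct, its field names, and the `numa_sharing` helper are hypothetical, and the core layout is simply assumed from the output above (8 cores, two packages of 4 cores each, one NUMA domain).

```julia
# Hypothetical, simplified topology record: one entry per core.
# Field names are illustrative and not taken from SysInfo.jl or Hwloc.jl.
struct CoreInfo
    id::Int
    socket::Int
    numa::Int
end

# Assumed layout mirroring the Jetson Orin NX output:
# cores 1-4 on package 1, cores 5-8 on package 2, all in NUMA domain 1.
jetson = [CoreInfo(i, i <= 4 ? 1 : 2, 1) for i in 1:8]

# For each socket, list the other sockets that have cores in at least one
# of the same NUMA domains.
function numa_sharing(cores::Vector{CoreInfo})
    sockets = sort(unique(c.socket for c in cores))
    numa_of = Dict(s => Set(c.numa for c in cores if c.socket == s) for s in sockets)
    return Dict(s => [t for t in sockets if t != s && !isdisjoint(numa_of[s], numa_of[t])]
                for s in sockets)
end

sharing = numa_sharing(jetson)
for s in sort(collect(keys(sharing)))
    others = sharing[s]
    note = isempty(others) ? "" : " (shared with CPU " * join(others, ", ") * ")"
    println("CPU ", s, ": ", count(c -> c.socket == s, jetson), " cores", note)
end
# prints:
# CPU 1: 4 cores (shared with CPU 2)
# CPU 2: 4 cores (shared with CPU 1)
```

On a conventional dual-socket machine where each package has its own NUMA domain, the same helper would return empty lists, so no annotation would be printed.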

@oschulz
Contributor Author

oschulz commented Aug 8, 2024

FYI, now it will look like this

Oh, nice!

I guess with the coming generations of CPU + GPU + (part of the) RAM devices, this is probably not the last we've seen of such "unusual" situations ...

@oschulz
Contributor Author

oschulz commented Aug 8, 2024

Speaking of more reference system configurations, @williamfgc, could you maybe add Frontier? And @luraess could probably get the Lumi configuration, right?

@carstenbauer
Owner

Reminder: The instructions for how to add test systems are here. Shouldn't take more than a few minutes.

@oschulz
Contributor Author

oschulz commented Aug 8, 2024

Do you have access to a Grace-Hopper system, @carstenbauer? If not, @ludgerpaehler might be able to get that reference config.

@carstenbauer
Owner

I do have access to A LOT of systems and surely also Grace-Hopper. The thing is, I'm busy with creating/improving these packages right now and help from the community to gather the system information would be appreciated 😉

@oschulz
Contributor Author

oschulz commented Aug 8, 2024

Then let me also CC @JBlaschke, he might have access to some interesting novel system configurations as well. :-)

@carstenbauer
Owner

Side comment: This "fake system" framework is much more useful and stable than I thought. I'm even running Distributed.jl tests for ThreadPinning.jl on the fake systems. And it feels great to see the test suite pass on all of those systems :) (see https://github.com/carstenbauer/ThreadPinning.jl/actions/runs/10297349876/job/28500311017#step:6:149)

@oschulz
Contributor Author

oschulz commented Aug 8, 2024

Oh, that's a really neat capability!

2 participants