High vmcompute.exe CPU due to frequent HcsGetComputeSystemProperties calls. #989
Comments
@zoucheng2018 Can you answer a few questions here? Do you have the Container Insights add-on enabled for your cluster (or any other monitoring solution)? Thanks!
Also: what version of Kubernetes? And what was the ETW command you ran?
Do you have the Container Insights add-on enabled for your cluster (or any other monitoring solution)? What node size are you seeing this on? How many containers are running on the node? Are limits/requests configured for the pods, and how densely packed is the node? Also: what version of Kubernetes? And what was the ETW command you ran? Perfview command to collect CPU trace: RPC Trace:
An update here (and sorry for the delay): we've found that the OS isn't as optimized as it could be for returning some memory statistics. I have a change in #1362 that speeds things up a bit, although I'll let @marosset or @jsturtevant speak to the Container Insights extension, as I don't know how much extra query volume it adds. I'm hoping we can get that change out into AKS in the next month, but that's the optimist in me haha.
We've helped fix the readiness probe in Container Insights over the last several months, which should help the performance of the container that runs on Windows. We also made two perf improvements to kubelet in 1.23 that would reduce overall CPU usage as well: kubernetes/kubernetes#105744 and kubernetes/kubernetes#104287
For my repro, disabling Container Insights seems to have helped, although we had other changes in the cluster and still need more time to confirm whether Container Insights specifically is the cause. Looking forward to a fix so we can re-enable Container Insights.
vmcompute.exe is taking up to 25% of a core on our AKS managed nodes. CPU profiling data shows this is mostly caused by vmcompute!HcsRpc_GetSystemProperties. The image below contains the detailed stack:
RPC ETW traces indicate the calls are made from kubelet.exe; one example trace shows a call every 500ms. This API is quite expensive inside vmcompute.exe. Can we tune the frequency or use alternative APIs?