Allocation Metrics no longer emitted #24339
Comments
Hi @Himura2la and @dosera, and thanks for raising this issue. I have been unable to reproduce this on macOS or Ubuntu 24.04 using the steps below. Could you please provide a minimal job specification that would help reproduce this, along with any other relevant information?
Example agent config:
Example job:
Example test command:
I'm afraid your validation method is not relevant to the issue, because the problem is best visible over a long-term run.
I'm seeing the same thing on four boxes that I upgraded from 1.9.0 to 1.9.1. All four machines run Debian testing. My Prometheus also scrapes Telegraf (in Prometheus export mode) on the same machines, and I'm seeing no drops in that data. The scrape interval for both is set to 1m. The boxes had no other packages upgraded at that time. More importantly, I'm not seeing CPU or memory data in the Nomad GUI in either the Task or Allocation view. I occasionally get a red line indicating what I presume is the 'current' X MiB / total memory used, but no history at all.
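For anyone who wants to check this over a longer window rather than with a one-off request, here is a minimal sketch in Python. It assumes you have already collected the agent's Prometheus-format metrics output as text and a list of sample timestamps for the series. The helper names `metric_present` and `find_gaps` and the 1.5x tolerance are hypothetical choices for illustration, not part of Nomad or Prometheus.

```python
from typing import Iterable, List, Tuple

# Metric name taken from the issue report.
METRIC = "nomad_client_allocs_memory_usage"


def metric_present(exposition_text: str, metric: str = METRIC) -> bool:
    """Return True if at least one sample line for `metric` appears in
    Prometheus text-format output (comment lines are ignored)."""
    for line in exposition_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        # The sample name ends at the first "{" (labels) or space (value).
        name = line.split("{", 1)[0].split(" ", 1)[0]
        if name == metric:
            return True
    return False


def find_gaps(
    timestamps: Iterable[float],
    interval: float,
    tolerance: float = 1.5,  # hypothetical slack factor for scrape jitter
) -> List[Tuple[float, float]]:
    """Return (start, end) pairs where consecutive samples are further
    apart than `interval * tolerance` seconds."""
    ts = sorted(timestamps)
    return [(a, b) for a, b in zip(ts, ts[1:]) if b - a > interval * tolerance]
```

With a 1m scrape interval, calling `find_gaps(sample_times, 60)` on the sample timestamps of one allocation series would list the stretches where the metric went missing, which is easier to share in a report than a screenshot.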
Nomad version
Operating system and Environment details
AlmaLinux release 9.4 (Seafoam Ocelot)
/etc/nomad/base.hcl:
Issue
After upgrading Nomad from 1.8.4 to 1.9.1, allocation metrics like nomad_client_allocs_memory_usage appear not to be properly emitted (or only for several seconds; I am using Prometheus to scrape them).
Reproduction steps
Expected Result
Allocation metrics are available.
Actual Result
Allocation metrics are available sporadically (at best). See the attached screenshot - nomad was upgraded on Oct 24th.
This is how it looks in prometheus since then: