add node_status and node_scheduling_eligibility labels to client_alloc_allocated metrics #7473

gmichalec-pandora · 2020-03-24T19:28:14Z

For reporting security vulnerabilities please refer to the website.

If you have a question, prepend your issue with [question] or preferably use the nomad mailing list.

If filing a bug please include the following:

Nomad version

Output from nomad version
Nomad v0.10.4+ent

Operating system and Environment details

Debian GNU/Linux 9.9 (stretch)

Issue

The addition of node_status and node_scheduling_eligibility (#6130) is a great step, but in order for us to properly monitor cluster usage and scheduling availability, it would be incredibly useful to add these labels to the metrics for client allocated resources. For example, we have a grafana charting the % of available cpu resources in our cluster - however, we currently have no way to account for what hosts are ineligible/down in that chart. If we could get those labels added to the following metrics, it would be a huge help:

nomad_client_allocated_cpu
nomad_client_unallocated_cpu
nomad_client_allocated_memory
nomad_client_unallocated_memory
nomad_client_allocated_disk
nomad_client_unallocated_disk
nomad_client_allocated_network
nomad_client_unallocated_network

Thank you!

The text was updated successfully, but these errors were encountered:

notnoop · 2020-03-27T18:46:05Z

Thanks - this is a reasonable request and would be useful indeed to expose these for meaningful charts and alerts.

gmichalec-pandora · 2020-03-27T19:51:05Z

Just an update here - I've realized I can accomplish this task through some fancy PromQL:
sum(nomad_client_unallocated_cpu and on (host) nomad_client_uptime{node_scheduling_eligibility="eligible"})
but it still would be nice to have the labels directly on those metrics!

notnoop added theme/metrics type/enhancement labels Mar 27, 2020

Amier3 added the help-wanted We encourage community PRs for these issues! label Apr 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add node_status and node_scheduling_eligibility labels to client_alloc_allocated metrics #7473

add node_status and node_scheduling_eligibility labels to client_alloc_allocated metrics #7473

gmichalec-pandora commented Mar 24, 2020

notnoop commented Mar 27, 2020 •

edited

Loading

gmichalec-pandora commented Mar 27, 2020

add node_status and node_scheduling_eligibility labels to client_alloc_allocated metrics #7473

add node_status and node_scheduling_eligibility labels to client_alloc_allocated metrics #7473

Comments

gmichalec-pandora commented Mar 24, 2020

Nomad version

Operating system and Environment details

Issue

notnoop commented Mar 27, 2020 • edited Loading

gmichalec-pandora commented Mar 27, 2020

notnoop commented Mar 27, 2020 •

edited

Loading