
Adding fix for host.name field update based on k8s.node.name #37

Merged: 1 commit into main on Oct 25, 2024

Conversation

gizas (Contributor) commented Oct 24, 2024

Adds the needed resource/hostname processor, which updates the host.name field based on k8s.node.name.

Before: (screenshot)

After: (screenshot)

Closes: https://github.com/elastic/opentelemetry-dev/issues/516
Found in: https://github.com/elastic/opentelemetry-dev/issues/559

rogercoll (Contributor) commented:

@lahsivjar @AlexanderWert We first thought that this would be fixed with the elastictrace processor + the corresponding enrichment: elastic/opentelemetry-lib#108

The issue is that the processor only enriches traces, meaning we cannot use it in metrics/logs OTLP pipelines. The configuration change in this PR instead uses the resource processor, which applies the mapping for any signal type.

Comment on lines +98 to +102
resource/hostname:
  attributes:
    - key: host.name
      from_attribute: k8s.node.name
      action: upsert
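For context, the processor defined above only takes effect once it is referenced in the service pipelines. A minimal sketch of that wiring follows; the receiver and exporter names here are placeholders, not taken from this PR:

```yaml
# Sketch: referencing resource/hostname in every signal pipeline so the
# host.name <- k8s.node.name mapping applies to traces, metrics, and logs.
# Receiver/exporter names (otlp, otlphttp) are assumed for illustration.
service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [k8sattributes, resource/hostname]
      exporters: [otlphttp]
    metrics:
      receivers: [otlp]
      processors: [k8sattributes, resource/hostname]
      exporters: [otlphttp]
    logs:
      receivers: [otlp]
      processors: [k8sattributes, resource/hostname]
      exporters: [otlphttp]
```

Because the resource processor is signal-agnostic, the same processor entry can be shared by all three pipelines, unlike a traces-only enrichment.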
Member:

Is this something the k8sattributes processor should do by default? If so, can we propose these changes upstream?

Contributor:

That could be an option, similar to the override functionality in the resource detection processor (e.g. overriding the host.name value with the cloud instance ID). But I have two concerns:

  • Upstream agreement with host.name == k8s.node.name: it looks like a custom workaround to fix the Infrastructure UI relationships. From the container's perspective, it has its own host.name value, which is actually used within the cluster for communication.
  • OTLP data that already contains k8s resource attributes won't be processed by the k8sattributes processor.

Member:

> Upstream agreement with host.name == k8s.node.name: looks like a custom workaround to fix the Infrastructure UI relationships. From the container perspective, it has its own host.name value, which is actually used among the cluster for communication.

IMHO, that is a symptom of host.name being underspecified in SemConv. In particular, there is no clear statement about the semantics of host.name in containerized environments; the spec only says "... On Unix systems, it may contain what the hostname command returns, or the fully qualified hostname ...". For containerized systems that means SDKs will report that value, which from the perspective within a container is by default the pod name. I'm not sure whether that behavior is really intended by SemConv or just not thought through well enough. The consequence is that host.name is not a reliable attribute, so I don't think this is an Elastic-specific challenge.

gizas (Contributor, Author):

I guess we need to understand all the scenarios of host.name population across different k8s setups. I created an internal issue to summarise all of the above.

Member:

> The consequence is, that host.name is not a reliable attribute with that. So I don't think it's an Elastic-specific challenge.

I agree. That's why I'm skeptical that we should fix it with Elastic-specific configuration. Maybe we should embrace that the data could be messy and adjust the UI accordingly. For example, exclude metrics in the hosts view that have a kubernetes.pod.uid.

I'm fine with this as a short-term workaround but I feel like we're adding a lot of these fixes to the configuration that aren't or shouldn't be Elastic-specific.

Contributor:

> Maybe we should embrace that the data could be messy and adjust the UI accordingly. For example, exclude metrics in the hosts view that have a kubernetes.pod.uid.

+1. Or, as @lahsivjar was suggesting, change the UI logic to rely on k8s.node.name and then fall back to host.name.

gizas merged commit 038b337 into main on Oct 25, 2024 (1 check passed).
gizas deleted the otel_hostname_fix branch on October 25, 2024, 08:07.