You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We are seeing the error " failed to translate metric" in our logs for prometheus exporter for some of our metrics.
We are collecting metrics from the otel agent +libraries from a bunch of springboot, nodejs and python services, most of these are working perfectly fine but we see this error quite frequently in our logs.
I am unable to pinpoint these errors to a specific service or runtime or version of the agent etc even after enabling debug logs.
It is happening for a few different metrics such as:
http.server.duration
http.client.request.size
http.client.duration
Steps to Reproduce
Expected Result
Metrics are written to the /metrics endpoint to be scanned by prometheus
Actual Result
Error message in the logs, data point is dropped
Collector version
0.85
Environment information
No response
OpenTelemetry Collector configuration
config:
exporters:
prometheus:
endpoint: "0.0.0.0:9464"resource_to_telemetry_conversion:
enabled: trueenable_open_metrics: truemetric_expiration: 3mextensions:
# The health_check extension is mandatory for this chart.# Without the health_check extension the collector will fail the readiness and liveliness probes.# The health_check extension can be modified, but should never be removed.health_check: {}zpages: {}pprof: {}memory_ballast:
size_in_percentage: 30processors:
memory_limiter:
check_interval: 1slimit_percentage: 50spike_limit_percentage: 20batch:
send_batch_size: 10000send_batch_max_size: 11000timeout: 2s#Delete unnecessary attributes from our metricsresource:
attributes:
- key: telemetry.sdk.nameaction: delete
- key: telemetry.sdk.versionaction: delete
- key: telemetry.sdk.languageaction: delete
- key: telemetry.auto.versionaction: delete
- key: container.idaction: delete
- key: process.command_argsaction: delete
- key: process.command_lineaction: delete
- key: process.commandaction: delete
- key: process.executable.pathaction: deletetransform:
error_mode: ignoremetric_statements:
- context: metricstatements:
- set(description, "The duration of the inbound HTTP request") where name == "http.server.duration"
- set(description, "The duration of the inbound HTTP request") where name == "http.client.duration"
- set(description, "The current number of threads having NEW state") where name == "jvm.threads.states"
- set(description, "The number of concurrent HTTP requests that are currently in-flight") where name == "http.server.active_requests"
- set(description, "") where name == "http.server.requests"
- set(description, "") where name == "http.server.requests.max"
- set(description, "Number of log events that were enabled by the effective log level") where name == "logback.events"
- set(description, "") where name == "spring.data.repository.invocations.max"
- set(description, "Duration of repository invocations") where name == "spring.data.repository.invocations"
- set(description, "Time taken for the application to be ready to service requests") where name == "application.ready.time"
- set(description, "Time taken (ms) to start the application") where name == "application.started.time"
- set(description, "The size of HTTP request messages") where name == "http.client.request.size_bytes"
- set(description, "The size of HTTP response messages") where name == "http.client.response.size_bytes"k8sattributes:
extract:
metadata:
- k8s.namespace.name
- k8s.deployment.name
- k8s.statefulset.name
- k8s.daemonset.name
- k8s.cronjob.name
- k8s.job.name
- k8s.pod.name
- k8s.namespace.name
- k8s.node.namereceivers:
jaeger: nullprometheus: nullzipkin: nullotlp:
protocols:
grpc:
endpoint: 0.0.0.0:4317http:
endpoint: 0.0.0.0:4318service:
telemetry:
#logs for the collector itselflogs:
level: infoextensions:
- health_check
- memory_ballastpipelines:
traces:
exporters:
- spanmetricsprocessors:
- memory_limiter
- batch
- k8sattributesreceivers:
- otlplogs: nullmetrics:
exporters:
- prometheusprocessors:
- memory_limiter
- batch
- resource
- transform
- k8sattributesreceivers:
- otlp
- spanmetricsconnectors:
spanmetrics:
namespace: span.metrics
Component(s)
exporter/prometheus
What happened?
Description
We are seeing the error " failed to translate metric" in our logs for prometheus exporter for some of our metrics.
We are collecting metrics from the otel agent +libraries from a bunch of springboot, nodejs and python services, most of these are working perfectly fine but we see this error quite frequently in our logs.
I am unable to pinpoint these errors to a specific service or runtime or version of the agent etc even after enabling debug logs.
It is happening for a few different metrics such as:
http.server.duration
http.client.request.size
http.client.duration
Steps to Reproduce
Expected Result
Metrics are written to the /metrics endpoint to be scanned by prometheus
Actual Result
Error message in the logs, data point is dropped
Collector version
0.85
Environment information
No response
OpenTelemetry Collector configuration
Log output
Additional context
No response
The text was updated successfully, but these errors were encountered: