Define messaging metrics and add `error.type` attribute to spans #163

lmolkova · 2023-07-05T22:28:50Z

Fixes open-telemetry/opentelemetry-specification#1014

pyohannes

Thanks for putting a stake in the ground here, this is a great start.

specification/messaging/messaging-metrics.md

semantic_conventions/metrics/messaging.yaml

docs/messaging/messaging-metrics.md

cyrille-leclerc · 2023-08-07T12:43:40Z

I love the duration metrics.
FYI, related to this, I'm brainstorming with @jpkrohling and others on adopting OTel Semantic Conventions metrics in the OTel Collector Service Graph Connector, and we are looking at standard metrics for client/producer and server/consumer durations.

Did we consider stronger consistency with the existing http.server.duration, http.client.duration, and rpc.{client, server}.duration metrics, which are aligned with SpanKind={server, client, producer, consumer, internal}, and look at messaging metrics like:

- Preferring the name messaging.producer.duration over messaging.publish.duration for the definition "Measures the duration of publish operation."
- Introducing messaging.consumer.duration with the definition "Measures the duration to consume messages."
  - This would be consistent with http.server.duration: Measures the duration of inbound HTTP requests.

pyohannes · 2023-08-08T15:36:40Z

> Preferring name messaging.producer.duration over messaging.publish.duration for the definition "Measures the duration of publish operation."
>
> Introducing messaging.consumer.duration with the definition "Measures the duration to consume messages."
>
> This would be consistent with http.server.duration: Measures the duration of inbound HTTP requests.

On the messaging side, the current metric names follow the messaging-specific operation names. For the consumer side, I definitely see value in having separate consumer metrics for pull-based (receive) and push-based (deliver) scenarios. Pull and push durations aren't semantically consistent: the push duration usually also includes the time spent processing the message, whereas the pull duration doesn't. We shouldn't mix both in a single metric.
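
To make the receive/deliver distinction concrete, here is a minimal sketch, not taken from the PR. It uses a fake clock and an in-memory dictionary instead of a real metrics SDK. The metric names `messaging.receive.duration` and `messaging.deliver.duration` are assumed from the operation names mentioned above, and `FakeClock`, `pull_message`, and `push_message` are hypothetical helpers.

```python
from collections import defaultdict


class FakeClock:
    """Deterministic clock so the example does not depend on real timing."""

    def __init__(self):
        self.now = 0.0

    def advance(self, seconds):
        self.now += seconds


# Metric name -> recorded durations in seconds; stands in for a histogram.
recorded = defaultdict(list)


def pull_message(clock, fetch, process):
    """Pull-based consumer: only the fetch call is timed."""
    start = clock.now
    message = fetch()
    recorded["messaging.receive.duration"].append(clock.now - start)
    process(message)  # processing happens outside the measured interval
    return message


def push_message(clock, message, callback):
    """Push-based consumer: the application callback runs inside the timing."""
    start = clock.now
    callback(message)
    recorded["messaging.deliver.duration"].append(clock.now - start)


clock = FakeClock()


def fetch():
    clock.advance(0.25)  # assumed round trip to the broker
    return "order-1"


def process(message):
    clock.advance(2.0)  # assumed application work


pull_message(clock, fetch, process)
push_message(clock, "order-2", process)
```

With the same processing cost, the pull measurement covers only the fetch while the push measurement covers the callback, so the two series would not be comparable if they shared one metric.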

docs/messaging/messaging-metrics.md

pyohannes · 2023-11-09T11:38:59Z

For spans, messaging systems add system-specific attributes to the spans.

Do we want to treat messaging system specific metric dimensions the same way, so that messaging systems extend existing metrics? Or do we require them to send different system-specific metrics?

The first approach makes cardinality hard to control (imagine a single service using two different messaging systems); the second will likely duplicate information.
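
As a rough illustration of the cardinality concern, the sketch below counts an upper bound on time series for one metric when each system adds its own dimension. All attribute values are made up, and the two system-specific attribute names are illustrative assumptions, not definitions from this PR.

```python
def series_count(dimensions):
    """Upper bound on time series: product of distinct values per attribute."""
    count = 1
    for values in dimensions.values():
        count *= len(values)
    return count


# Attributes shared by every messaging system (values are invented).
shared = {
    "messaging.destination.name": ["orders", "payments", "audit"],
    "error.type": [None, "timeout"],
}

# Hypothetical system-specific dimensions added on top of the shared ones.
per_system_extra = {
    "kafka": {"example.kafka.partition": list(range(12))},
    "servicebus": {"example.servicebus.subscription": ["s1", "s2", "s3", "s4"]},
}

# Generic attributes only: one set of series per system.
generic_total = sum(series_count(shared) for _ in per_system_extra)

# Generic metric extended with each system's own dimension.
extended_total = sum(
    series_count({**shared, **extra}) for extra in per_system_extra.values()
)
```

Under these assumptions the generic metric yields at most 12 series, while the extended one yields up to 96, and the growth depends entirely on which dimensions each system chooses to add.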

docs/messaging/messaging-metrics.md

lmolkova · 2023-11-22T22:31:10Z

> Do we want to treat messaging system specific metric dimensions the same way, so that messaging systems extend existing metrics? Or do we require them to send different system-specific metrics?

I think there are a couple of options here:

1. Allow extending generic metrics.
   - I've created #553 to follow up and will prototype what it looks like for one of the Azure SDKs.
   - The only downside of this approach is that applications that see metrics from multiple messaging systems will need slightly different dashboards/alerts/queries for them.
2. Make every system come up with its own set of additional metrics that would sometimes overlap with the generic ones.
   - We should probably change this PR to just describe how to create custom messaging metrics semconv and not attempt to define generic metrics.
   - The downside is that every system would need to define something custom (even if it's just a metric name).

I suggest starting with Option 1. As messaging semconv progresses towards stability, we should get more feedback from messaging systems and instrumentation prototypes and can change the approach.
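
A small sketch of what Option 1 could look like from the query side, under stated assumptions: data points are plain tuples rather than SDK objects, the `example.*` attribute names are hypothetical, and `count_by` is an illustrative helper rather than any real query API.

```python
from collections import Counter

# (metric name, attributes, duration in seconds); all values are invented.
points = [
    ("messaging.publish.duration",
     {"messaging.system": "kafka", "example.kafka.partition": 0}, 0.010),
    ("messaging.publish.duration",
     {"messaging.system": "kafka", "example.kafka.partition": 1}, 0.020),
    ("messaging.publish.duration",
     {"messaging.system": "servicebus", "example.servicebus.entity": "orders"},
     0.030),
]


def count_by(points, metric, key):
    """Count data points of one metric grouped by an attribute.

    Points that lack the attribute are grouped under None.
    """
    return Counter(
        attrs.get(key) for name, attrs, _value in points if name == metric
    )


# Works across systems because the metric name and messaging.system are shared.
by_system = count_by(points, "messaging.publish.duration", "messaging.system")

# Only meaningful for Kafka points; the Service Bus point falls under None.
by_partition = count_by(
    points, "messaging.publish.duration", "example.kafka.partition"
)
```

Grouping by the shared attribute works for every system, while grouping by a system-specific attribute leaves the other system's points under `None`, which is the "slightly different dashboards/alerts/queries" downside noted for Option 1.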

pyohannes

There are still some details to clarify, but this provides a great starting point.

joaopgrassi

Overall looks like a good start! Left some non-blocking comments.

docs/messaging/messaging-metrics.md

lmolkova · 2023-11-27T17:14:21Z

@open-telemetry/specs-semconv-approvers this PR is approved by the messaging WG members, please take a look

lmolkova force-pushed the messaging-metrics branch from cdbdf57 to 211bf76 Compare July 5, 2023 22:29

pyohannes reviewed Jul 25, 2023

trask reviewed Jul 25, 2023

lmolkova force-pushed the messaging-metrics branch 3 times, most recently from 11af3e7 to 4dcbade Compare July 26, 2023 20:02

AndrewJSchofield reviewed Jul 27, 2023

pyohannes linked an issue Jul 27, 2023 that may be closed by this pull request

Define metric semantic conventions for messaging systems open-telemetry/opentelemetry-specification#1014

Closed

lmolkova mentioned this pull request Jul 29, 2023

Messaging: prefetch scenario observability #218

Open

lmolkova force-pushed the messaging-metrics branch from 6bdad37 to fec893f Compare July 29, 2023 00:47

lmolkova mentioned this pull request Aug 16, 2023

Messaging: design transaction instrumentation #264

Open

lmolkova force-pushed the messaging-metrics branch from fec893f to 2baa64d Compare August 16, 2023 22:50

trask mentioned this pull request Aug 17, 2023

Analyze naming conventions for (monotonic) counter metrics #260

Open

lmolkova force-pushed the messaging-metrics branch from 2baa64d to 034ff58 Compare August 23, 2023 22:52

lmolkova mentioned this pull request Aug 25, 2023

Include OpenTelemetry metrics plugin into OTel agent Azure/azure-sdk-for-java#36537

Open

joaopgrassi mentioned this pull request Sep 11, 2023

Metric definitions for message/event networking (broker metrics) #120

Open

lmolkova force-pushed the messaging-metrics branch from 034ff58 to d4aefb4 Compare September 18, 2023 05:02

lmolkova marked this pull request as ready for review September 18, 2023 05:15

lmolkova requested review from a team September 18, 2023 05:15

github-actions bot assigned jsuereth Sep 18, 2023

oleksii-valuiskyi mentioned this pull request Sep 20, 2023

Multiple confluent-kafka instrumentation issues open-telemetry/opentelemetry-python-contrib#1962

Open

pyohannes mentioned this pull request Sep 27, 2023

BREAKING: Remove server.socket.address attribute from http and rpc metrics #350

Merged

3 tasks

lmolkova mentioned this pull request Oct 23, 2023

[FEATURE REQ] Implement messaging metrics Azure/azure-sdk-for-net#39450

Open

lmolkova force-pushed the messaging-metrics branch from a1ec11b to adc9624 Compare October 30, 2023 16:37

lmolkova force-pushed the messaging-metrics branch from adc9624 to 6ae1dab Compare November 9, 2023 02:09

lmolkova changed the title ~~Messaging metrics~~ Define messaging metrics and add error.type attribute to spans Nov 9, 2023

pyohannes reviewed Nov 9, 2023

AlexanderWert reviewed Nov 11, 2023

lmolkova force-pushed the messaging-metrics branch 2 times, most recently from 5b1cc90 to 93112d0 Compare November 15, 2023 23:59

lmolkova mentioned this pull request Nov 22, 2023

Messaging metrics: extending general messaging metrics for individual systems #553

Open

pyohannes approved these changes Nov 23, 2023

joaopgrassi approved these changes Nov 27, 2023

lmolkova added 7 commits November 27, 2023 08:37

messaging metrics

138df72

up

1081176

up

63bc708

more nits

dc2b44f

toc

a15b775

review

85f2e06

rebase and feedback

14c18bf

lmolkova force-pushed the messaging-metrics branch from 93112d0 to 14c18bf Compare November 27, 2023 16:41

carlosalberto approved these changes Nov 27, 2023

jsuereth approved these changes Nov 28, 2023

lmolkova mentioned this pull request Nov 28, 2023

Add Kafka Component dotnet/aspire#951

Merged

Merge branch 'main' into messaging-metrics

f794a15

joaopgrassi merged commit f51df2f into open-telemetry:main Nov 30, 2023
9 checks passed

pyohannes pushed a commit to pyohannes/semantic-conventions that referenced this pull request Jan 17, 2024

Define messaging metrics and add error.type attribute to spans (ope…

1e71340

…n-telemetry#163)
