Metric naming conventions #108

tedpennings · 2020-05-21T20:44:00Z

This proposal follows the discussion we've been having in the metrics SIG.

There are TODOs in the doc. Many sections require expertise in a specific domain; I'm very eager for discussion and suggestions.

This is written from my perspective as a UI/product engineer who consumes and presents metric data. I think this is a bit different from the perspective of an instrumentation engineer or data platform engineer. Feedback is extra welcome from y'all in those roles; I'm submitting this with the caveat that there will be mismatches between those perspectives.

tedpennings · 2020-05-21T20:52:20Z

Meta question: is the usage of ie and eg awkward or academic? I chose it because it's short.

tedpennings · 2020-05-21T20:58:15Z

I think we will want to include the measures / data types for the metrics prescribed in this document. Can someone take a pass at that? I am not as familiar with the nomenclature and options in this regard 😳

text/metrics/0001-naming-conventions.md

tedpennings · 2020-05-21T21:08:38Z

Prior work:

Excellent spreadsheet of OS metrics -- I think this came from Splunk?
Spreadsheet that I drafted a few weeks ago that's largely similar. Lots of discussion questions. No labels in this spreadsheet, which is not what we want

text/metrics/0001-naming-conventions.md

jkwatson · 2020-05-21T21:21:07Z

Before we get to specific names, I'd like to see a meta-OTEP with the guiding principles for instrument naming and measurement labeling, so we can all agree to that before we start digging into individual names. This PR briefly mentions some guiding principles, and then dives into system metrics. Separating those two concerns into separate OTEPs will, I think, make it easier to discuss them in isolation, rather than muddling the discussion of general guidelines with specifics.

Thoughts?

tedpennings · 2020-05-21T21:28:04Z

@jkwatson that's a great point re generic conventions vs specific recommendations. I found it very difficult to talk about the generic conventions without examples, which is what brought me down this path. I will give it some thought.

Ultimately I think we will want a single document -- or extensive examples in the primary document with an appendix for further details.

There's a lot to discuss with both topics, so it would definitely be easier to discuss one thing at a time.

jkwatson · 2020-05-21T21:30:56Z

That's fair. Having examples that are outside of system metrics would be helpful, then. Things like http calls, database calls, and rpc calls will help add depth to the discussion, I think.

text/metrics/0001-naming-conventions.md

jrcamp · 2020-05-21T21:52:53Z

Prior work:

Excellent spreadsheet of OS metrics -- I think this came from Splunk?

@james-bebbington from Google gets the credit for the awesome spreadsheet but we (Splunk) have been working with him on defining the set of host metrics.

bogdandrutu · 2020-05-21T22:11:02Z

@tedpennings I think we should do what @jkwatson proposed, which is to define what the naming structure looks like for metrics and for labels.

jmacd · 2020-06-18T23:59:04Z

@bogdandrutu I'm not sure if you were planning to apply the per-signal approvers changes in this repository? Either way, this has enough approvals from the metrics approvers and could be merged.

aabmass · 2020-06-30T18:26:53Z

Was there any discussion around versioning in names? I suppose this OTEP covers it implicitly since you could prefix everything, like v1.*, but is this a common enough need to add guidance on?

This could be important for any backends that have a strict schema for metrics and their labels (e.g. Google Cloud Monitoring with MetricDescriptors) to prevent breaking changes.
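As a rough illustration of the prefixing idea raised above, here is a minimal Python sketch. The helper names `versioned_name` and `split_version` and the metric names used are hypothetical, and the `vN.` prefix form is only an assumption based on the `v1.*` example in the comment; nothing here is prescribed by the OTEP.

```python
from typing import Optional, Tuple


def versioned_name(name: str, version: int) -> str:
    """Prefix a dotted metric name with a schema version (hypothetical helper)."""
    if version < 1:
        raise ValueError("version must be a positive integer")
    return f"v{version}.{name}"


def split_version(name: str) -> Tuple[Optional[int], str]:
    """Split a leading 'vN.' component off a metric name, if one is present."""
    head, sep, rest = name.partition(".")
    if sep and head.startswith("v") and head[1:].isdecimal():
        return int(head[1:]), rest
    # No version prefix: return the name unchanged.
    return None, name


# Hypothetical metric names, used only for illustration.
prefixed = versioned_name("system.cpu.usage", 1)
parsed = split_version("v2.http.server.duration")
unversioned = split_version("http.server.duration")
```

A backend with a strict schema could then treat `v1.*` and `v2.*` as distinct metrics, so a label change in `v2` would not break consumers of `v1`.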

text/metrics/0108-naming-conventions.md

yurishkuro · 2020-07-02T23:13:55Z

text/metrics/0108-naming-conventions.md

+The hierarchical structure of metrics defines the namespacing. Supporting OpenTelemetry artifacts define the metric structures and hierarchies for some categories of metrics, and these can assist decisions when creating future metrics.
+
+Common labels SHOULD be consistently named. This aids in discoverability and disambiguates similar labels to metric names.
+


Common labels SHOULD be consistently named

What does "common labels" mean? Common in what context/scope? A single service? An organization?

jmacd · 2020-07-08

I was hoping to avoid a debate over naming label keys. For example, you have a label key named "service" and have used it on some metrics, and I have a label key named "service" and have used it on some different metrics. How are we to know that those labels are not the same? The answer would be to add namespacing of labels. I recall the OpenCensus guidelines were to prefix your label names with a DNS prefix that you own. So I might have a lightstep.com/service label and you might have an uber.com/service label. For this to create a good user experience, I'd like the DNS prefix to not display by default. What would you like to see, @yurishkuro?
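The DNS-prefix idea above could be sketched as follows. This is only an illustration in Python: the function names `namespaced` and `display_key` are hypothetical, and the `domain/key` layout is assumed from the `lightstep.com/service` and `uber.com/service` examples in the comment.

```python
def namespaced(owner_domain: str, key: str) -> str:
    """Build a label key qualified by a DNS name the owner controls."""
    return f"{owner_domain}/{key}"


def display_key(label_key: str) -> str:
    """Return the short form shown to users, hiding any DNS prefix."""
    _, sep, short = label_key.rpartition("/")
    # Keys without a prefix are returned unchanged.
    return short if sep else label_key


a = namespaced("lightstep.com", "service")
b = namespaced("uber.com", "service")

# The stored keys differ, so the two labels are never confused,
# while both display as plain "service" by default.
distinct = a != b
shown = (display_key(a), display_key(b))
```

The trade-off is that two labels with the same short name can appear side by side in a UI, so a real implementation would probably need a way to reveal the full key on demand.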

justinfoote · 2020-07-13

The way I read this guidance is: if we have a label that should be added to many different categories of metric instrument, and that label's semantic meaning is the same across all those categories, its name should be consistent.

The most obvious example I can think of would be status, whose value will be a CanonicalSpanStatus.

As a user, I would find it intuitive when searching my metrics in my UI to always find the success/failure information under the same status label.

I'm not sure I understand the example service label. Would it be the name of the service being instrumented? If so, perhaps we would want some semantic conventions around how to apply Resource attributes as metric labels.

If this is the case, I'm not sure if we need to change this line. Is this guidance not clear enough? What wording would make our meaning more clear?
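One way to picture the "consistently named" reading above is a small lint-style check. The Python sketch below is an assumption-laden illustration: the instrument names, the `ALIASES` set, and the function `find_inconsistent_keys` are all hypothetical; only the `status` label comes from the comment.

```python
from typing import Dict, Set

# The label key the comment above suggests for success/failure information.
CANONICAL_KEY = "status"

# Hypothetical near-synonyms that would fragment the same information.
ALIASES: Set[str] = {"result", "outcome", "state"}


def find_inconsistent_keys(instruments: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Map each instrument to the alias keys it uses instead of CANONICAL_KEY."""
    offenders: Dict[str, Set[str]] = {}
    for name, keys in instruments.items():
        clashes = keys & ALIASES
        if clashes:
            offenders[name] = clashes
    return offenders


# Hypothetical instruments from three different metric categories.
instruments = {
    "http.server.duration": {"status", "method"},
    "db.client.duration": {"outcome", "operation"},
    "rpc.client.duration": {"status"},
}
report = find_inconsistent_keys(instruments)
```

Here only the database instrument would be flagged, because it records success/failure under `outcome` rather than under the shared `status` key.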

jmacd · 2020-07-17

I think it's safe to say we can merge this and debate this topic again as we modify the specification.

bogdandrutu · 2020-07-07T15:36:30Z

@jmacd there are some comments left open, will merge once this is done.

@tedpennings please fix all open comments.

tedpennings · 2020-07-07T21:15:53Z

@bogdandrutu will take a look later tonight, thanks!

Sorry for the delay, I had family medical things come up over the past few days. Things are better now, and I'm catching up on work.

bogdandrutu · 2020-07-11T19:28:00Z

@tedpennings friendly ping :)

justinfoote · 2020-07-13T18:10:53Z

In the metrics SIG meeting on July 9, I was volunteered to help wrap up this PR. I'm just now diving into it, but I'll try to get open questions/issues resolved before Thursday July 16.

justinfoote · 2020-07-16T15:40:47Z

@open-telemetry/technical-committee, it seems that we have broad approval on this OTEP, with five approvals so far. The only outstanding issue is with the specific definition of "common" labels.

I'd like to propose that we merge this OTEP and sort out the details in a spec at a later date.

jmacd · 2020-07-17T07:42:03Z

@open-telemetry/specs-metrics-approvers please merge.

* Proposal for metric naming conventions
* Add Node example metrics
* Node.js instead of Node
* Rename file, add Prometheus quote
* Second round of revisions
* Working group feedback
* More feedback changes
* Minor clarifications
* Word choice
* Whitespace to check CLA status
* Update text/metrics/0108-naming-conventions.md
  Co-authored-by: Tyler Yahn <[email protected]>
* Update text/metrics/0108-naming-conventions.md
  Co-authored-by: Tyler Yahn <[email protected]>
* Update text/metrics/0108-naming-conventions.md
  Co-authored-by: Tyler Yahn <[email protected]>
* Update text/metrics/0108-naming-conventions.md
  Co-authored-by: Tyler Yahn <[email protected]>
* Update text/metrics/0108-naming-conventions.md
  Co-authored-by: Tyler Yahn <[email protected]>
* Update text/metrics/0108-naming-conventions.md
  Co-authored-by: Tyler Yahn <[email protected]>
* Update text/metrics/0108-naming-conventions.md
  Co-authored-by: Tyler Yahn <[email protected]>
* Code review feedback, remove discussion section
* Remove some discussion topics, and fix an example
* Rename OTEP 108 to metric naming _guidelines_

Co-authored-by: Tyler Yahn <[email protected]>
Co-authored-by: Bogdan Drutu <[email protected]>
Co-authored-by: jfoote <[email protected]>
Co-authored-by: Yuri Shkuro <[email protected]>

tedpennings requested review from arminru, bogdandrutu, c24t, carlosalberto, iredelmeier, jmacd, reyang, SergeyKanzhelev, tedsuo, tigrannajaryan and yurishkuro as code owners May 21, 2020 20:44

Proposal for metric naming conventions

a59a7fd

tedpennings force-pushed the metric-naming-conventions branch from 81dc33c to a59a7fd Compare May 21, 2020 20:45

Add Node example metrics

cece358

jkwatson reviewed May 21, 2020

tedpennings mentioned this pull request May 21, 2020

More prescriptive guidance on metric naming open-telemetry/opentelemetry-specification#600

Closed

jkwatson reviewed May 21, 2020

jkwatson reviewed May 21, 2020

bogdandrutu reviewed May 21, 2020

Node.js instead of Node

63443ef

tedpennings force-pushed the metric-naming-conventions branch from e206fab to 63443ef Compare May 21, 2020 23:08

quentinmit reviewed May 21, 2020

jkwatson approved these changes Jun 16, 2020

aabmass reviewed Jun 18, 2020

aabmass mentioned this pull request Jun 20, 2020

Standard system metrics and semantic conventions #119

Merged

cijothomas approved these changes Jun 29, 2020

jmacd mentioned this pull request Jul 2, 2020

Make jmacd an OTEPs maintainer (need more OTEPs maintainers) open-telemetry/community#406

Closed

yurishkuro reviewed Jul 2, 2020

jmacd mentioned this pull request Jul 8, 2020

Adapt semantic conventions for the span name of messaging systems open-telemetry/opentelemetry-specification#690

Merged

Merge branch 'master' into metric-naming-conventions

1d61a72

bogdandrutu requested review from a team July 10, 2020 15:07

Rename OTEP 108 to metric naming _guidelines_

8b88b36

aabmass mentioned this pull request Jul 14, 2020

Exporting metrics to backends with strict schemas open-telemetry/opentelemetry-specification#703

Closed

yurishkuro approved these changes Jul 16, 2020

Merge branch 'master' into metric-naming-conventions

faa0138

yurishkuro merged commit 99b2e7d into open-telemetry:master Jul 17, 2020

justinfoote mentioned this pull request Sep 25, 2020

Add outcome label for http conventions for metrics (only) open-telemetry/opentelemetry-specification#1000

Closed

aabmass mentioned this pull request Oct 8, 2020

System metrics semantic conventions open-telemetry/opentelemetry-specification#937

Merged

justinfoote mentioned this pull request Oct 19, 2020

Add metric name pluralization guidelines open-telemetry/opentelemetry-specification#1109

Merged
