RFC: Trace Identifiers #109

tedsuo · 2018-03-09T01:05:50Z

Proposal

https://github.com/opentracing/specification/blob/tedsuo/trace-identifiers/rfc/trace_identifiers.md

The Opentracing model of computation specifies two primary object types, Spans and Traces, but does not specify identifiers for these objects. The lack of identifiers makes it very difficult to correlate tracing data with data in other systems. This complicates a number of important tasks, and prevents the creation of reusable trace observers.

To address these difficulties, the OpenTracing SpanContext interface is extended to include SpanID and TraceID accessors.

Prior Proposals

#107 Trace-Parent Accessors

objectiser · 2018-03-09T09:49:11Z

rfc/trace_identifiers.md

+# Risk Assessment
+Because this proposal includes the exposure of new information, and adds entirely new concepts to the interface, some risks exist.
+
+Existing tracers may not be able to support this feature, as their internal model does not include and client-side trace identifiers.


include any client-side trace identifiers

yurishkuro · 2018-03-09T19:23:10Z

rfc/trace_identifiers.md

+
+The `Trace-Parent` header contains the following fields: `version`, `trace-id`, `span-id`, and `trace-options`.
+
+`Trace-id` is the ID of the whole trace forest. It is represented as a 16-bytes array, e.g.,4bf92f3577b34da6a3ce929d0e0e4736. All bytes 0 is considered invalid. Implementation may decide to completely ignore the Trace-Parent if the trace-id is invalid.


nit: make these bullet points, for easier reading

yurishkuro · 2018-03-09T19:23:29Z

rfc/trace_identifiers.md

+## B3 HTTP Headers
+The [B3 HTTP headers](https://github.com/openzipkin/b3-propagation) are widely adopted, mostly by Zipkin-like tracing systems. The B3 protocol includes `X-B3-TraceId` and `X-B3-SpanId` as required headers.
+
+`TraceId` is 64 or 128-bit in length and indicates the overall ID of the trace. Every span in a trace shares this ID.


bullet points

I like the table approach!

yurishkuro · 2018-03-09T19:29:40Z

rfc/trace_identifiers.md

+* Strongly supported across many languages, and commonly used for transferring data between independent subsystems.
+
+## Alternate Formats
+In some cases, additional formats may be appropriate, if a language supports multiple common transport formats. Exposing accessors in other formats should be done to prevent double allocations while formating the identifiers. For example, converting from a tracer’s native format to a string may trigger an allocation. If there are many systems which want to consume the identifier in a format which requires an allocation when converting from a string, a second allocation could occur.


The comments about memory allocations might be better moved to the Risks section, as a performance consideration. It could be relevant even in cases when we don't introduce additional exposition formats, e.g. when the internal representation that the tracer uses is not a string. Many existing tracers represent span id as uint64, so to support the new API change efficiently they might want to cache the string representation.

Yes, makes sense.

erabug · 2018-03-10T04:16:18Z

rfc/trace_identifiers.md

+
+The `Trace-Parent` header contains the following fields: `version`, `trace-id`, `span-id`, and `trace-options`.
+
+`Trace-id` is the ID of the whole trace forest. It is represented as a 16-bytes array, e.g.,4bf92f3577b34da6a3ce929d0e0e4736. All bytes 0 is considered invalid. Implementation may decide to completely ignore the Trace-Parent if the trace-id is invalid.


erabug · 2018-03-10T04:25:06Z

rfc/trace_identifiers.md

+## B3 HTTP Headers
+The [B3 HTTP headers](https://github.com/openzipkin/b3-propagation) are widely adopted, mostly by Zipkin-like tracing systems. The B3 protocol includes `X-B3-TraceId` and `X-B3-SpanId` as required headers.
+
+`TraceId` is 64 or 128-bit in length and indicates the overall ID of the trace. Every span in a trace shares this ID.


erabug · 2018-03-10T04:26:14Z

rfc/trace_identifiers.md

+
+The `Trace-Parent` header contains the following fields: `version`, `trace-id`, `span-id`, and `trace-options`.
+
+`Trace-id` is the ID of the whole trace forest. It is represented as a 16-bytes array, e.g.,4bf92f3577b34da6a3ce929d0e0e4736. All bytes 0 is considered invalid. Implementation may decide to completely ignore the Trace-Parent if the trace-id is invalid.


If the Trace-Context spec changes, these format specifications will become stale. Perhaps we could include the spec version described here as a signal to make sure you check the Trace-Context spec as the source of truth. (Same with the B3 protocol below.) Alternatively, just add a warning that these formats are subject to change.

Good point!

erabug · 2018-03-10T04:29:36Z

rfc/trace_identifiers.md

+## Trace-Context HTTP Headers
+[Trace-Context HTTP headers](https://github.com/w3c/distributed-tracing) are in the process of being standardized via the w3c. The tracing community has voiced strong support in implementing these headers for use in tracing interop.
+
+The `Trace-Parent` header contains the following fields: `version`, `trace-id`, `span-id`, and `trace-options`.


This sentence nicely orients you to the context of trace-id and span-id within the header. A similar sentence under the B3 Headers section would also be nice.

erabug · 2018-03-10T04:36:46Z

rfc/trace_identifiers.md

+**Current State:** Draft 
+**Author:** [tedsuo](https://github.com/tedsuo)
+
+The Opentracing model of computation specifies two primary object types, `Spans` and `Traces`, but does not specify identifiers for these objects. The lack of identifiers makes it very difficult to correlate tracing data with data in other systems. This complicates a number of important tasks, and prevents the creation of reusable trace observers.


s/Opentracing/OpenTracing. In fact, I might reword the opening with a more positive bent and explain the change first, then the why, and link to the use cases you explain further down.

The OpenTracing SpanContext interface extends to include SpanID and TraceID accessors. Identifiers for the two primary object types make it easier to correlate tracing data with data in other systems, simplify important tasks, and allow the creation of reusable trace observers. Some use cases are detailed below.

erabug · 2018-03-10T04:51:37Z

rfc/trace_identifiers.md

+## Backwards Compatibility and Optional Support
+The OpenTracing specification does not currently require trace and span identifiers. To continue support for existing tracers, the empty string value can be returned when no ID has been set.
+
+# Use Cases


+1 to use cases! At the top you mention that the lack of identifiers "complicates a number of important tasks" in addition to preventing use in systems like logging and reusable tracer observers (which are each called out in separate sections). What are these other important tasks that IDs facilitate?

erabug · 2018-03-10T04:59:13Z

rfc/trace_identifiers.md

+# Use Cases
+
+## Log Correlation
+The primary expected consumer for Trace-Context identifiers are logging systems which run independently from the tracing system. Request level log indexing has become a common practice in logging, and the tracing system contains the mechanism for propagating the relevant identifiers.


I feel like this is missing the "therefore" at the end, i.e.

Logging systems are primary consumers (sources?) of trace data but they can't directly access the data

Including request identifiers is a common logging practice

Tracing systems know these identifiers, therefore it makes sense for them to provide accessors

erabug · 2018-03-10T05:04:38Z

rfc/trace_identifiers.md

+* Correlating logs, as mentioned above.
+
+# Risk Assessment
+Because this proposal includes the exposure of new information, and adds entirely new concepts to the interface, some risks exist.


We should probably be more specific about these risks and implications. On the last call, Chris raised the issue that the ID returned might not be the one you're expecting (for example, the implementation's concept of a trace ID rather than the one propagated in the header).

Is there also concern about being too closely tied to protocol specifics? Should we be explicit the decision that trace and span IDs are "special cases" and worth the risk?

Yes, there's general agreement that this is useful for secondary systems, and since most tracers support these identifiers (and many are looking to standardize on Trace-Context) this is a good time to add them.

Part of the spec is that you should not assume anything about the value returned, other than it's uniqueness. Traces are globally unique, spans are unique within a trace. These are the only two properties a caller should depend on if they want to write observers which are vendor-neutral.

Happy to add more language making this clearer.

erabug · 2018-03-10T05:07:59Z

rfc/trace_identifiers.md

+## Trace Observers
+The OpenTracing community would like to develop secondary observation systems which utilize the tracing runtime, but are tracer independent. Examples include:
+
+* Generating metrics from tracing data.


Nitpick that bullet points typically do not include periods. Also, it might be nice to add a "therefore" to the introduction, like

Trace and span identifiers would allow these tracer observers to create trace metadata without having to pay attention to the request headers.

Good point! Get it? Point? Like bullet point?

Sorry.

erabug · 2018-03-10T05:09:16Z

rfc/trace_identifiers.md

+* `TraceID`, accessible as a **string**.
+* `SpanID`, accessible as a **string**.
+
+**String** values are used for identifiers. In this context, a string is defined as an immutable, variable length sequence of unicode characters. A string is preferred over other formats for the following reasons:


Perhaps call out here that an empty string is a valid return type.

* Improved Risks and Use Cases * Tables for API descritions * Nits

tedsuo · 2018-03-13T22:05:34Z

@erabug @yurishkuro thanks for the feedback! I've made changes based on your suggestions.

One pedantic note: I like the way the tables look. But, technically, tables are not part of basic markdown, only GitHub. We've discussed avoiding whatever-flavored markdown in our docs, since it complicated which tool you use to render them. Is this a red flag for RFCs as well?

cwe1ss

LGTM! 👍

Just curious: Has it been discussed if using Object instead of String would be an option to prevent allocations? I guess it doesn' make much sense as most users of these IDs need a human-readable format, so having e.g. a byte array wouldn't be of much help anyway, right?

cwe1ss · 2018-03-14T15:29:24Z

rfc/trace_identifiers.md

+
+The OpenTracing SpanContext interface is extended to include `SpanID` and `TraceID` accessors. 
+
+The Opentracing model of computation specifies two primary object types, `Spans` and `Traces`, but does not specify identifiers for these objects. Identifiers for the two primary object types make it easier to correlate tracing data with data in other systems, simplify important tasks, and allow the creation of reusable trace observers. Some use cases are detailed below.


nit: "OpenTracing" instead of "Opentracing"

isaachier · 2018-03-14T21:01:25Z

rfc/trace_identifiers.md

+## Extra Allocations and Overhead
+Internally, tracers do not always use strings to represent their identifiers. So there is a conversion cost when using these accessors. 
+
+While a single allocation may be inevitable, exposing accessors in additional formats could be done to prevent double allocations while formating the identifiers. For example, converting from a tracer’s native format to a string may trigger an allocation. If there are many systems which want to consume the identifier in a format which requires an allocation when converting from a string, a second allocation could occur.


formating should be formatting

isaachier · 2018-03-14T21:11:17Z

rfc/trace_identifiers.md

+## Extra Allocations and Overhead
+Internally, tracers do not always use strings to represent their identifiers. So there is a conversion cost when using these accessors. 
+
+While a single allocation may be inevitable, exposing accessors in additional formats could be done to prevent double allocations while formating the identifiers. For example, converting from a tracer’s native format to a string may trigger an allocation. If there are many systems which want to consume the identifier in a format which requires an allocation when converting from a string, a second allocation could occur.


What I picture for a C API would not incur allocation in the library. Just ask the user for a buffer.

#define TRACE_ID_STR_LEN 16 int span_context_trace_id(const custom_span_context* ctx, void* buf, int len) { return snprintf(buf, len, "%" PRIx64 "%016" PRIx64, span_context->trace_id.high, span_context->trace_id.low); }

Good approach! In many languages, this option will probably not be available, as the APIs you are trying to put the data into often will not support buffer types (or the language may not have them at all), but for C this is good.

isaachier · 2018-03-14T21:14:22Z

@cwe1ss if the only issue were how to represent the array, this at least could be done statically as a fixed-size array. I think the bigger issue here is why expose only a string interface and not also a raw byte interface?

tedsuo · 2018-03-20T21:37:02Z

@isaachier @cwe1ss regarding the use of String as the initial type. These accessors don't presume to know what the underlying format is; instead they provide formats which the caller may find useful. String is the most common and widely supported format, so at the cross-language level, it represents the lowest common denominator. However, on a language by language level, additional accessors which return different formats may be useful. Please see the "Alternate Formats" section for details:
https://github.com/opentracing/specification/blob/tedsuo/trace-identifiers/rfc/trace_identifiers.md#alternate-formats

isaachier · 2018-03-20T21:55:34Z

@tedsuo can we set a cap on the maximum size the string can be. This would simplify the C API considerably.

tedsuo · 2018-03-20T22:04:27Z

@isaachier a cap is a good idea. There is surely a reasonable upper bound, possibly 16-bytes.

tedsuo · 2018-03-23T01:50:43Z

@isaachier I added a section on length and formatting under Risks:
https://github.com/opentracing/specification/blob/tedsuo/trace-identifiers/rfc/trace_identifiers.md#restrictions-on-length-and-formatting

Would you be willing to move forwards with testing a string interface? We can continue discussing restrictions in the context of a release candidate.

cwe1ss · 2018-03-23T13:58:12Z

rfc/trace_identifiers.md

@@ -60,12 +60,12 @@ The primary expected consumer for Trace-Context identifiers are logging systems
 Log indexing has become a common practice, often by including a request identifier in the log. In the past, this has involved manually propagating these identifiers as headers. However, systems which using OpenTracing automatically propagate these identifiers via the Inject/Extract interface. Some of these identifiers are user-generated, and contained in Baggage. However, the most relevant identifiers for log indexing are the Trace and Span IDs. Therefore, exposing these values would be immensley valuable.

 ## Trace Observers
-The OpenTracing community would like to develop secondary observation systems which utilize the tracing runtime, but are tracer independent. Examples include:
+The OpenTracing community would like to develop secondary observation systems which utilize the tracing runtime, but are tracer independent. Trace and span identifiers would allow these observers to correlate tracing data without having knowledge of the wire protocol or tracing implemnetation. Examples include:


nit: "implemnetation"

isaachier · 2018-03-23T15:38:15Z

Ya no worries I'm fine with this approach.

felixbarny · 2018-03-31T07:28:59Z

rfc/trace_identifiers.md

+
+| field | format | description |
+| :---  | :---   | :---        |
+| `TraceId` | 64 or 128-bit | The ID of the trace. Every span in a trace shares this ID. |


Be consistent in describing the length in bits vs bytes (in TraceContext, you wrote the length is 16 bytes) . Bits is more commonly used I feel.

Ok, I normalized on bits, and included a note about character space (HEXDIG vs opaque).

* bits instead of bytes * include description of character space

First draft of Trace Identifiers proposal

a0e7dcd

tedsuo mentioned this pull request Mar 9, 2018

RFC: Trace-Parent Accessors #107

Closed

objectiser reviewed Mar 9, 2018

View reviewed changes

yurishkuro approved these changes Mar 9, 2018

View reviewed changes

erabug approved these changes Mar 10, 2018

View reviewed changes

First Revision

ae353cd

* Improved Risks and Use Cases * Tables for API descritions * Nits

cwe1ss approved these changes Mar 14, 2018

View reviewed changes

isaachier suggested changes Mar 14, 2018

View reviewed changes

jpkrohling approved these changes Mar 20, 2018

View reviewed changes

tedsuo added 2 commits March 20, 2018 16:30

Small text change and typos

0b740b7

Add risk around length and character restrictions

3ff283e

cwe1ss reviewed Mar 23, 2018

View reviewed changes

isaachier approved these changes Mar 23, 2018

View reviewed changes

cwe1ss mentioned this pull request Mar 30, 2018

ActiveSpan is too Opaque opentracing/opentracing-java#208

Closed

3 tasks

felixbarny reviewed Mar 31, 2018

View reviewed changes

normalize id formats

147c784

* bits instead of bytes * include description of character space

tedsuo merged commit c1d2721 into master Apr 16, 2018


		The `Trace-Parent` header contains the following fields: `version`, `trace-id`, `span-id`, and `trace-options`.

		`Trace-id` is the ID of the whole trace forest. It is represented as a 16-bytes array, e.g.,4bf92f3577b34da6a3ce929d0e0e4736. All bytes 0 is considered invalid. Implementation may decide to completely ignore the Trace-Parent if the trace-id is invalid.

field	format	example	description
`trace-id`	16-byte array	4bf92f3577b34da6a3ce929d0e0e4736	The ID of the whole trace forest. If all bytes are 0, the `Trace-Parent` may be ignored.
`span-id`	8-byte array	00f067aa0ba902b7	The ID of the caller span (parent). If all bytes are 0, the `Trace-Parent` may be ignored.


		The OpenTracing SpanContext interface is extended to include `SpanID` and `TraceID` accessors.

		The Opentracing model of computation specifies two primary object types, `Spans` and `Traces`, but does not specify identifiers for these objects. Identifiers for the two primary object types make it easier to correlate tracing data with data in other systems, simplify important tasks, and allow the creation of reusable trace observers. Some use cases are detailed below.

RFC: Trace Identifiers #109

RFC: Trace Identifiers #109

Conversation

tedsuo commented Mar 9, 2018

Proposal

Prior Proposals

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tedsuo commented Mar 13, 2018

cwe1ss left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

isaachier commented Mar 14, 2018

tedsuo commented Mar 20, 2018

isaachier commented Mar 20, 2018

tedsuo commented Mar 20, 2018

tedsuo commented Mar 23, 2018

Choose a reason for hiding this comment

isaachier commented Mar 23, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment