Mark local root spans in distributed parts of a trace. #366

gbbr · 2019-11-26T09:03:26Z

I'm proposing to make the same_process_as_parent_span property of the span prototype part of the specification, as this can turn up to be useful in many scenarios.

It doesn't have to have the same form or the same name, but as long as the information is there, that would be great.

The text was updated successfully, but these errors were encountered:

Flarna · 2019-12-04T06:50:11Z

I think such information should be available via resources. It may be hard to detect if the parent span is from same process or not. Consider a HTTP request to your own application or you use some message queue and send messages to yourself. In both cases it's hard to distinguish if the sender was in process or not as you parse the tag on the message (e.g. traceparent HTTP header) to get the parent span id.

jmacd · 2019-12-05T23:48:06Z

@Flarna I agree. It would be appropriate if there were a semantic convention that uniquely identified the process, but the are only TODOs around this: https://github.com/open-telemetry/opentelemetry-specification/blob/master/specification/data-resource-semantic-conventions.md

gbbr · 2019-12-06T11:18:20Z

That's a good point to consider. I'm wondering if that is a case we should worry about. It certainly seems very edge and possibly indicative of a sub optimal architecture, but forgive me if my assumption is wrong. Nevertheless it is a use case to consider.

What I'm actually hoping to obtain here is a flag which indicates if the parent is in another part of a distributed trace, similar to what existed in Opencensus (see census-instrumentation/opencensus-specs#155), be that the same process or not. This would apply in the described scenario too.

jmacd · 2019-12-06T17:51:50Z

I believe in the current specification, the goal of SpanKind is to convey this information. If you use SpanKind=CLIENT you are a remote child, if you use SpanKind=SERVER you are a remote parent.

There are several active discussions around SpanKind, please see the current spec and #51, #371.

Oberon00 · 2020-01-30T12:45:08Z

Actually, I think I just found a real use-case for having this field.

Consider the case where a trace crosses systems monitored by different backends (not even necessarily different vendors, but backend instances with disconnected data storage). More concretely, a trace that goes A1 -> B -> A2. Thanks to W3C TraceContext, the traceparent and tracestate header get "properly" propagated through. But even though A1 and A2 export spans to the same backend, they will not be able to properly connect the span from A2 as a child of A1. The problem here is that the parent span ID at A2 will be that of B and backend A has no knowledge of it (see also #208 (comment)).

There is, however, a simple solution to this: Store a "per-backend" parent span ID in the tracestate (like instanceid@myvendor=parent:b7ad6b7169203331. This will usually be redundant with the traceparent's parent span ID , but it will be different for cross-backend traces like in the above example. This way, at A2, we know both the ID of the last span traced by our backend instance and that there was another (at least one) unknown span in-between. The backend also knows this, since the tracestate for each span is (or at least can be) exported.

Which is all fine and has nothing to do with this issue so far, until you consider that currently you will run into troubles implementing this. Namely, you only get to modify the tracestate at the propagators. But the tracestate is also propagated in-process, which means that when you have an in-process child, the parent span ID and the tracestate's parent span ID will get out of sync, which is indistinguishable from the case where an unknown span was in-between. The same_process_as_parent_span (I'd prefer is_local_root) field would make this distinguishable, in case it is false, the backend can ignore the tracestate.
EDIT: Note that the isRemote property (introduced with #187, #216) on the span context does not help here because someLocalRootSpan.getContext().isRemote() will always be false. It would only true for the parent span context of someLocalRootSpan which is not recorded anywhere.
EDIT2: Given that, maybe we want to store the full parent span context on the span instead of just the parent span ID. That would elegantly resolve the issue.

Oberon00 · 2020-04-27T15:46:31Z

Can we clarify the requirements in the spec on how to store the parent Span? If we clarified that at least the complete parent SpanContext (including it's IsRemote property) is stored, then we do have local root spans marked and this issue can be closed.

anuraaga · 2022-01-17T03:03:09Z

Hi all - wondering if it's a good time to look into this again? We've found a need for distinguishing local root spans because we treat service.name specially only for local root spans. Currently, we can be almost completely confident that a SERVER span is a local root, but it's less clear for example for CONSUMER which is often a local root but may not be. I like the idea of an is_local_root span attribute.

axw · 2022-01-17T03:10:26Z

@anuraaga I opened an OTEP relevant to this: open-telemetry/oteps#182. I'd be happy to hear some more feedback there.

austinlparker · 2024-04-09T20:59:01Z

Is this still relevant after OTEP 182?

Oberon00 · 2024-04-10T06:38:59Z

I think OTEP 182 solves exactly this

svrnm · 2024-07-08T09:51:42Z

This is closed via OTEP 182, if this is incorrect please re-open the issue

Oberon00 mentioned this issue Mar 12, 2020

Flask routes cannot be tagged as context propagation boundaries open-telemetry/opentelemetry-python#477

Closed

Oberon00 mentioned this issue Mar 23, 2020

Document IsRemote flag should not be propagated to children. #523

Closed

bogdandrutu added the spec:trace Related to the specification/trace directory label Jun 12, 2020

reyang added the area:api Cross language API specification issue label Jun 30, 2020

carlosalberto added the release:after-ga Not required before GA release, and not going to work on before GA label Jul 2, 2020

Oberon00 mentioned this issue Jul 5, 2020

Specify the behavior of the Tracer APIs in the absence of an SDK #689

Closed

Oberon00 mentioned this issue Jul 22, 2020

Describe the span class used by Exporters #158

Closed

Oberon00 mentioned this issue Oct 14, 2020

Move isRemote from SpanContext to Span #1086

Closed

Oberon00 mentioned this issue Apr 21, 2021

Controlling context propagation boundary #1633

Open

axw mentioned this issue May 27, 2021

translate otel messaging.* to ecs elastic/apm-server#5334

Merged

2 tasks

stuartnelson3 mentioned this issue May 31, 2021

Differentiate between active/passive message consumption when translating otel spans elastic/apm-server#5373

Open

Oberon00 mentioned this issue Aug 11, 2021

Add a section for OTel specific values in TraceState. #1852

Merged

anuraaga mentioned this issue Jan 17, 2022

Kafka consumer is showing invalid service name in XRay Service Map aws-observability/aws-otel-collector#894

Open

Oberon00 mentioned this issue Aug 10, 2022

Messaging: per-message tracing when sending batches open-telemetry/semantic-conventions#1187

Closed

Oberon00 mentioned this issue Jun 13, 2023

Indicate if a span's parent or link is remote using 2 bit flag as described in OTEP 0182 open-telemetry/opentelemetry-proto#484

Merged

Oberon00 mentioned this issue Aug 22, 2023

FaaS: Change requirements regarding handling of AWS Lambda-provided SpanContext open-telemetry/semantic-conventions#272

Closed

austinlparker added the triage:deciding:needs-info Not enough information. Left open to provide the author with time to add more details label Apr 9, 2024

svrnm closed this as completed Jul 8, 2024

svrnm removed the triage:deciding:needs-info Not enough information. Left open to provide the author with time to add more details label Jul 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mark local root spans in distributed parts of a trace. #366

Mark local root spans in distributed parts of a trace. #366

gbbr commented Nov 26, 2019 •

edited

Loading

Flarna commented Dec 4, 2019

jmacd commented Dec 5, 2019

gbbr commented Dec 6, 2019

jmacd commented Dec 6, 2019

Oberon00 commented Jan 30, 2020 •

edited

Loading

Oberon00 commented Apr 27, 2020

anuraaga commented Jan 17, 2022

axw commented Jan 17, 2022

austinlparker commented Apr 9, 2024

Oberon00 commented Apr 10, 2024

svrnm commented Jul 8, 2024

Mark local root spans in distributed parts of a trace. #366

Mark local root spans in distributed parts of a trace. #366

Comments

gbbr commented Nov 26, 2019 • edited Loading

Flarna commented Dec 4, 2019

jmacd commented Dec 5, 2019

gbbr commented Dec 6, 2019

jmacd commented Dec 6, 2019

Oberon00 commented Jan 30, 2020 • edited Loading

Oberon00 commented Apr 27, 2020

anuraaga commented Jan 17, 2022

axw commented Jan 17, 2022

austinlparker commented Apr 9, 2024

Oberon00 commented Apr 10, 2024

svrnm commented Jul 8, 2024

gbbr commented Nov 26, 2019 •

edited

Loading

Oberon00 commented Jan 30, 2020 •

edited

Loading