Propagate `sampling_priority` #248

ufoot · 2017-11-10T21:33:58Z

Follow-up to #245

Most significant change here is that context is used to propagate the trace_id, parent_id and sampling_priority. This makes it more alike the Python implementation and allows more code sharing.

ufoot · 2017-11-10T21:54:28Z

lib/ddtrace/tracer.rb


-      ctx ||= call_context


^ that call_context was creating buggy behaviors in corner cases, it broke some tests as I implemented the id propagation using the context. Those tests were fixed when removing this call and creating a fresh context when none is given. Which is what we do in Python BTW.

ufoot · 2017-11-10T21:55:10Z

test/tracer_test.rb

@@ -264,7 +264,7 @@ def test_start_span_child_of_context
    thread = Thread.new do
      mutex.synchronize do
        @thread_span = tracer.start_span('a')
-        @thread_ctx = tracer.call_context
+        @thread_ctx = @thread_span.context


Nasty, but yes, getting the span context is the right thing to do here.

ufoot · 2017-11-10T21:57:59Z

lib/ddtrace/context.rb

-      @sampled = false
+      @parent_trace_id = options.fetch(:trace_id, nil)
+      @parent_span_id = options.fetch(:span_id, nil)
+      @sampled = options.fetch(:sampled, false)


I'm mixed here. Python implementation uses true by default. We could change this, it possibly breaks some internal tests, but none of them should impact behavior from a user POV. AFAIK client sampling is not really massively used today in Ruby, and as a side note, first span added would turn this to true. But I think it's worth noticing.

ufoot · 2017-11-10T22:20:02Z

lib/ddtrace/distributed_headers.rb

+    def sampling_priority
+      hdr = header(HTTP_HEADER_SAMPLING_PRIORITY)
+      # It's important to make a difference between no header,
+      # and a header defined to zero.


This is required because some hosts might have priority sampling disabled and/or have outdated libraries so they are not propagating the information. In this case, we want to just ignore it and not set it.

p-lambert

Thanks for helping me a lot with all these changes! My comments are mostly about following the principle of single responsibility in the DistributedHeaders. Since we're designing new stuff, I think it's worth discussing these topics.

p-lambert · 2017-11-13T16:25:28Z

lib/ddtrace/distributed_headers.rb

+    def sampling_priority
+      hdr = header(HTTP_HEADER_SAMPLING_PRIORITY)
+      # It's important to make a difference between no header,
+      # and a header defined to zero.


p-lambert · 2017-11-13T16:26:58Z

lib/ddtrace/distributed_headers.rb

+        headers[HTTP_HEADER_SAMPLING_PRIORITY] = span.sampling_priority.to_s
+      end
+      env.merge! headers
+      env.delete(HTTP_HEADER_SAMPLING_PRIORITY) unless span.sampling_priority


Is there a reason for this #delete?

I did it on purpose indeed, my sense was that, if the headers say: there's no sampling priority, it should actually be removed. After all that function is named inject! so it sort of implies: take whatever step is needed so that this env, afterwards, reflects the state of the headers passed to it, including the fact that if it has no sampling priority, then this field should just not exist at all. It's really a corner case I think.

p-lambert · 2017-11-13T16:34:35Z

lib/ddtrace/distributed_headers.rb

+      env.delete(HTTP_HEADER_SAMPLING_PRIORITY) unless span.sampling_priority
+    end
+
+    def self.extract(env)


More a design nitpick, but I'd prefer to do less things here and keep DistributedHeaders more like as presenter around the environment object. I think the concept of a Context should belong to somewhere else (probably to whoever call this).

OK, my sense is that this probably, indeed, does not belong to something like distributed_headers, true enough. How about, then, renaming it to, say, propagation/http as in https://github.com/DataDog/dd-trace-py/blob/master/ddtrace/propagation/http.py Because, it's of general usage, it's proven useful in Python (extracting and injecting those values from headers to hash is quite a straightforward use case and many libraries may benefit from it, even if today only Faraday uses it).

Generally I'd prefer to defer things like this until they're proven useful, but I'm pretty sure you can anticipate more use-cases and scenarios for it than I do, so I'm fine with it.

p-lambert · 2017-11-13T16:36:28Z

lib/ddtrace/context.rb

      @finished_spans = 0
      @current_span = nil
    end

+    def trace_id
+      @mutex.synchronize do
+        return @parent_trace_id


If you don't mind changing this, let's get rid of the return just to be more idiomatic.

p-lambert · 2017-11-13T16:39:22Z

lib/ddtrace/context.rb

@@ -37,11 +64,21 @@ def current_span
      end
    end

+    def set_current_span(span)


p-lambert · 2017-11-13T16:46:36Z

lib/ddtrace/distributed_headers.rb

+      value
+    end
+
+    def self.inject!(span, env)


Since this is basically related to the faraday integration I would prefer to keep it there. I don't think we should add methods to the public API of this class unless it is reusable for multiple entities.

p-lambert · 2017-11-13T16:49:43Z

test/distributed_headers_test.rb

+require 'ddtrace/span'
+require 'ddtrace/distributed_headers'
+
+class DistributedHeadersTest < Minitest::Test


👍
Sorry for leaving this behind!

p-lambert and others added 5 commits November 3, 2017 16:14

[WIP] Fetch distributed tracing context from HTTP headers

156dbf4

[WIP] Refactor #guess_context_and_parent

2c02803

[WIP] Propagate sampling_priority

57f18fa

[priority sampling] make a difference between sampling priority 0 vs nil

b380f18

[priority sampling] using context and common to propagate headers

9bfe765

ufoot requested review from p-lambert and palazzem November 10, 2017 21:33

[priority sampling] refined priority sampling tests

acfc17e

ufoot commented Nov 10, 2017

View reviewed changes

p-lambert reviewed Nov 13, 2017

View reviewed changes

ufoot added 2 commits November 13, 2017 15:45

[priority sampling] splitting distributed_headers and http_propagator

8375aca

[priority sampling] coding style nitpicks and more tests

19c2fb4

p-lambert approved these changes Nov 13, 2017

View reviewed changes

ufoot modified the milestones: 0.9.2, 0.10.0 Nov 14, 2017

ufoot merged commit 19c2fb4 into master Nov 14, 2017

palazzem deleted the christian/sampling_priority_propagation branch November 14, 2017 17:30

palazzem added the core Involves Datadog core libraries label Nov 30, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Propagate `sampling_priority` #248

Propagate `sampling_priority` #248

ufoot commented Nov 10, 2017

ufoot Nov 10, 2017

ufoot Nov 10, 2017

ufoot Nov 10, 2017

ufoot Nov 10, 2017

p-lambert Nov 13, 2017

p-lambert left a comment

p-lambert Nov 13, 2017

p-lambert Nov 13, 2017

ufoot Nov 13, 2017 •

edited

Loading

p-lambert Nov 13, 2017

p-lambert Nov 13, 2017

ufoot Nov 13, 2017

p-lambert Nov 13, 2017

p-lambert Nov 13, 2017

ufoot Nov 13, 2017

p-lambert Nov 13, 2017

p-lambert Nov 13, 2017

p-lambert Nov 13, 2017

Propagate sampling_priority #248

Propagate sampling_priority #248

Conversation

ufoot commented Nov 10, 2017

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

p-lambert left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ufoot Nov 13, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Propagate `sampling_priority` #248

Propagate `sampling_priority` #248

ufoot Nov 13, 2017 •

edited

Loading