[processor/transform] Add SpanID and TraceID to the grammar #10487

TylerHelmuth · 2022-06-01T15:47:46Z

Description:
This PR adds the ability for the grammar to parse trace ids and span ids. With this change users can use trace/span ids in functions and conditions. Function creators will also be able to use SpanID and TraceID literals in their function's arguments.

Link to tracking Issue:
Originated from this PR/comment: #10471 (comment)

Testing:
Updated unit tests

Documentation:
Updated readme

TylerHelmuth · 2022-06-01T15:50:04Z

/cc @anuraaga @bogdandrutu

processor/transformprocessor/internal/common/functions.go

mx-psi · 2022-06-01T15:52:52Z

processor/transformprocessor/internal/common/parser.go

+		{Name: `SpanIDWrapper`, Pattern: `{[a-fA-F0-9]{16}}`},
+		{Name: `TraceIDWrapper`, Pattern: `{[a-fA-F0-9]{32}}`},


What if I want to pass {0000000000000000} as a string instead of an id? How would I do that?

You would do "0000000000000000", but as part of #10471 we plan to make the trace and span accessor return the SpanID and TraceID structs, so any comparison or setting wouldn't work. This PR essentially says "If you want to interact with trace ids or span ids in this grammar you better wrap it in {}"

What do you think about defining Bytes instead of specific span ID and trace ID types? It means that wrong lengths will not fail at parse time, but should still be failable when constructing the accessors in accessor validation. It could be useful elsewhere then.

Also it'd be good to think of some ideas on the syntax. {} feels a bit off personally, it looks like common template substitution paradigms. Couple of ideas

b'g00dbeef' 0xg00dbeef (perhaps literal can somehow have a helper to read this either as a int or a bytes) hex("goodbeef")

What do you think about defining Bytes instead of specific span ID and trace ID types? It means that wrong lengths will not fail at parse time, but should still be failable when constructing the accessors in accessor validation. It could be useful elsewhere then.

I like a more generic Bytes idea, but how would that affect the TraceID and SpanID accessors? The original goal was to be able to interact with TraceID and SpanID directly and not need the accessors to convert to some other form. Although I think calling .SpanID().Bytes() and .TraceID().Bytes() is still an improvement from .HexString().

Would functions that want to work with a SpanID/TraceID need to instead accept a byte array and construct the ID themselves?

Also it'd be good to think of some ideas on the syntax. {} feels a bit off personally, it looks like common template substitution paradigms

I am definitely not sold on {}. In fact my original solution was SpanID{} and TraceID{} but I removed the identifiers.

I am against using () anywhere with this token since () feels reserved for functions. hex() would look like a function call when it is not.

0x is definitely the most hex-like, but it would feel weird when thrown in front of a trace or span id, only because that isn't how they are normally interacted with. People are most used to copy-pasting the ids everywhere as if they were strings. b'' kinda feels similar, but I like it the most of the three.

Maybe there is a combination solution? We can add Bytes to the lexer's token as b'[a-fA-F0-9]*', and add SpanID, TraceID, and Bytes to the Value struct like:

type Value struct { Invocation *Invocation `( @@` SpanIDWrapper *SpanIDWrapper `| "SpanID" "{" @Bytes "}"` TraceIDWrapper *TraceIDWrapper `| "TraceID" "{" @Bytes "}"` Bytes *Bytes `| @Bytes` String *string `| @String` Float *float64 `| @Float` Int *int64 `| @Int` Bool *Boolean `| @("true" | "false")` Path *Path `| @@ )` }

With this we could interact with a normal Byte[] like set(attributes["test"], b'010203') and interact with Span and Trace IDs like set(span_id, SpanID{b'0000000000000000'}) where trace_id == TraceID{b'00000000000000000000000000000000'}. Now it feels more like we are constructing a Span or Trace ID using a primitive type.

Maybe () would be good in that combined solution 🤔

SpanID(0x0123456789abcdef) feels appropriate to me. Treating SpanID(...) as a function that returns a SpanID struct. I would expect b'0101010101' to be a binary representation. As for what values would be appropriate to provide to a SpanID(...) function, I suspect that having a Bytes type and accepting that with a hex-literal format is correct, but it would probably also be good to have a function to convert hex strings to bytes and bytes to hex strings.

Should they be actual functions tho? I guess if the function constructs the struct and returns it in the actual ExprFunc that would be performant still.

I'm implementing a solution using 0x for a generic byte slice type and making new SpanID and TraceID functions, but it would be great if this PR was merged in first: #10489

NVM @djaglowski is a mind reader

Co-authored-by: Pablo Baeyens <[email protected]>

…grammar

TylerHelmuth · 2022-06-08T18:37:11Z

The PR has been updated to handle a generic byte slice type using 0x and adds 2 new functions, TraceID and SpanID that take byte slices and return a SpanID and TraceID respectively.

Technically TraceID and SpanID do not fit the Telemetry Query Language's function syntax standards, but I didn't want to do create_TraceID or create_SpanID so I propose the function syntax standards be updated to allow "constructor" functions be allowed to not start with a verb.

codeboten · 2022-06-09T16:53:40Z

@mx-psi please review

mx-psi · 2022-06-10T07:35:39Z

@mx-psi please review

Sorry, I had already clocked off for the day. This looks good to me in any case :)

…emetry#10487) * Add SpanID and TraceID to the grammar * Updated NewGetter * Updated readme * Update NewFunctionCall to handle SpanID and TraceID * Update Changelog * Update processor/transformprocessor/internal/common/functions.go Co-authored-by: Pablo Baeyens <[email protected]> * Updated error message * Fix lint issue * Add byte slice type to grammar * update tests * Add TraceID function * Add SpanID function * Updated changelog * Updated readme * Add error check * Fixing build checks * Fix lint issues Co-authored-by: Pablo Baeyens <[email protected]> Co-authored-by: Daniel Jaglowski <[email protected]>

TylerHelmuth added 5 commits May 31, 2022 16:58

Add SpanID and TraceID to the grammar

a4954b6

Updated NewGetter

c88f7ec

Updated readme

c15d35e

Merged in upstream/main

a48ec5e

Update NewFunctionCall to handle SpanID and TraceID

faaeccd

TylerHelmuth requested a review from a team June 1, 2022 15:47

TylerHelmuth requested review from Aneurysm9 and bogdandrutu as code owners June 1, 2022 15:47

github-actions bot assigned mx-psi Jun 1, 2022

TylerHelmuth mentioned this pull request Jun 1, 2022

[processor/transform] Fix trace accessor bugs #10471

Merged

Update Changelog

5050c3b

mx-psi reviewed Jun 1, 2022

View reviewed changes

TylerHelmuth and others added 11 commits June 1, 2022 09:56

Update processor/transformprocessor/internal/common/functions.go

ef04b8e

Co-authored-by: Pablo Baeyens <[email protected]>

Updated error message

5ea0a37

Merged in upstream/main

2a11031

Merge remote-tracking branch 'upstream/main' into issue-10350-update-…

90b99a0

…grammar

Fix lint issue

77b3e05

Add byte slice type to grammar

76c9f7f

update tests

950e00d

merged in upstream/main

ddae070

Add TraceID function

6f7885c

Add SpanID function

6ef96ec

Updated changelog

f7b90e9

TylerHelmuth added 4 commits June 8, 2022 12:44

Updated readme

0130156

Add error check

e40920e

Fixing build checks

367e0c6

Fix lint issues

9afbecd

anuraaga approved these changes Jun 9, 2022

View reviewed changes

djaglowski approved these changes Jun 9, 2022

View reviewed changes

Merge branch 'main' into issue-10350-update-grammar

3f613e9

TylerHelmuth mentioned this pull request Jun 9, 2022

[Telemetry Query Language] Added Constructor definition open-telemetry/opentelemetry-collector#5509

Merged

Merge branch 'open-telemetry:main' into issue-10350-update-grammar

a3d707d

codeboten approved these changes Jun 9, 2022

View reviewed changes

codeboten added the ready to merge Code review completed; ready to merge by maintainers label Jun 9, 2022

codeboten removed the ready to merge Code review completed; ready to merge by maintainers label Jun 9, 2022

djaglowski merged commit 5e799df into open-telemetry:main Jun 9, 2022

TylerHelmuth deleted the issue-10350-update-grammar branch June 9, 2022 17:17

TylerHelmuth mentioned this pull request Aug 1, 2022

[pkg/telemetryquerylanguage] Update metrics context to be more efficient with slices #12872

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[processor/transform] Add SpanID and TraceID to the grammar #10487

[processor/transform] Add SpanID and TraceID to the grammar #10487

TylerHelmuth commented Jun 1, 2022

TylerHelmuth commented Jun 1, 2022

mx-psi Jun 1, 2022

TylerHelmuth Jun 1, 2022

anuraaga Jun 2, 2022

TylerHelmuth Jun 2, 2022

TylerHelmuth Jun 2, 2022

Aneurysm9 Jun 2, 2022

TylerHelmuth Jun 2, 2022

TylerHelmuth Jun 8, 2022

TylerHelmuth Jun 8, 2022

TylerHelmuth commented Jun 8, 2022

codeboten commented Jun 9, 2022

mx-psi commented Jun 10, 2022

		{Name: `SpanIDWrapper`, Pattern: `{[a-fA-F0-9]{16}}`},
		{Name: `TraceIDWrapper`, Pattern: `{[a-fA-F0-9]{32}}`},

[processor/transform] Add SpanID and TraceID to the grammar #10487

[processor/transform] Add SpanID and TraceID to the grammar #10487

Conversation

TylerHelmuth commented Jun 1, 2022

TylerHelmuth commented Jun 1, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

TylerHelmuth commented Jun 8, 2022

codeboten commented Jun 9, 2022

mx-psi commented Jun 10, 2022