Introduce model translation and encoding interfaces #3200

jrcamp · 2021-05-17T17:28:50Z

This is a somewhat more limited version of #3044. It decouples
translation of models from encodings.

A model is the in-memory representation of a protocol, such as the Zipkin v2
SpanModel. After translating from pdata to the model, an encoding is used to
serialize the model to a particular byte representation (protobuf, JSON, etc.).
The reverse also applies: an encoding deserializes bytes to a model, which is
then translated to pdata.

The goal is a generic way of translating and encoding to and from pdata for supported types (Zipkin, Jaeger, etc.) without explicit knowledge of all those types. For instance, kafkareceiver/kafkaexporter can then support any type that adheres to the interfaces without knowing about any of them specifically.
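As a rough illustration of that goal, the sketch below shows a generic exporter that selects a format by name and only ever talks to an interface. Everything in it is an assumption made for the example: `Traces` is a tiny stand-in for pdata, and `TracesMarshaler`, `Export`, and the two toy formats are hypothetical names, not the interfaces added in this PR.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// Traces is a tiny stand-in for pdata.Traces (assumption for this sketch).
type Traces struct {
	SpanNames []string
}

// TracesMarshaler is a hypothetical interface that goes from pdata to bytes.
type TracesMarshaler interface {
	MarshalTraces(td Traces) ([]byte, error)
}

// jsonFormat and csvFormat are toy formats, not real protocols.
type jsonFormat struct{}

func (jsonFormat) MarshalTraces(td Traces) ([]byte, error) {
	return json.Marshal(td.SpanNames)
}

type csvFormat struct{}

func (csvFormat) MarshalTraces(td Traces) ([]byte, error) {
	return []byte(strings.Join(td.SpanNames, ",")), nil
}

// formats maps a configured name to an implementation.
var formats = map[string]TracesMarshaler{
	"json": jsonFormat{},
	"csv":  csvFormat{},
}

// Export picks the marshaler by name and never refers to a concrete format.
func Export(format string, td Traces) ([]byte, error) {
	m, ok := formats[format]
	if !ok {
		return nil, fmt.Errorf("unsupported format %q", format)
	}
	return m.MarshalTraces(td)
}

func main() {
	td := Traces{SpanNames: []string{"a", "b"}}
	for _, name := range []string{"json", "csv"} {
		buf, err := Export(name, td)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Printf("%s: %s\n", name, buf)
	}
}
```

Adding a new format in this sketch means adding one map entry; `Export` itself does not change, which is the property the description is after for the Kafka components.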

Open questions

* Currently encoding and decoding are separate interfaces, so you don't have to implement both. Model translation is currently a single interface, so you do have to implement both directions. Should the interfaces mandate two-way conversion, as translation does, or should you be able to implement one-way conversions only, as the encode/decode interfaces allow?

They were made separate interfaces.
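A minimal sketch of what one-way interfaces allow, assuming hypothetical names (`TracesEncoder`, `TracesDecoder`, `TracesCodec`, and the two toy types are illustrative, not the PR's final API): a type can implement a single direction, and a combined interface can still be expressed by embedding when both directions are needed.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// TracesEncoder serializes a model to bytes.
type TracesEncoder interface {
	EncodeTraces(model interface{}) ([]byte, error)
}

// TracesDecoder deserializes bytes to a model.
type TracesDecoder interface {
	DecodeTraces(buf []byte) (interface{}, error)
}

// TracesCodec composes both directions for callers that need a round trip.
type TracesCodec interface {
	TracesEncoder
	TracesDecoder
}

// decodeOnly implements a single direction; the model here is just []string.
type decodeOnly struct{}

func (decodeOnly) DecodeTraces(buf []byte) (interface{}, error) {
	var names []string
	if err := json.Unmarshal(buf, &names); err != nil {
		return nil, err
	}
	return names, nil
}

// jsonCodec reuses decodeOnly and adds the encoding direction.
type jsonCodec struct{ decodeOnly }

func (jsonCodec) EncodeTraces(model interface{}) ([]byte, error) {
	return json.Marshal(model)
}

// Compile-time checks that each type satisfies what it claims to.
var (
	_ TracesDecoder = decodeOnly{}
	_ TracesCodec   = jsonCodec{}
)

func main() {
	var c TracesCodec = jsonCodec{}
	buf, err := c.EncodeTraces([]string{"a"})
	if err != nil {
		fmt.Println("encode error:", err)
		return
	}
	model, err := c.DecodeTraces(buf)
	fmt.Println(string(buf), model, err)
}
```

The trade-off is that a caller who needs both directions must ask for the composed interface explicitly, while a receiver-only component can accept just a decoder.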

Before, the encoder interfaces took pdata and serialized it by calling the translator. This change fully separates the concerns: the model does pure translation, the encoding does pure serialization, and the transcoder does both. Without this there was no way to use the encoder if you already had a model. Only traces are updated for now, to gather feedback.
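The separation described in this commit can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in the PR: `Traces` stands in for pdata, `toySpan` for a protocol model, and the `Transcoder` struct, its `MarshalTraces` method, and the interface names are hypothetical.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Traces is a tiny stand-in for pdata.Traces (assumption for this sketch).
type Traces struct {
	SpanNames []string
}

// FromTracesTranslator does pure translation: pdata to a protocol model.
type FromTracesTranslator interface {
	FromTraces(td Traces) (interface{}, error)
}

// TracesEncoder does pure serialization: protocol model to bytes.
type TracesEncoder interface {
	EncodeTraces(model interface{}) ([]byte, error)
}

// Transcoder does both by chaining a translator and an encoder.
type Transcoder struct {
	Translator FromTracesTranslator
	Encoder    TracesEncoder
}

func (t Transcoder) MarshalTraces(td Traces) ([]byte, error) {
	model, err := t.Translator.FromTraces(td)
	if err != nil {
		return nil, fmt.Errorf("translate: %w", err)
	}
	buf, err := t.Encoder.EncodeTraces(model)
	if err != nil {
		return nil, fmt.Errorf("encode: %w", err)
	}
	return buf, nil
}

// toySpan is a made-up protocol model.
type toySpan struct {
	Name string `json:"name"`
}

type toyTranslator struct{}

func (toyTranslator) FromTraces(td Traces) (interface{}, error) {
	spans := make([]toySpan, 0, len(td.SpanNames))
	for _, n := range td.SpanNames {
		spans = append(spans, toySpan{Name: n})
	}
	return spans, nil
}

type toyJSONEncoder struct{}

func (toyJSONEncoder) EncodeTraces(model interface{}) ([]byte, error) {
	spans, ok := model.([]toySpan)
	if !ok {
		return nil, fmt.Errorf("expected []toySpan, got %T", model)
	}
	return json.Marshal(spans)
}

func main() {
	// Full path: pdata -> model -> bytes.
	tc := Transcoder{Translator: toyTranslator{}, Encoder: toyJSONEncoder{}}
	buf, err := tc.MarshalTraces(Traces{SpanNames: []string{"a"}})
	fmt.Println(string(buf), err)

	// A caller that already holds a model can use the encoder on its own.
	buf, err = toyJSONEncoder{}.EncodeTraces([]toySpan{{Name: "b"}})
	fmt.Println(string(buf), err)
}
```

The second call in `main` is the case the commit message highlights: the encoder is usable without going through translation.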

bogdandrutu · 2021-05-17T19:34:54Z

protocols/zipkinv2/encoding.go

+	_ encoding.TracesDecoder = (*Encoder)(nil)
+)
+
+type Encoder struct {


map[encoding.Type]Encoder

I think it's possible but will require some helpers. Let's punt on it for now though as it doesn't impact the external API so can be easily tidied up later.

* standardized on encoding/decoding terminology
* renamed encodings to bytes to avoid confusion with encode/decode terminology
* added high level interfaces to top level protocols package that goes directly pdata <-> bytes

bogdandrutu · 2021-05-18T21:21:11Z

@jrcamp later we may want to add a "compressor" interface for things like https://github.com/open-telemetry/opentelemetry-collector/issues/3223. Or maybe we can use a standard library for that.

bogdandrutu

Top directory should be "model" instead of "protocols", see #3104 for the decision of the package.

bogdandrutu · 2021-05-20T00:14:55Z

Some feedback based on my quick PR to fix usage of internal protos in fileexporter:

* Because we accept/return []byte instead of io.Reader/io.Writer, we need to do an extra allocation and copy of the bytes.
* I feel that "Serializer -> Marshaler" and "Deserializer -> Unmarshaler" makes more sense now :)

See #3238

jrcamp · 2021-05-20T03:06:12Z

> Some feedback based on my quick PR to fix usage of internal protos in fileexporter:
>
> Because we accept/return []byte instead of io.Reader/io.Writer, we need to do an extra allocation and copy of the bytes.

I looked into io.{Reader,Writer} earlier and, from what I could tell, protobuf (binary representation) didn't appear to support them and dealt in []byte. Most of the existing serialization code also seemed to return []byte, which is why I went that route. (Not saying it's correct, just explaining how I got here.)

I think both might have their place, and maybe the answer is to support both. My concern about only supporting Reader/Writer is that internally the encoder deals in bytes and has to return bytes.NewReader(buf); then the user wants to deal in []byte, so they call ioutil.ReadAll(reader), which kind of defeats the whole purpose. :)

Perhaps both could be supported though?

// MetricsMarshaler encodes protocol-specific data model into bytes.
type MetricsMarshaler interface {
	MarshalMetrics(model interface{}) (io.Reader, error)
	MarshalMetricsBytes(model interface{}) ([]byte, error)
}

The protocol would implement one, and the default implementation of the unimplemented one would call the implemented one and wrap/unwrap bytes as needed. That way, if you call MarshalMetricsBytes and the implementation internally deals in bytes, you don't have to double allocate. Maybe.
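One possible shape for that "implement one, get the other for free" idea is a pair of small adapters. This is only a sketch of the suggestion above: splitting `MetricsMarshaler` into two single-method interfaces, and the names `WithReader`, `WithBytes`, and `jsonMetrics`, are assumptions made for the example.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
)

// MetricsBytesMarshaler is the []byte-returning half.
type MetricsBytesMarshaler interface {
	MarshalMetricsBytes(model interface{}) ([]byte, error)
}

// MetricsReaderMarshaler is the io.Reader-returning half.
type MetricsReaderMarshaler interface {
	MarshalMetrics(model interface{}) (io.Reader, error)
}

// WithReader adds MarshalMetrics to a bytes-only implementation.
// Wrapping with bytes.NewReader does not copy the buffer.
type WithReader struct{ MetricsBytesMarshaler }

func (w WithReader) MarshalMetrics(model interface{}) (io.Reader, error) {
	buf, err := w.MarshalMetricsBytes(model)
	if err != nil {
		return nil, err
	}
	return bytes.NewReader(buf), nil
}

// WithBytes adds MarshalMetricsBytes to a reader-only implementation.
// This direction has to read the stream fully, so it does allocate.
type WithBytes struct{ MetricsReaderMarshaler }

func (w WithBytes) MarshalMetricsBytes(model interface{}) ([]byte, error) {
	r, err := w.MarshalMetrics(model)
	if err != nil {
		return nil, err
	}
	return io.ReadAll(r)
}

// jsonMetrics is a toy protocol that natively produces bytes.
type jsonMetrics struct{}

func (jsonMetrics) MarshalMetricsBytes(model interface{}) ([]byte, error) {
	return json.Marshal(model)
}

func main() {
	m := WithReader{MetricsBytesMarshaler: jsonMetrics{}}

	// Bytes in, bytes out: no wrapping and no second allocation.
	buf, err := m.MarshalMetricsBytes(map[string]int{"count": 1})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(string(buf))

	// Callers that want a stream get one backed by the same kind of buffer.
	r, err := m.MarshalMetrics(map[string]int{"count": 1})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	streamed, err := io.ReadAll(r)
	fmt.Println(string(streamed), err)
}
```

A caller that wants bytes from a bytes-native protocol hits `jsonMetrics.MarshalMetricsBytes` directly through the embedded field, which is the double-allocation case the comment is trying to avoid.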

> I feel that "Serializer -> Marshaler" and "Deserializer -> Unmarshaler" makes more sense now :)

See #3238

bogdandrutu

I think this is good as a first step. Can you for the moment put it in the internal/model so we don't expose it publicly until we know the exact path we want?

bogdandrutu · 2021-05-20T16:40:07Z

internal/model/translator/translator.go:32:58: unexported-return: exported func NewErrIncompatibleType returns unexported type *translator.errIncompatibleType, which can be annoying to use (revive)
func NewErrIncompatibleType(expected, given interface{}) *errIncompatibleType {
                                                         ^

Just return error :)
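A sketch of what that fix could look like: the constructor keeps its signature from the lint output but returns the `error` interface, so the unexported concrete type no longer leaks. The struct fields, the message text, and the `IsIncompatibleType` helper are assumptions for illustration, not code from this PR.

```go
package main

import (
	"errors"
	"fmt"
)

// errIncompatibleType stays unexported; callers only ever see an error.
type errIncompatibleType struct {
	expected, given interface{}
}

// Error reports both types; the exact wording is an assumption.
func (e *errIncompatibleType) Error() string {
	return fmt.Sprintf("expected model type %T but given %T", e.expected, e.given)
}

// NewErrIncompatibleType returns the error interface instead of the
// unexported pointer type, which is what the revive warning asks for.
func NewErrIncompatibleType(expected, given interface{}) error {
	return &errIncompatibleType{expected: expected, given: given}
}

// IsIncompatibleType is a hypothetical helper for callers that need to
// detect this error, including when it has been wrapped.
func IsIncompatibleType(err error) bool {
	var target *errIncompatibleType
	return errors.As(err, &target)
}

func main() {
	err := NewErrIncompatibleType([]string{}, 42)
	fmt.Println(err)

	wrapped := fmt.Errorf("decode failed: %w", err)
	fmt.Println(IsIncompatibleType(wrapped))
}
```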

jrcamp commented May 17, 2021

use encode/decode consistently

deaafb6

bogdandrutu reviewed May 17, 2021

review feedback

2cf2810

bogdandrutu reviewed May 17, 2021

jrcamp mentioned this pull request May 17, 2021

OTLP over message systems open-telemetry/oteps#157

Closed

jrcamp force-pushed the unmarshal-intf branch 2 times, most recently from 95571ad to a87a9ec Compare May 18, 2021 03:31

add high level interface

c22633a

jrcamp force-pushed the unmarshal-intf branch from a87a9ec to c22633a Compare May 18, 2021 03:35

bogdandrutu reviewed May 18, 2021

bogdandrutu mentioned this pull request May 19, 2021

Extract pdata into a separate Go Module #3104

Closed

9 tasks

cleanup

2cbc215

jrcamp changed the title ~~[wip] Introduce model translation and encodings interfaces~~ Introduce model translation and encoding interfaces May 19, 2021

jrcamp marked this pull request as ready for review May 19, 2021 03:16

jrcamp requested a review from a team May 19, 2021 03:16

dashpole approved these changes May 19, 2021

jrcamp requested a review from bogdandrutu May 19, 2021 18:50

tigrannajaryan reviewed May 19, 2021

bogdandrutu reviewed May 19, 2021

jrcamp and others added 2 commits May 19, 2021 17:42

Apply suggestions from code review

dd2068d

Co-authored-by: Tigran Najaryan <[email protected]>

renamings

2cdfcc4

jrcamp force-pushed the unmarshal-intf branch from 9b8d2de to 2cdfcc4 Compare May 19, 2021 22:09

jrcamp added 2 commits May 19, 2021 18:13

reword error

78a9a4c

cleanup

a1f4736

bogdandrutu reviewed May 19, 2021

bogdandrutu reviewed May 19, 2021

bogdandrutu reviewed May 19, 2021

return interface instead of out parameter

b384588

bogdandrutu reviewed May 19, 2021

review feedback

1aa9a13

serialize -> marshal

8cadcad

bogdandrutu approved these changes May 20, 2021

put in internal

3460c5d

jrcamp added 2 commits May 20, 2021 12:53

lint

a4345e0

Merge remote-tracking branch 'upstream/main' into unmarshal-intf

1675664

bogdandrutu merged commit db093a6 into open-telemetry:main May 20, 2021
