Template provider #8372

djaglowski · 2023-09-06T15:59:56Z

Is your feature request related to a problem? Please describe.

Configuration of the collector is a major barrier to entry for users because the process of "developing" a configuration solution often requires detailed knowledge of one or more collector components, a sophisticated understanding of how to interface with an external technology, or just a non-trivial amount of effort working through necessary data manipulations.

Describe the solution you'd like

We should provide an abstraction mechanism that allows expert users to abstract away complex "configuration solutions" and provide novice users with a simplified configuration experience.

I propose that expert users should write templated configuration files which are natively recognized by the collector. Novice users may then include a templated solution in their configuration by defining only a simplified set of parameters.

For example, consider the following configuration (source) for scraping and normalizing metrics from couchbase.

receivers:
  prometheus/couchbase:
    config:
      scrape_configs:
        - job_name: 'couchbase'
          scrape_interval: 5s
          static_configs:
            - targets: ['couchbase:8091']
          basic_auth:
            username: 'otelu'
            password: 'otelpassword'
          metric_relabel_configs:
            # Include only a few key metrics
            - source_labels: [ __name__ ]
              regex: "(kv_ops)|\
                (kv_vb_curr_items)|\
                (kv_num_vbuckets)|\
                (kv_ep_cursor_memory_freed_bytes)|\
                (kv_total_memory_used_bytes)|\
                (kv_ep_num_value_ejects)|\
                (kv_ep_mem_high_wat)|\
                (kv_ep_mem_low_wat)|\
                (kv_ep_tmp_oom_errors)|\
                (kv_ep_oom_errors)"
              action: keep

processors:
  filter/couchbase:
    # Filter out prometheus scraping meta-metrics.
    metrics:
      exclude:
        match_type: strict
        metric_names:
          - scrape_samples_post_metric_relabeling
          - scrape_series_added
          - scrape_duration_seconds
          - scrape_samples_scraped
          - up

  metricstransform/couchbase:
    transforms:
      # Rename from prometheus metric name to OTel metric name.
      # We cannot do this with metric_relabel_configs, as the prometheus receiver does not
      # allow metric renames at this time.
      - include: kv_ops
        match_type: strict
        action: update
        new_name: "couchbase.bucket.operation.count"
      - include: kv_vb_curr_items
        match_type: strict
        action: update
        new_name: "couchbase.bucket.item.count"
      - include: kv_num_vbuckets
        match_type: strict
        action: update
        new_name: "couchbase.bucket.vbucket.count"
      - include: kv_ep_cursor_memory_freed_bytes
        match_type: strict
        action: update
        new_name: "couchbase.bucket.memory.usage.free"
      - include: kv_total_memory_used_bytes
        match_type: strict
        action: update
        new_name: "couchbase.bucket.memory.usage.used"
      - include: kv_ep_num_value_ejects
        match_type: strict
        action: update
        new_name: "couchbase.bucket.item.ejection.count"
      - include: kv_ep_mem_high_wat
        match_type: strict
        action: update
        new_name: "couchbase.bucket.memory.high_water_mark.limit"
      - include: kv_ep_mem_low_wat
        match_type: strict
        action: update
        new_name: "couchbase.bucket.memory.low_water_mark.limit"
      - include: kv_ep_tmp_oom_errors
        match_type: strict
        action: update
        new_name: "couchbase.bucket.error.oom.count.recoverable"
      - include: kv_ep_oom_errors
        match_type: strict
        action: update
        new_name: "couchbase.bucket.error.oom.count.unrecoverable"
      # Combine couchbase.bucket.error.oom.count.x and couchbase.bucket.memory.usage.x
      # metrics.
      - include: '^couchbase\.bucket\.error\.oom\.count\.(?P<error_type>unrecoverable|recoverable)$$'
        match_type: regexp
        action: combine
        new_name: "couchbase.bucket.error.oom.count"
      - include: '^couchbase\.bucket\.memory\.usage\.(?P<state>free|used)$$'
        match_type: regexp
        action: combine
        new_name: "couchbase.bucket.memory.usage"
      # Aggregate "result" label on operation count to keep label sets consistent across the metric datapoints
      - include: 'couchbase.bucket.operation.count'
        match_type: strict
        action: update
        operations:
          - action: aggregate_labels
            label_set: ["bucket", "op"]
            aggregation_type: sum

  transform/couchbase:
    metric_statements:
    - context: datapoint
      statements:
        - convert_gauge_to_sum("cumulative", true) where metric.name == "couchbase.bucket.operation.count"
        - set(metric.description, "Number of operations on the bucket.") where metric.name == "couchbase.bucket.operation.count"
        - set(metric.unit, "{operations}") where metric.name == "couchbase.bucket.operation.count"

        - convert_gauge_to_sum("cumulative", false) where metric.name == "couchbase.bucket.item.count"
        - set(metric.description, "Number of items that belong to the bucket.") where metric.name == "couchbase.bucket.item.count"
        - set(metric.unit, "{items}") where metric.name == "couchbase.bucket.item.count"

        - convert_gauge_to_sum("cumulative", false) where metric.name == "couchbase.bucket.vbucket.count"
        - set(metric.description, "Number of non-resident vBuckets.") where metric.name == "couchbase.bucket.vbucket.count"
        - set(metric.unit, "{vbuckets}") where metric.name == "couchbase.bucket.vbucket.count"

        - convert_gauge_to_sum("cumulative", false) where metric.name == "couchbase.bucket.memory.usage"
        - set(metric.description, "Usage of total memory available to the bucket.") where metric.name == "couchbase.bucket.memory.usage"
        - set(metric.unit, "By") where metric.name == "couchbase.bucket.memory.usage"

        - convert_gauge_to_sum("cumulative", true) where metric.name == "couchbase.bucket.item.ejection.count"
        - set(metric.description, "Number of item value ejections from memory to disk.") where metric.name == "couchbase.bucket.item.ejection.count"
        - set(metric.unit, "{ejections}") where metric.name == "couchbase.bucket.item.ejection.count"

        - convert_gauge_to_sum("cumulative", true) where metric.name == "couchbase.bucket.error.oom.count"
        - set(metric.description, "Number of out of memory errors.") where metric.name == "couchbase.bucket.error.oom.count"
        - set(metric.unit, "{errors}") where metric.name == "couchbase.bucket.error.oom.count"

        - set(metric.description, "The memory usage at which items will be ejected.") where metric.name == "couchbase.bucket.memory.high_water_mark.limit"
        - set(metric.unit, "By") where metric.name == "couchbase.bucket.memory.high_water_mark.limit"

        - set(metric.description, "The memory usage at which ejections will stop that were previously triggered by a high water mark breach.") where metric.name == "couchbase.bucket.memory.low_water_mark.limit"
        - set(metric.unit, "By") where metric.name == "couchbase.bucket.memory.low_water_mark.limit"

exporters:
  prometheus:
    endpoint: "0.0.0.0:9123"

service:
  pipelines:
    metrics/couchbase:
      receivers: [prometheus/couchbase]
      processors: [filter/couchbase, metricstransform/couchbase, transform/couchbase]
      exporters: [prometheus]

The configuration includes a complex receiver and multiple complex processors. Ideally, a novice user should only need to be concerned with the endpoint and auth values. Most of the complexities of the receiver, and all of the complexities of the processors can be abstracted away such that the following configuration is equivalent:

receivers:
  template/couchbase_metrics:
    endpoint: couchbase:8091
    username: otelu
    password: otelpassword

exporters:
  prometheus:
    endpoint: 0.0.0.0:9123

service:
  pipelines:
    metrics:
      receivers: [template/couchbase_metrics]
      exporters: [prometheus]

The template file, would look like this.

type: couchbase_metrics
template: |
  receivers:
    prometheus:
      config:
        scrape_configs:
          - job_name: 'couchbase'
            scrape_interval: 5s
            static_configs:
              - targets: [{{ .endpoint }}]
            basic_auth:
              username: {{ .username }}
              password: {{ .password }}
            metric_relabel_configs:
              # Include only a few key metrics
              - source_labels: [ __name__ ]
                regex: "(kv_ops)|\
                  (kv_vb_curr_items)|\
                  (kv_num_vbuckets)|\
                  (kv_ep_cursor_memory_freed_bytes)|\
                  (kv_total_memory_used_bytes)|\
                  (kv_ep_num_value_ejects)|\
                  (kv_ep_mem_high_wat)|\
                  (kv_ep_mem_low_wat)|\
                  (kv_ep_tmp_oom_errors)|\
                  (kv_ep_oom_errors)"
                action: keep

  processors:
    filter:
      # Filter out prometheus scraping meta-metrics.
      metrics:
        exclude:
          match_type: strict
          metric_names:
            - scrape_samples_post_metric_relabeling
            - scrape_series_added
            - scrape_duration_seconds
            - scrape_samples_scraped
            - up

    metricstransform:
      transforms:
        # Rename from prometheus metric name to OTel metric name.
        # We cannot do this with metric_relabel_configs, as the prometheus receiver does not
        # allow metric renames at this time.
        - include: kv_ops
          match_type: strict
          action: update
          new_name: "couchbase.bucket.operation.count"
        - include: kv_vb_curr_items
          match_type: strict
          action: update
          new_name: "couchbase.bucket.item.count"
        - include: kv_num_vbuckets
          match_type: strict
          action: update
          new_name: "couchbase.bucket.vbucket.count"
        - include: kv_ep_cursor_memory_freed_bytes
          match_type: strict
          action: update
          new_name: "couchbase.bucket.memory.usage.free"
        - include: kv_total_memory_used_bytes
          match_type: strict
          action: update
          new_name: "couchbase.bucket.memory.usage.used"
        - include: kv_ep_num_value_ejects
          match_type: strict
          action: update
          new_name: "couchbase.bucket.item.ejection.count"
        - include: kv_ep_mem_high_wat
          match_type: strict
          action: update
          new_name: "couchbase.bucket.memory.high_water_mark.limit"
        - include: kv_ep_mem_low_wat
          match_type: strict
          action: update
          new_name: "couchbase.bucket.memory.low_water_mark.limit"
        - include: kv_ep_tmp_oom_errors
          match_type: strict
          action: update
          new_name: "couchbase.bucket.error.oom.count.recoverable"
        - include: kv_ep_oom_errors
          match_type: strict
          action: update
          new_name: "couchbase.bucket.error.oom.count.unrecoverable"
        # Combine couchbase.bucket.error.oom.count.x and couchbase.bucket.memory.usage.x
        # metrics.
        - include: '^couchbase\.bucket\.error\.oom\.count\.(?P<error_type>unrecoverable|recoverable)$$'
          match_type: regexp
          action: combine
          new_name: "couchbase.bucket.error.oom.count"
        - include: '^couchbase\.bucket\.memory\.usage\.(?P<state>free|used)$$'
          match_type: regexp
          action: combine
          new_name: "couchbase.bucket.memory.usage"
        # Aggregate "result" label on operation count to keep label sets consistent across the metric datapoints
        - include: 'couchbase.bucket.operation.count'
          match_type: strict
          action: update
          operations:
            - action: aggregate_labels
              label_set: ["bucket", "op"]
              aggregation_type: sum

    transform:
      metric_statements:
      - context: datapoint
        statements:
          - convert_gauge_to_sum("cumulative", true) where metric.name == "couchbase.bucket.operation.count"
          - set(metric.description, "Number of operations on the bucket.") where metric.name == "couchbase.bucket.operation.count"
          - set(metric.unit, "{operations}") where metric.name == "couchbase.bucket.operation.count"

          - convert_gauge_to_sum("cumulative", false) where metric.name == "couchbase.bucket.item.count"
          - set(metric.description, "Number of items that belong to the bucket.") where metric.name == "couchbase.bucket.item.count"
          - set(metric.unit, "{items}") where metric.name == "couchbase.bucket.item.count"

          - convert_gauge_to_sum("cumulative", false) where metric.name == "couchbase.bucket.vbucket.count"
          - set(metric.description, "Number of non-resident vBuckets.") where metric.name == "couchbase.bucket.vbucket.count"
          - set(metric.unit, "{vbuckets}") where metric.name == "couchbase.bucket.vbucket.count"

          - convert_gauge_to_sum("cumulative", false) where metric.name == "couchbase.bucket.memory.usage"
          - set(metric.description, "Usage of total memory available to the bucket.") where metric.name == "couchbase.bucket.memory.usage"
          - set(metric.unit, "By") where metric.name == "couchbase.bucket.memory.usage"

          - convert_gauge_to_sum("cumulative", true) where metric.name == "couchbase.bucket.item.ejection.count"
          - set(metric.description, "Number of item value ejections from memory to disk.") where metric.name == "couchbase.bucket.item.ejection.count"
          - set(metric.unit, "{ejections}") where metric.name == "couchbase.bucket.item.ejection.count"

          - convert_gauge_to_sum("cumulative", true) where metric.name == "couchbase.bucket.error.oom.count"
          - set(metric.description, "Number of out of memory errors.") where metric.name == "couchbase.bucket.error.oom.count"
          - set(metric.unit, "{errors}") where metric.name == "couchbase.bucket.error.oom.count"

          - set(metric.description, "The memory usage at which items will be ejected.") where metric.name == "couchbase.bucket.memory.high_water_mark.limit"
          - set(metric.unit, "By") where metric.name == "couchbase.bucket.memory.high_water_mark.limit"

          - set(metric.description, "The memory usage at which ejections will stop that were previously triggered by a high water mark breach.") where metric.name == "couchbase.bucket.memory.low_water_mark.limit"
          - set(metric.unit, "By") where metric.name == "couchbase.bucket.memory.low_water_mark.limit"

  pipelines:
    metrics/couchbase:
      receivers: [prometheus]
      processors: [filter, metricstransform, transform]

The parameters specified by the novice user are rendered into this template. Then, the collector merges the rendered template into the overall configuration, yielding an effective configuration that achieves the exact same functionality.

Implementation details

Template Instance IDs

Templated components are used in place of receivers, but their usage in the configuration should seem somewhat intuitive. The notable difference is in the way the template component is identified. The standard for component IDs is <component_type>/[instance_name].

Templated components would be identified very similarly, with the format template/<template_type>[/instance_name].

The literal prefix template indicates that the component is templated.
The template_type is required and must reference a known template type, just as a component_type would reference a known component type.
The instance_name is optional. It serves exactly the same purpose as an instance name for a component. That is, it allows the user to define multiple instances of the template and refer to them distinctly in the configuration.

Template use in pipelines

Just as a normal receiver, a template is defined with an ID, and this ID must be used in one or more pipelines to indicate where data will be emitted.

receivers:
  template/my_template/1: ... # Templated receiver is defined alongside normal receivers
  foo: ...
exporters:
  bar: ...

service:
  pipelines:
    logs:
      receivers: [ template/my_template/1, foo ] # Templated receiver is used in pipeline(s)
      exporters: [ bar ]

Template expansion

The collector will render a template and integrate its components and pipelines into the service graph.

Components and pipelines defined within the template will be "scoped" by appending the template type and instance name to their ID. e.g. my_template/1 appended to each component and pipeline ID defined within the template.

Each pipeline within the template is integrated into the overall service graph as follows:

We autogenerate a forward connector that attaches the templated pipelines to the service pipelines.
This forward connector is used as an exporter in the templated pipelines.
The same forward connector is used as a receiver in the service pipelines, in place of the template ID.

Describe alternatives you've considered

Previously proposed as a receiver. This issue contains discussion about what is currently possible via various configuration merging strategies.

Additional context

The text was updated successfully, but these errors were encountered:

djaglowski · 2023-09-06T21:58:45Z

Based on discussion in today's SIG meeting, I have compiled a list of differences between this approach and the "Template Receiver" proposed in open-telemetry/opentelemetry-collector-contrib#26312.

Template Loading & Validation

Provider

The collector is informed of templates at run time via a new template provider scheme. Basically, the user runs the collector with an additional --config template:/path/to/my_template.yaml for each template.

The collector loads each template file and validates the expected structure.

Receiver

The receiver contains a config field e.g. path which indicates where to find the template file.

The receiver factory reads and validates the file when instantiating the receiver.

Alternate Template Sources

Provider

✅ We should be able to include templates by incorporating other types of providers. e.g. url, s3, etc

Receiver

⚠️ It might be possible for the receiver to tap into "providers" directly. At best this seems like an awkward mechanism for a component to incorporate.

Template Types

Provider

Each template file contains a unique "type", much like a receiver, processor, or exporter type.

✅ Once loaded and validated, each template may be referred to by its type. No additional information is required in order to refer to a given template.

⛔ Types must be unique. This can easily be caught at runtime, but a simple namespace pattern may be helpful.

Receiver

There is no notion of types. Each instance of the receiver has a path to a template file.

⛔ If multiple receivers refer to the same template, they must each specify the path to the file.

Component Configuration

Provider

⛔ To define a templated component, the component ID is prefixed with template/ to indicate that the component is templated. e.g. template/couchbase_metrics or ``template/couchbase_metrics/1`

✅ Template parameters are specified just like any other component.

receivers:
  template/my_template:
    foo: bar

Receiver

✅ template is an actual receiver type, so it is used in configuration just like any other type of receiver.

⛔ Template parameters are specified in a sub-section of the configuration because the top level must separately define the source of the template.

receivers:
  template/1:
    path: ./my_template.yaml
    parameters:
      foo: bar

Effective Configuration

Provider

When the template is rendered, the components and pipelines it contains are merged into the overall configuration.

⛔ Exposing the internals of the template is arguably against the general purpose of template, which are should abstract away complexity from the user.

✅ The effective configuration is accurate, even if it contains some elements which must be understood as internal to the template.

Receiver

✅ The effective configuration closely matches the level of abstraction familiar to the user.

⛔ Technically, the effective configuration is not accurate because it omits the internals of the template. It's not clear that there is any mechanism which would surface these details.

Service Management

Provider

The template is expanded and merged into the overall configuration.

✅ Service::Telemetry configuration is naturally the same as the rest of the collector.

✅ Rendered components and pipelines are started & stopped as part of normal collector lifecycle.

Receiver

The receiver runs an internal service which contains the templated components.

⛔ It's not clear that there is a mechanism to inherit telemetry settings from the main service.

⛔ The receiver must manage the independent service instance.

djaglowski · 2023-09-06T22:01:28Z

@jpkrohling, I hope I've represented your perspective fairly in the above comparison. I'm happy to update the list if not.

djaglowski · 2023-09-06T22:03:06Z

cc: @tigrannajaryan @jsuereth @jkowall @joshdover

jkowall · 2023-09-06T22:49:05Z

Sorry @djaglowski im not working on otel for the last 10 months.

jpkrohling · 2023-09-08T12:18:20Z

I believe your summary did capture what we talked about during the call. I'm convinced that the config provider approach is reasonable and provides a good set of features despite still thinking that the UX for the template provider is friendlier.

codeboten · 2023-09-14T20:06:40Z

I'm convinced that the config provider approach is reasonable and provides a good set of features despite still thinking that the UX for the template provider is friendlier.

@jpkrohling you mention provider twice, did you mean "UX for the template receiver is friendlier"?

tigrannajaryan · 2023-09-14T22:58:00Z

@djaglowski is it possible to have 2 template/couchbase_metrics receivers with different endpoints and attached to 2 different pipelines? Do you have to make the pipeline name a template parameter in that case?

Are there any limitations on what keys may be used in a template definition? (Can I add exporters, connectors, etc?)

djaglowski · 2023-09-15T01:44:36Z

@djaglowski is it possible to have 2 template/couchbase_metrics receivers with different endpoints and attached to 2 different pipelines?

Yes. What I'm proposing here is that each use of a template has the same level of uniqueness as you would have when using a normal component: Just as you would use otlp and otlp/2 as distinct receivers, you can use template/foo and template/foo/2 as distinct receivers.

If the normal component ID format is component_type[/instance_name], the templated component ID format would be template/ followed by template_type/[instance_name]. Effectively, we reserve template as a special "receiver type" and trigger the special behavior of rendering a template and substituting it in place of the receiver.

Do you have to make the pipeline name a template parameter in that case?

No, the user would use the template ID in whichever pipelines they choose, exactly as they would use a receiver ID.

receivers:
  template/couchbase_metrics:
    endpoint: couchbase:8091
    username: otelu
    password: otelpassword
  template/couchbase_metrics/another:
    endpoint: somethingelse:9999
    username: user
    password: pass

exporters:
  prometheus:
    endpoint: 0.0.0.0:9123

service:
  pipelines:
    metrics:
      receivers: [template/couchbase_metrics]
      exporters: [prometheus]
    metrics/2:
      receivers: [template/couchbase_metrics/another]
      exporters: [prometheus]

Are there any limitations on what keys may be used in a template definition? (Can I add exporters, connectors, etc?)

What I am proposing immediately is that yes, there are restrictions. A rendered template would have a format very similar to a normal config file. However, it would be slimmed down slightly:

receivers:
  foo:
  foo/2:
processors:
  bar:
  bar/2:
# exporters not allowed
# connectors not allowed

# service not allowed. Use "pipelines" directly
pipelines:
  logs:
    receivers: [foo, foo/2]
    processors: [bar, bar/2]
    # exporters not allowed. The template _is_ a receiver in a pipeline, so there is implicitly an "exporter"
    # which passes data onto the pipeline(s) in which the template is used.
  # additional pipelines here, just as in a normal config, except "exporters" is implied.

That said, I believe we can reasonably add support for connectors and exporters from here, if we choose. Here's how:

Keeping in mind that this template is acting as a receiver, we should emit data onto the pipeline(s) in which the template is used as a receiver. To that end, I think that every template would be required to include an autogenerated "forward" connector as an exporter from the template. The definition of the connector is implied, but it must be used in at least one pipeline in order for the template to make sense as a receiver.

receivers:
  foo:
    endpoint: {{ .host }}:{{ .port }}
processors:
  bar:
    hello: {{ .name }}
exporters:
  {{ if .copy_data_file }}
  file: # we'll use this to send a copy of all data emitted by the template into a file
    path: {{ .copy_data_file }}
  {{ end }}

pipelines:
  logs:
    receivers: [ foo ]
    processors: [ bar ]
    exporters:
      - forward # autogenerated connector which is used to forward data onto the pipeline(s) in which the template is used.
    {{ if .copy_data_file }}
      - file
    {{ end }}

Tying this back in with your earlier question about using a template multiple times, it's important to understand that all components within the template will be "scoped" by having /template_type[/instance_name] appended to their ID, such that when we expand the template into a configuration, all components will still be uniquely identified. (Technically is it possible to have collisions but this can be addressed in reasonable ways. Setting this aside for now..)

For example, let's say the above template type is called foo_bar. We could use it as follows and expect the corresponding effective configuration:

Actual configuration

receivers:
  template/foo_bar:
    host: localhost
    port: 1234
    name: Tigran
    copy_data_file: ./myfile.json
  template/foo_bar/2:
    host: localhost
    port: 6789
    name: Dan
exporters:
  otlp: ...

service:
  pipelines:
    logs:
      receivers: [ template/foo_bar, template/foo_bar/2 ]
      exporters: [ otlp ]
    logs/only_2:
      receivers: [ template/foo_bar/2 ]
      exporters: [ otlp ]

Effective configuration

receivers:
  foo/foo_bar: # Inserted from first use of template. Note the ID is scoped according to the template instance ID
    endpoint: localhost:1234
  foo/foo_bar/2: # Inserted from second use of template. 
    endpoint: localhost:6789
processors:
  bar/foo_bar: # From first use of template
    hello: Tigran
  bar/foo_bar/2: # From second use of template
    hello: Dan
exporters:
  otlp: ... # included directly in the actual config
  file/foo_bar: # From first use of template. No corresponding component was rendered in second use.
    path: ./myfile.json
connectors:
  forward/foo_bar:   # autogenerated to pass data from "template/foo_bar" to where it was used. Internally, the template just refers to this as "forward". The "/foo_bar" is added as "scope", just the same as all other components within the template.
  forward/foo_bar/2: # autogenerated to pass data from "template/foo_bar/2" to where it was used

service:
  pipelines:
    # These first two were generated from the template, one for each instance.
    logs/foo_bar:
      receivers: [ foo/foo_bar ] 
      processors: [ bar/foo_bar ] 
      exporters: [ forward/foo_bar, file/foo_bar ]
    logs/foo_bar/2:
      receivers: [ foo/foo_bar/2 ] 
      processors: [ bar/foo_bar/2 ] 
      exporters: [ forward/foo_bar/2 ]

    # The next two are the original pipelines defined in the config. Note that we've replaced the references
    # to the template instances with the corresponding forward connector instances.
    logs:
      receivers: [ forward/foo_bar, forward/foo_bar/2 ]
      exporters: [ otlp ]
    logs/only_2:
      receivers: [ forward/foo_bar/2 ]
      exporters: [ otlp ]

tigrannajaryan · 2023-09-15T16:30:44Z

OK, I think I understand now.

I think it is important to add to the original issue description these 2 key ideas that you refer to in your comment:

The instantiated components and pipelines are scoped by appending the instance id.
We autogenerate a forward connector that attaches the instantiated pipelines to the service pipelines.

The net result is that essentially the template declares its own connected graph of receivers, processors and pipelines, and once instanced the output of that graph autoconnects using a forward connector to the input of the pipeline with which the instantiated receiver is associated via the service.pipelines.<name>.receivers key. Is that correct?

Question: is it required for template definition to contain processors and pipelines entries or they are optional and if absent we just connect the instantiated receiver to the service pipeline directly?

djaglowski · 2023-09-15T17:00:03Z

Thanks @tigrannajaryan, I've updated the issue to include these key ideas.

is it required for template definition to contain processors and pipelines entries or they are optional and if absent we just connect the instantiated receiver to the service pipeline directly?

I believe what you are suggesting is possible. It should not be difficult to support both cases.

jpkrohling · 2023-09-19T13:18:27Z

@codeboten, yes! I did mean to say that I find the receiver having better UX. After @djaglowski's presentations and arguments, I no longer have a strong opinion in favor of the receiver despite still lightly leaning toward it.

djaglowski mentioned this issue Sep 6, 2023

New component: Template Receiver open-telemetry/opentelemetry-collector-contrib#26312

Closed

2 tasks

This was referenced Sep 22, 2023

Add template provider #8504

Closed

Add template converter #8507

Closed

djaglowski mentioned this issue Nov 16, 2023

Recursive unmarshaling of components #8940

Open

djaglowski mentioned this issue Jan 12, 2024

Improve Kubernetes Logs Collection Experience open-telemetry/opentelemetry-collector-contrib#25251

Closed

jsoriano mentioned this issue Aug 20, 2024

POC of integration receiver and processor elastic/opentelemetry-collector-components#96

Draft

mx-psi mentioned this issue Oct 25, 2024

Simplifying stateful behavior open-telemetry/opentelemetry-collector-contrib#35013

Open

VihasMakwana mentioned this issue Oct 31, 2024

Simplify Collector Configuration for Users open-telemetry/opentelemetry-collector-contrib#36116

Open

jsoriano mentioned this issue Nov 8, 2024

[RFC] OTel collector modules #11631

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Template provider #8372

Template provider #8372

djaglowski commented Sep 6, 2023 •

edited

Loading

djaglowski commented Sep 6, 2023

djaglowski commented Sep 6, 2023

djaglowski commented Sep 6, 2023

jkowall commented Sep 6, 2023

jpkrohling commented Sep 8, 2023 •

edited

Loading

codeboten commented Sep 14, 2023

tigrannajaryan commented Sep 14, 2023

djaglowski commented Sep 15, 2023

tigrannajaryan commented Sep 15, 2023

djaglowski commented Sep 15, 2023

jpkrohling commented Sep 19, 2023

Template provider #8372

Template provider #8372

Comments

djaglowski commented Sep 6, 2023 • edited Loading

Implementation details

Template Instance IDs

Template use in pipelines

Template expansion

djaglowski commented Sep 6, 2023

Template Loading & Validation

Provider

Receiver

Alternate Template Sources

Provider

Receiver

Template Types

Provider

Receiver

Component Configuration

Provider

Receiver

Effective Configuration

Provider

Receiver

Service Management

Provider

Receiver

djaglowski commented Sep 6, 2023

djaglowski commented Sep 6, 2023

jkowall commented Sep 6, 2023

jpkrohling commented Sep 8, 2023 • edited Loading

codeboten commented Sep 14, 2023

tigrannajaryan commented Sep 14, 2023

djaglowski commented Sep 15, 2023

tigrannajaryan commented Sep 15, 2023

djaglowski commented Sep 15, 2023

jpkrohling commented Sep 19, 2023

djaglowski commented Sep 6, 2023 •

edited

Loading

jpkrohling commented Sep 8, 2023 •

edited

Loading