Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create ServiceMonitors to gather metrics from the OTEL Operands #1874

Merged
merged 38 commits into from
Jul 25, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
38 commits
Select commit Hold shift + click to select a range
621888a
Allow the creation of ServiceMonitors to gather metrics from the Open…
iblancasa Jun 27, 2023
e5fb93f
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jun 28, 2023
66d5660
Add missing changelog
iblancasa Jun 28, 2023
ab26178
Fix unprotected statement
iblancasa Jun 28, 2023
796b323
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jun 28, 2023
80e25f5
Fix lint issues
iblancasa Jun 28, 2023
7880104
Merge branch 'main' into feature/1768
iblancasa Jun 28, 2023
638853f
Merge branch 'feature/1768' of github.com:iblancasa/opentelemetry-ope…
iblancasa Jun 28, 2023
4d5e3d7
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jun 28, 2023
3c2bc7d
Merge branch 'main' into feature/1768
iblancasa Jun 30, 2023
42c6c90
Merge branch 'feature/1768' of github.com:iblancasa/opentelemetry-ope…
iblancasa Jul 10, 2023
27e2a7e
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jul 10, 2023
4e718df
Apply changes requested in code review
iblancasa Jul 10, 2023
90b070b
Add missing generated files
iblancasa Jul 11, 2023
a5953e7
Change the way to enable the feature flag
iblancasa Jul 11, 2023
5604f1f
Change the way to enable the feature flag
iblancasa Jul 11, 2023
3101866
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jul 11, 2023
8fe848c
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jul 12, 2023
7a66076
Fix merge
iblancasa Jul 12, 2023
17e4bb5
Fix enable feature flag
iblancasa Jul 12, 2023
4f79a2d
Merge branch 'main' into feature/1768
iblancasa Jul 13, 2023
b50e2ac
Merge branch 'feature/1768' of github.com:iblancasa/opentelemetry-ope…
iblancasa Jul 13, 2023
4653d83
Merge branch 'main' into feature/1768
iblancasa Jul 14, 2023
7bf6584
Merge branch 'main' into feature/1768
iblancasa Jul 14, 2023
9aade8b
Merge branch 'feature/1768' of github.com:iblancasa/opentelemetry-ope…
iblancasa Jul 17, 2023
f9fca96
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jul 17, 2023
f8af040
Change the name of the option and move the E2E tests to their own folder
iblancasa Jul 17, 2023
3791100
Fix unit test
iblancasa Jul 17, 2023
bd81e57
Fix docs
iblancasa Jul 17, 2023
cb1f6ba
Fix CRD field
iblancasa Jul 17, 2023
a259991
Fix CRD field
iblancasa Jul 17, 2023
25effa4
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jul 19, 2023
175c0dc
Add from version to feature gate
iblancasa Jul 19, 2023
61ee15a
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jul 21, 2023
1ca9357
Merge branch 'main' into feature/1768
iblancasa Jul 24, 2023
1dc416a
Merge branch 'feature/1768' of github.com:iblancasa/opentelemetry-ope…
iblancasa Jul 24, 2023
297af31
Move the E2E tests to their own section for the CI
iblancasa Jul 24, 2023
83d31d5
Merge branch 'main' of github.com:open-telemetry/opentelemetry-operat…
iblancasa Jul 25, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
16 changes: 16 additions & 0 deletions .chloggen/1768-gather-metrics-from-collectors.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
# One of 'breaking', 'deprecation', 'new_component', 'enhancement', 'bug_fix'
change_type: enhancement

# The name of the component, or a single word describing the area of concern, (e.g. operator, target allocator, github action)
component: operator

# A brief description of the change. Surround your text with quotes ("") if it needs to start with a backtick (`).
note: Add the ability to the operator to create Service Monitors for the OpenTelemetry Collectors in order to gather the metrics they are generating

# One or more tracking issues related to the change
issues: [1768]

# (Optional) One or more lines of additional information to render under the primary note.
# These lines will be padded with 2 spaces and then inserted directly into the document.
# Use pipe (|) for multiline entries.
subtext:
2 changes: 1 addition & 1 deletion .github/workflows/e2e.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -29,10 +29,10 @@ jobs:
- "1.27"
group:
- e2e e2e-upgrade
- e2e-prometheuscr
- e2e-autoscale

steps:

- name: Set up Go
uses: actions/setup-go@v4
with:
Expand Down
14 changes: 14 additions & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -176,6 +176,11 @@ generate: controller-gen api-docs
e2e:
$(KUTTL) test

# end-to-end-test for PrometheusCR E2E tests
.PHONY: e2e-prometheuscr
e2e-prometheuscr:
$(KUTTL) test --config kuttl-test-prometheuscr.yaml

# end-to-end-test for testing upgrading
.PHONY: e2e-upgrade
e2e-upgrade: undeploy
Expand All @@ -200,6 +205,11 @@ e2e-log-operator:
prepare-e2e: kuttl set-image-controller container container-target-allocator container-operator-opamp-bridge start-kind cert-manager install-metrics-server load-image-all deploy
TARGETALLOCATOR_IMG=$(TARGETALLOCATOR_IMG) SED_BIN="$(SED)" ./hack/modify-test-images.sh

.PHONY: enable-prometheus-feature-flag
enable-prometheus-feature-flag:
$(SED) -i "s#--feature-gates=+operator.autoinstrumentation.go#--feature-gates=+operator.autoinstrumentation.go,+operator.observability.prometheus#g" config/default/manager_auth_proxy_patch.yaml


.PHONY: scorecard-tests
scorecard-tests: operator-sdk
$(OPERATOR_SDK) scorecard -w=5m bundle || (echo "scorecard test failed" && exit 1)
Expand Down Expand Up @@ -238,6 +248,10 @@ endif
install-metrics-server:
./hack/install-metrics-server.sh

.PHONY: install-prometheus-operator
install-prometheus-operator:
./hack/install-prometheus-operator.sh

.PHONY: load-image-all
load-image-all: load-image-operator load-image-target-allocator load-image-operator-opamp-bridge

Expand Down
29 changes: 29 additions & 0 deletions apis/v1alpha1/opentelemetrycollector_types.go
Original file line number Diff line number Diff line change
Expand Up @@ -190,6 +190,14 @@ type OpenTelemetryCollectorSpec struct {
// https://kubernetes.io/docs/concepts/workloads/pods/init-containers/
// +optional
InitContainers []v1.Container `json:"initContainers,omitempty"`

// ObservabilitySpec defines how telemetry data gets handled.
//
// +optional
// +kubebuilder:validation:Optional
// +operator-sdk:csv:customresourcedefinitions:type=spec,displayName="Observability"
Observability ObservabilitySpec `json:"observability,omitempty"`

// TopologySpreadConstraints embedded kubernetes pod configuration option,
// controls how pods are spread across your cluster among failure-domains
// such as regions, zones, nodes, and other user-defined topology domains
Expand Down Expand Up @@ -371,6 +379,27 @@ type AutoscalerSpec struct {
TargetMemoryUtilization *int32 `json:"targetMemoryUtilization,omitempty"`
}

// MetricsConfigSpec defines a metrics config.
type MetricsConfigSpec struct {
// EnableMetrics specifies if ServiceMonitors should be created for the OpenTelemetry Collector.
// The operator.observability.prometheus feature gate must be enabled to use this feature.
//
// +optional
// +kubebuilder:validation:Optional
// +operator-sdk:csv:customresourcedefinitions:type=spec,displayName="Create ServiceMonitors for OpenTelemetry Collector"
EnableMetrics bool `json:"enableMetrics,omitempty"`
}

// ObservabilitySpec defines how telemetry data gets handled.
type ObservabilitySpec struct {
// Metrics defines the metrics configuration for operands.
//
// +optional
// +kubebuilder:validation:Optional
// +operator-sdk:csv:customresourcedefinitions:type=spec,displayName="Metrics Config"
Metrics MetricsConfigSpec `json:"metrics,omitempty"`
}

// Probe defines the OpenTelemetry's pod probe config. Only Liveness probe is supported currently.
type Probe struct {
// Number of seconds after the container has started before liveness probes are initiated.
Expand Down
32 changes: 32 additions & 0 deletions apis/v1alpha1/zz_generated.deepcopy.go

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Original file line number Diff line number Diff line change
Expand Up @@ -76,6 +76,18 @@ spec:
- kind: StatefulSets
name: ""
version: apps/v1
specDescriptors:
- description: ObservabilitySpec defines how telemetry data gets handled.
displayName: Observability
path: observability
- description: Metrics defines the metrics configuration for operands.
displayName: Metrics Config
path: observability.metrics
- description: EnableMetrics specifies if ServiceMonitors should be created
for the OpenTelemetry Collector. The operator.observability.prometheus feature
gate must be enabled to use this feature.
displayName: Create ServiceMonitors for OpenTelemetry Collector
path: observability.metrics.enableMetrics
version: v1alpha1
description: |-
OpenTelemetry is a collection of tools, APIs, and SDKs. You use it to instrument, generate, collect, and export telemetry data (metrics, logs, and traces) for analysis in order to understand your software's performance and behavior.
Expand Down Expand Up @@ -209,6 +221,18 @@ spec:
- get
- list
- update
- apiGroups:
- monitoring.coreos.com
resources:
- servicemonitors
verbs:
- create
- delete
- get
- list
- patch
- update
- watch
- apiGroups:
- networking.k8s.io
resources:
Expand Down
13 changes: 13 additions & 0 deletions bundle/manifests/opentelemetry.io_opentelemetrycollectors.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -2949,6 +2949,19 @@ spec:
This is only relevant to daemonset, statefulset, and deployment
mode
type: object
observability:
description: ObservabilitySpec defines how telemetry data gets handled.
properties:
metrics:
description: Metrics defines the metrics configuration for operands.
properties:
enableMetrics:
description: EnableMetrics specifies if ServiceMonitors should
be created for the OpenTelemetry Collector. The operator.observability.prometheus
feature gate must be enabled to use this feature.
type: boolean
type: object
type: object
podAnnotations:
additionalProperties:
type: string
Expand Down
13 changes: 13 additions & 0 deletions config/crd/bases/opentelemetry.io_opentelemetrycollectors.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -2946,6 +2946,19 @@ spec:
This is only relevant to daemonset, statefulset, and deployment
mode
type: object
observability:
description: ObservabilitySpec defines how telemetry data gets handled.
properties:
metrics:
description: Metrics defines the metrics configuration for operands.
properties:
enableMetrics:
description: EnableMetrics specifies if ServiceMonitors should
be created for the OpenTelemetry Collector. The operator.observability.prometheus
feature gate must be enabled to use this feature.
type: boolean
type: object
type: object
podAnnotations:
additionalProperties:
type: string
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -56,6 +56,18 @@ spec:
- kind: StatefulSets
name: ""
version: apps/v1
specDescriptors:
- description: ObservabilitySpec defines how telemetry data gets handled.
displayName: Observability
path: observability
- description: Metrics defines the metrics configuration for operands.
displayName: Metrics Config
path: observability.metrics
- description: EnableMetrics specifies if ServiceMonitors should be created
for the OpenTelemetry Collector. The operator.observability.prometheus feature
gate must be enabled to use this feature.
displayName: Create ServiceMonitors for OpenTelemetry Collector
path: observability.metrics.enableMetrics
version: v1alpha1
description: |-
OpenTelemetry is a collection of tools, APIs, and SDKs. You use it to instrument, generate, collect, and export telemetry data (metrics, logs, and traces) for analysis in order to understand your software's performance and behavior.
Expand Down
12 changes: 12 additions & 0 deletions config/rbac/role.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -119,6 +119,18 @@ rules:
- get
- list
- update
- apiGroups:
- monitoring.coreos.com
resources:
- servicemonitors
verbs:
- create
- delete
- get
- list
- patch
- update
- watch
- apiGroups:
- networking.k8s.io
resources:
Expand Down
11 changes: 11 additions & 0 deletions controllers/opentelemetrycollector_controller.go
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,7 @@ import (
"sync"

"github.com/go-logr/logr"
monitoringv1 "github.com/prometheus-operator/prometheus-operator/pkg/apis/monitoring/v1"
appsv1 "k8s.io/api/apps/v1"
autoscalingv2 "k8s.io/api/autoscaling/v2"
autoscalingv2beta2 "k8s.io/api/autoscaling/v2beta2"
Expand All @@ -35,6 +36,7 @@ import (
"github.com/open-telemetry/opentelemetry-operator/internal/config"
"github.com/open-telemetry/opentelemetry-operator/pkg/autodetect"
"github.com/open-telemetry/opentelemetry-operator/pkg/collector/reconcile"
"github.com/open-telemetry/opentelemetry-operator/pkg/featuregate"
)

// OpenTelemetryCollectorReconciler reconciles a OpenTelemetryCollector object.
Expand Down Expand Up @@ -164,6 +166,11 @@ func NewReconciler(p Params) *OpenTelemetryCollectorReconciler {
"ingresses",
true,
},
{
reconcile.ServiceMonitors,
"service monitors",
true,
},
iblancasa marked this conversation as resolved.
Show resolved Hide resolved
{
reconcile.Self,
"opentelemetry",
Expand Down Expand Up @@ -251,6 +258,10 @@ func (r *OpenTelemetryCollectorReconciler) SetupWithManager(mgr ctrl.Manager) er
Owns(&appsv1.DaemonSet{}).
Owns(&appsv1.StatefulSet{})

if featuregate.PrometheusOperatorIsAvailable.IsEnabled() {
builder.Owns(&monitoringv1.ServiceMonitor{})
}

autoscalingVersion := r.config.AutoscalingVersion()
if autoscalingVersion == autodetect.AutoscalingVersionV2 {
builder = builder.Owns(&autoscalingv2.HorizontalPodAutoscaler{})
Expand Down
61 changes: 61 additions & 0 deletions docs/api.md
Original file line number Diff line number Diff line change
Expand Up @@ -3736,6 +3736,13 @@ OpenTelemetryCollectorSpec defines the desired state of OpenTelemetryCollector.
NodeSelector to schedule OpenTelemetry Collector pods. This is only relevant to daemonset, statefulset, and deployment mode<br/>
</td>
<td>false</td>
</tr><tr>
<td><b><a href="#opentelemetrycollectorspecobservability">observability</a></b></td>
<td>object</td>
<td>
ObservabilitySpec defines how telemetry data gets handled.<br/>
</td>
<td>false</td>
</tr><tr>
<td><b>podAnnotations</b></td>
<td>map[string]string</td>
Expand Down Expand Up @@ -8985,6 +8992,60 @@ Liveness config for the OpenTelemetry Collector except the probe handler which i
</table>


### OpenTelemetryCollector.spec.observability
<sup><sup>[↩ Parent](#opentelemetrycollectorspec)</sup></sup>



ObservabilitySpec defines how telemetry data gets handled.

<table>
<thead>
<tr>
<th>Name</th>
<th>Type</th>
<th>Description</th>
<th>Required</th>
</tr>
</thead>
<tbody><tr>
<td><b><a href="#opentelemetrycollectorspecobservabilitymetrics">metrics</a></b></td>
<td>object</td>
<td>
Metrics defines the metrics configuration for operands.<br/>
</td>
<td>false</td>
</tr></tbody>
</table>


### OpenTelemetryCollector.spec.observability.metrics
<sup><sup>[↩ Parent](#opentelemetrycollectorspecobservability)</sup></sup>



Metrics defines the metrics configuration for operands.

<table>
<thead>
<tr>
<th>Name</th>
<th>Type</th>
<th>Description</th>
<th>Required</th>
</tr>
</thead>
<tbody><tr>
<td><b>enableMetrics</b></td>
<td>boolean</td>
<td>
EnableMetrics specifies if ServiceMonitors should be created for the OpenTelemetry Collector. The operator.observability.prometheus feature gate must be enabled to use this feature.<br/>
</td>
<td>false</td>
</tr></tbody>
</table>


### OpenTelemetryCollector.spec.podSecurityContext
<sup><sup>[↩ Parent](#opentelemetrycollectorspec)</sup></sup>

Expand Down
Loading