Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ROX-21530: Updating Cert Expiry Metrics #2063

Merged
merged 7 commits into from
Oct 15, 2024

Conversation

aaa5kameric
Copy link
Contributor

@aaa5kameric aaa5kameric commented Oct 8, 2024

Description

Background: Implement monitoring for certificate expiration, tracking and managing of digital certificates expiration dates. Extracts timestamps from certificates and exposes metrics to Prometheus. (FIXED)

In response to error in: stackrox/rhacs-observability-resources#279
Problem: acs_fleetshard_certificate_expiration_timestamp - seems to retain all certificates even after their namespaces have been deleted. Combined with the ephemeral instances created by the probe, this creates a very large number of time series.
Solution: Fixed the handling of metric deletion by:
-adding namespace informer
-implemented DeletePartialMatch instead of Delete

Previous Certificate Monitoring PR: #1946
Previous Certificate Alerting PR: stackrox/rhacs-observability-resources#276

Tests

Created k9s environment locally with pods/namespaces. Used create-central.sh and delete-central.sh scripts to test local host metric creation and deletion.

@aaa5kameric
Copy link
Contributor Author

/retest

1 similar comment
@ludydoo
Copy link
Collaborator

ludydoo commented Oct 9, 2024

/retest

@kurlov
Copy link
Member

kurlov commented Oct 9, 2024

You need to rebase to fix CI pre commit

Copy link
Contributor

openshift-ci bot commented Oct 15, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: aaa5kameric, ludydoo

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@aaa5kameric aaa5kameric merged commit 4ca077d into main Oct 15, 2024
6 checks passed
@aaa5kameric aaa5kameric deleted the ROX-21530-Updating-Cert-Expiry-Metrics branch October 15, 2024 13:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants