postgres scaler uses full connection string in metric name #1200

ycabrer · 2020-09-28T22:27:43Z

The metric name is exposing the full connection string as the metric name. Ideally, we wouldn't have a metric name with the credentials and URL to hit postgres.

Expected Behavior

A metric name that does not expose DB credentials

Actual Behavior

A metric name that exposes DB credentials

Steps to Reproduce the Problem

Create a postgres scaler with s.metadata.connection defined
Observe full connection string in External Metric Names status

keda/pkg/scalers/postgresql_scaler.go

Line 186 in 934d309

    
           metricName = kedautil.NormalizeString(fmt.Sprintf("%s-%s", metricName, s.metadata.connection))

Specifications

KEDA Version: V2 Beta
Platform & Version: EKS
Kubernetes Version: 1.17.x
Scaler(s): Postgres

The text was updated successfully, but these errors were encountered:

zroubalik · 2020-09-30T07:42:34Z

@ycabrer we are doing this, because we need a way how to generate unique Metric Name for each Trigger in the ScaledObject. Do you have any idea how to modify that part to not contain full connection string, but the resulting Metric Name is unique (and always the same for the same Trigger?).

ycabrer · 2020-09-30T17:33:51Z

@zroubalik That makes sense, I don't think it will work though if multiple triggers use the same connection string. Two triggers with separate queries and the same connection will end up with the same name.

What would be truly unique is somehow a combination of the query being used, database host, and name. A hash value? That's not very human friendly. Ideally, the metric name would be easy to identify and correlate with the trigger. Maybe this isn't a problem Keda should be trying to solve programmatically.

The Prometheus scaler allows you to define a metric name. Like Postgres, Prometheus can potentially have many triggers associated with the same host. I think the same approach should be taken here and have it documented that it must be unique.

zroubalik · 2020-09-30T19:17:00Z

@ycabrer yeah, that's true. We were thinking about this and the benefit is, that user don't have to care about this setting (as it is KEDA internal only) and another point is, that other scalers solve this programatically as well, so we have the same experience for all scalers.

Looking at the code, you are right about the the connection setting and uniqueness, we should fix that first, so it is a combination of both, so it is not broken and than we can try to find a solution for this particular issue (maybe a combination of hashed connection string + query?)

ycabrer · 2020-10-01T17:46:57Z

@zroubalik

Issue 1 (Password in Metric Name):
We can remove the password from the connection string by parsing it as a URL, grabbing the password, and doing a string replacement. That's how the net/http package seems to handle masking credentials.

Issue 2 (Uniqueness of Metric Name):
I like the idea of basing it off of the connection string (without the password) and the query. My only concern is that it may potentially be a very long metric name. I'm not sure if Kubernetes has a character limit for this but it does affect the readability of the created HPA.

I can't think of a better way without adding a metric name field to Postgres metadata struct like the Prometheus scaler. It is kind of a pain though to make a change like that after v2 Keda scaler has already been defined.

ycabrer · 2020-10-01T17:48:46Z

Let me know your thoughts. I can't seem to get out of my thought loop. There might be a better way.

zroubalik · 2020-11-16T16:32:48Z

@ycabrer sorry for the delay. It seems like you are right, for Postres scaler it would make sense to add a metric name field the same way it has been added to Prometheus, as both scalers are based on a long query.

And we cannot assure uniqueness from the other fields, am I right?

To mitigate the problems with changing the scaler API, we could do:

add metric name to the spec as an optional field
if the metric name is not set, try to generate the name programatically (without exposing the password)
add a general check to the main Reconciler loop, to check that metric names in one ScaledObject are unique enough, which should be done anyway )no matter which scaler is being used). This check would inform user via error message/scaledobject status etc that he should modify the metric name.

wdyt?

ycabrer · 2020-11-25T04:51:01Z

@zroubalik That's a good idea!

I will open a PR for the metric name issue.

I will also look into the reconciler loop check.

ycabrer · 2020-11-26T05:15:22Z

@zroubalik Is this what you were thinking of for the metric name uniqueness check?

It only checks for uniqueness within the same ScaledObject.

ycabrer@8427b62

zroubalik · 2020-11-26T09:57:49Z

@ycabrer yeah that's exactly what I was thinking about :)

zroubalik · 2020-12-01T10:13:48Z

@ycabrer just checking the status. Are you planning to open another PR with the check, please? Or should we track it in some issue as something that's needs to be done. Thanks :)

ycabrer · 2020-12-01T15:37:45Z

@zroubalik Yes! Let me add a couple of tests for it first and update the changelog/docs.

stale · 2021-10-13T20:08:44Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.

stale · 2021-10-20T21:06:04Z

This issue has been automatically closed due to inactivity.

ycabrer added the bug Something isn't working label Sep 28, 2020

ycabrer mentioned this issue Nov 25, 2020

Postgres: optional metricName & mask passwords #1381

Merged

4 tasks

zroubalik mentioned this issue Nov 30, 2020

feat: adding influxdb scaler #1326

Merged

4 tasks

stale bot added the stale All issues that are marked as stale due to inactivity label Oct 13, 2021

stale bot closed this as completed Oct 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

postgres scaler uses full connection string in metric name #1200

postgres scaler uses full connection string in metric name #1200

ycabrer commented Sep 28, 2020

zroubalik commented Sep 30, 2020

ycabrer commented Sep 30, 2020

zroubalik commented Sep 30, 2020

ycabrer commented Oct 1, 2020

ycabrer commented Oct 1, 2020

zroubalik commented Nov 16, 2020 •

edited

Loading

ycabrer commented Nov 25, 2020

ycabrer commented Nov 26, 2020

zroubalik commented Nov 26, 2020

zroubalik commented Dec 1, 2020

ycabrer commented Dec 1, 2020

stale bot commented Oct 13, 2021

stale bot commented Oct 20, 2021

postgres scaler uses full connection string in metric name #1200

postgres scaler uses full connection string in metric name #1200

Comments

ycabrer commented Sep 28, 2020

Expected Behavior

Actual Behavior

Steps to Reproduce the Problem

Specifications

zroubalik commented Sep 30, 2020

ycabrer commented Sep 30, 2020

zroubalik commented Sep 30, 2020

ycabrer commented Oct 1, 2020

ycabrer commented Oct 1, 2020

zroubalik commented Nov 16, 2020 • edited Loading

ycabrer commented Nov 25, 2020

ycabrer commented Nov 26, 2020

zroubalik commented Nov 26, 2020

zroubalik commented Dec 1, 2020

ycabrer commented Dec 1, 2020

stale bot commented Oct 13, 2021

stale bot commented Oct 20, 2021

zroubalik commented Nov 16, 2020 •

edited

Loading