This repository has been archived by the owner on Jan 20, 2022. It is now read-only.

Make discovery poll interval configurable #397

Merged

merged 2 commits into kopeio:master from zetaab:feature/discoveryinterval on Feb 26, 2021

Conversation

@zetaab (Contributor) commented Feb 7, 2021

The current implementation polls discovery once per minute. However, with, for instance, a kOps cluster we have 6 etcd-manager processes, which means the cloud provider API is queried for status every 10 seconds on average. That is 8640 queries/day in total, and on OpenStack / GCP each query also translates into roughly 3 compute API calls, so about 8640 * 3 ≈ 26k compute API calls/day. In my opinion this is too much, which is why I would like to make this value configurable.

It is not a problem if we have a single cluster in the whole cloud, but with, for instance, 100 kOps clusters those numbers get pretty high.
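As a quick sanity check on the numbers above, here is a small back-of-the-envelope calculation (illustrative only; the 6 processes and the ~3 compute API calls per poll are the assumptions stated in the description):

```go
package main

import "fmt"

func main() {
	const (
		processes       = 6       // etcd-manager processes in a typical kOps cluster (assumption from the description)
		pollsPerDayEach = 24 * 60 // one discovery poll per minute per process
		apiCallsPerPoll = 3       // rough OpenStack/GCP multiplier from the description
	)

	pollsPerDay := processes * pollsPerDayEach
	fmt.Println("discovery polls per day:", pollsPerDay)                   // 8640
	fmt.Println("compute API calls per day:", pollsPerDay*apiCallsPerPoll) // 25920, i.e. ~26k
}
```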

@@ -81,6 +81,7 @@ func main() {
flag.StringVar(&o.ClusterName, "cluster-name", o.ClusterName, "name of cluster")
flag.StringVar(&o.BackupStorePath, "backup-store", o.BackupStorePath, "backup store location")
flag.StringVar(&o.BackupInterval, "backup-interval", o.BackupInterval, "interval for periodic backups")
flag.StringVar(&o.DiscoveryPollInterval, "discovery-poll-interval", o.DiscoveryPollInterval, "interval for discovery poll")


How are we going to set that in the context of kOps? I mean, is it worth enabling the setting via ENV vars also, just like backup retentions?

@zetaab (Contributor, Author) commented Feb 8, 2021


We need to modify the kOps APIs to make this configurable; my plan was to do that instead of just env variables.


As those changes need time to go through the versioning cycle, I'd recommend also adding the option to set it via env vars.

@zetaab (Contributor, Author)

Yeah, there are two open PRs that will both add new flags. IMO env support should be implemented for both of them together. I would like to get these PRs merged first and after that build a common solution for the env vars.

@@ -81,6 +81,7 @@ func main() {
flag.StringVar(&o.ClusterName, "cluster-name", o.ClusterName, "name of cluster")
flag.StringVar(&o.BackupStorePath, "backup-store", o.BackupStorePath, "backup store location")
flag.StringVar(&o.BackupInterval, "backup-interval", o.BackupInterval, "interval for periodic backups")
flag.StringVar(&o.DiscoveryPollInterval, "discovery-poll-interval", o.DiscoveryPollInterval, "interval for discovery poll")
Contributor


There is flag.DurationVar, which I think does the same thing but avoids the need to parse the string afterwards.
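To illustrate the suggestion, a minimal sketch (not the PR's actual code; the flag names and defaults here are placeholders) contrasting a string flag that must be parsed by hand with flag.DurationVar, which parses the duration itself:

```go
package main

import (
	"flag"
	"fmt"
	"time"
)

func main() {
	// String flag: the value has to be parsed with time.ParseDuration afterwards.
	pollIntervalStr := flag.String("discovery-poll-interval-str", "1m", "interval for discovery poll (string form)")

	// Duration flag: flag.DurationVar accepts values like "10s" or "5m" directly.
	var pollInterval time.Duration
	flag.DurationVar(&pollInterval, "discovery-poll-interval", time.Minute, "interval for discovery poll")

	flag.Parse()

	parsed, err := time.ParseDuration(*pollIntervalStr)
	if err != nil {
		fmt.Println("invalid duration:", err)
		return
	}
	fmt.Println("string flag, parsed:", parsed)
	fmt.Println("duration flag:", pollInterval)
}
```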

@@ -156,7 +156,8 @@ func (n *TestHarnessNode) Run() {
Id: string(uniqueID),
Endpoints: []string{grpcEndpoint},
}
peerServer, err := privateapi.NewServer(n.ctx, myInfo, serverTLSConfig, disco, grpcPort, dnsProvider, dnsSuffix, clientTLSConfig)
discoveryInterval := time.Second * 5
Contributor


Ooh cool - does this make our tests go faster?

@justinsb (Contributor)

Thanks @zetaab ... we could also think about e.g. having a different interval once we've reached steady-state, or having a way that clients could signal the process that there's a change. But this is a great first step!

In terms of flag proliferation and mapping them, in kops-controller we're starting to define things using a YAML config block, which could live in a ConfigMap. But etcd-manager sits so low in the stack that we can't use a ConfigMap, so I'm not sure we have an easy answer here. One answer could be to define a config structure, and then have something like the overrides mechanism so that we at least have systematic names for the flags (e.g. --override spec.pollInterval=10m).
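Purely for illustration, a rough sketch of the "config block plus --override" idea described above (the spec struct, the field name, and the flag handling are hypothetical, not kOps or etcd-manager code):

```go
package main

import (
	"flag"
	"fmt"
	"strings"
	"time"
)

// spec is a stand-in for a config block; only one field for the sketch.
type spec struct {
	PollInterval time.Duration
}

// overrideList collects repeated --override flags like "spec.pollInterval=10m".
type overrideList []string

func (o *overrideList) String() string     { return strings.Join(*o, ",") }
func (o *overrideList) Set(v string) error { *o = append(*o, v); return nil }

func main() {
	cfg := spec{PollInterval: time.Minute} // built-in default

	var overrides overrideList
	flag.Var(&overrides, "override", "override a config field, e.g. spec.pollInterval=10m")
	flag.Parse()

	for _, ov := range overrides {
		key, value, ok := strings.Cut(ov, "=")
		if !ok {
			fmt.Println("bad override, expected key=value:", ov)
			continue
		}
		switch key {
		case "spec.pollInterval":
			d, err := time.ParseDuration(value)
			if err != nil {
				fmt.Println("invalid duration:", err)
				continue
			}
			cfg.PollInterval = d
		default:
			fmt.Println("unknown override key:", key)
		}
	}

	fmt.Println("effective discovery poll interval:", cfg.PollInterval)
}
```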

@justinsb (Contributor)

/approve
/lgtm

@justinsb justinsb merged commit d2bc348 into kopeio:master Feb 26, 2021
@zetaab zetaab deleted the feature/discoveryinterval branch February 26, 2021 14:13
@zetaab (Contributor, Author) commented Feb 26, 2021

@justinsb I totally agree: we should use a faster interval while the cluster is not in steady state, and once it reaches steady state we should switch to a longer one.
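As a purely illustrative sketch of that follow-up idea (none of this is in the PR; the function and the 5s/5m values are made up), the adaptive behaviour could look roughly like this:

```go
package main

import (
	"fmt"
	"time"
)

// nextPollInterval returns a short interval while the cluster is still
// converging and a longer one once it has reached steady state.
func nextPollInterval(steadyState bool, fast, slow time.Duration) time.Duration {
	if steadyState {
		return slow
	}
	return fast
}

func main() {
	fmt.Println(nextPollInterval(false, 5*time.Second, 5*time.Minute)) // 5s while converging
	fmt.Println(nextPollInterval(true, 5*time.Second, 5*time.Minute))  // 5m0s at steady state
}
```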

justinsb added a commit to justinsb/kops that referenced this pull request Mar 1, 2021
Changes:

* Add user agent to etcd-manager requests [kubernetes#395](kopeio/etcd-manager#395)
* Add etcd-manager metrics, add openstack API metrics [kubernetes#396](kopeio/etcd-manager#396)
* Make discovery poll interval configurable [kubernetes#397](kopeio/etcd-manager#397)
* Add log levels to prevent too verbose logging [kubernetes#394](kopeio/etcd-manager#394)
justinsb added a commit to justinsb/kops that referenced this pull request Mar 1, 2021

hakman pushed a commit to hakman/kops that referenced this pull request Mar 1, 2021