NETOBSERV-578: decrease informers' memory consumption #317
Conversation
New image: ["quay.io/netobserv/flowlogs-pipeline:7314a97"]. It will expire after two weeks.
@mariomac I haven't reviewed the code in depth, but remember the discussion we had about potential blocking calls and the fact that the cache seemed pre-loaded? Does this PR change anything in that regard / should we try to parallelize transformer processing?
@jotak after working deeply with the informers' code, I can confirm that there aren't any blocking calls (neither before nor now). The initial idea behind parallelizing workers was to share the large amount of memory that the informers consume, so we expected to minimize the overall memory usage by scaling vertically and downscaling horizontally. There are some reasons that changed my mind about parallelizing (I'll also annotate them in the related JIRA to justify it):
Given the internal complexity of FLP, I'd say the effort of reworking some internals to allow this parallelization wouldn't be worth the limited improvements we could achieve.
```go
if ownerReference.Kind == "ReplicaSet" {
	item, ok, err := k.replicaSetInformer.GetIndexer().GetByKey(info.Namespace + "/" + ownerReference.Name)
	if err != nil {
		panic(err)
```
never noticed this panic
.. that was abrupt!
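For context, here is a minimal sketch of how such a lookup could surface the failure instead of panicking. The `replicaSetOwner` helper, package name, and error wording are illustrative assumptions, not the code of this PR:

```go
// Hypothetical sketch: propagate the indexer error instead of panicking.
package sketch

import (
	"fmt"

	"k8s.io/client-go/tools/cache"
)

// replicaSetOwner resolves a ReplicaSet from the informer's indexer.
// An indexer error is reported to the caller; a cache miss yields nil, nil.
func replicaSetOwner(indexer cache.Indexer, namespace, name string) (interface{}, error) {
	item, ok, err := indexer.GetByKey(namespace + "/" + name)
	if err != nil {
		// Returning the error keeps one failed lookup from taking down
		// the whole enrichment stage.
		return nil, fmt.Errorf("can't get ReplicaSet %s/%s: %w", namespace, name, err)
	}
	if !ok {
		// Not cached (yet): let the caller decorate the flow without owner info.
		return nil, nil
	}
	return item, nil
}
```

Returning the error leaves it to the caller to decide whether to skip the owner enrichment for that flow or to fail more loudly.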
Code lgtm, smart rewrite! thanks @mariomac
I'll do a couple more tests on my cluster before approving
lgtm
I smoke-tested the enrichment end to end, everything seems fine
This PR adds a Transformer for each Kubernetes informer that converts the watched Pods, Services, and Nodes into `Info` instances, which are stored in the informers' cache and contain only the data required for the decoration. This also alleviates the load on the Garbage Collector: before this patch, a new `Info` instance was created for each flow decoration. Now the same objects are reused on `GetInfo` invocations for the same IP.

In Flowlogs-Pipelines with mid-to-low loads, no improvements have been observed in terms of CPU, and slight improvements in terms of memory. In the image below, the middle part corresponds to the new version of FLP.
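As an illustration of the technique described above (not the exact code of this PR), the sketch below assumes client-go's `SetTransform` hook, an illustrative `Info` struct, and a hypothetical `ips` index name: the transform makes the informer cache store slim `Info` values instead of full Pods, and the IP index lets lookups return the already-cached object for every flow from the same address.

```go
// Minimal sketch of a per-informer transform plus an IP index; names are illustrative.
package sketch

import (
	v1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/informers"
	kube "k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/cache"
)

// Info keeps only what the decoration needs. Embedding ObjectMeta keeps the
// stored objects compatible with the informer's namespace/name key function.
type Info struct {
	metav1.ObjectMeta
	Type string
	IPs  []string
}

const ipIndex = "ips"

// newPodInformer builds (but does not start) a Pod informer whose cache stores
// slim *Info values, indexed by Pod IP.
func newPodInformer(client kube.Interface) (cache.SharedIndexInformer, error) {
	factory := informers.NewSharedInformerFactory(client, 0)
	pods := factory.Core().V1().Pods().Informer()

	// The transform runs before objects reach the cache, so the informer keeps
	// a small Info per Pod rather than the whole *v1.Pod.
	if err := pods.SetTransform(func(obj interface{}) (interface{}, error) {
		pod, ok := obj.(*v1.Pod)
		if !ok {
			// e.g. DeletedFinalStateUnknown tombstones: store unchanged
			return obj, nil
		}
		ips := make([]string, 0, len(pod.Status.PodIPs))
		for _, ip := range pod.Status.PodIPs {
			ips = append(ips, ip.IP)
		}
		return &Info{
			ObjectMeta: metav1.ObjectMeta{Name: pod.Name, Namespace: pod.Namespace},
			Type:       "Pod",
			IPs:        ips,
		}, nil
	}); err != nil {
		return nil, err
	}

	// Index the stored Info values by IP so IP-based lookups reuse them.
	if err := pods.AddIndexers(cache.Indexers{ipIndex: func(obj interface{}) ([]string, error) {
		if info, ok := obj.(*Info); ok {
			return info.IPs, nil
		}
		return nil, nil
	}}); err != nil {
		return nil, err
	}
	return pods, nil
}

// infoForIP returns the cached *Info for a given IP: every flow from the same
// address gets the same object, with no new allocation per decoration.
func infoForIP(informer cache.SharedIndexInformer, ip string) *Info {
	objs, err := informer.GetIndexer().ByIndex(ipIndex, ip)
	if err != nil || len(objs) == 0 {
		return nil
	}
	return objs[0].(*Info)
}
```

Because the transform runs before objects reach the cache, the full Pod is never retained, and repeated lookups for the same IP return the same cached pointer, which is what relieves the Garbage Collector.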