Ability to set upstream zone size and keepalive settings #483

kate-osborn · 2023-03-17T21:23:27Z

As a user of NGF
I want NGF to update the upstream zone size for NGINX
So that if I run into errors when due to exceeding my zone size, I can fix them.

As a user of NGF
I want NGF to enable keepalive connections on my route
So that I can optimize the performance of my application.

Acceptance

The user is able to set NGINX's upstream zone size.
The user is able to enable keepalive connections as defined by the design.
When possible, configuration updates with NGINX Plus should be made using the NGINX Plus API so NGINX is not reloaded.
- zone size
- keepalives connections

Dev Notes:

If a location block is forwarding to an upstream with keepalives, the Connection header must be empty. See http://nginx.org/en/docs/http/ngx_http_upstream_module.html#keepalive

Tasks

Give feedback

Design data plane configuration API for UpstreamSettingsPolicy #2809
Translate UpstreamSettingsPolicy to data plane configuration #2810

refined size/medium stale
Translate data plane UpstreamSettingsPolicy configuration into NGINX configuration #2811

refined size/medium
Implement UpsteamSettingsPolicy Status #2812

blocked enhancement refined
Options

mpstefan · 2023-05-11T15:34:41Z

Would the user expect the system to do this? What is the impact on the user's system if the zone size is not dynamically updated?

brianehlert · 2023-05-11T21:15:55Z

Dynamic calculation of upstream zone accomplishes one thing, as the size of an upstream service grows it ensure that NGINX can handle that.
The positive impact is that the customer can scale upstream services at will to 1000s of pods and the system will dynamically adapt.
The negative impact is the growth in memory utilization as the possibility exists to high limits.

I have an entire write-up around this for NIC to do the same.
This would be a later and advanced capability that has the impact of optimizing the system.

If not dynamically calculated, it needs to be exposed in a configmap for example or however system tuning is exposed.
Whether auto-magic is a requirement for v1 should be discussed.

mpstefan · 2023-09-11T16:03:25Z

Today we discussed on how this would be valuable for NGINX+ as it does not require a reload when upstreams are added or removed.

brianehlert · 2023-10-13T14:11:07Z

NIC has run into a number of customer situations where customers set their limits so lean as even a back end service scaling event can cause OOM or CPUThrottling situations as a result of the configuration change. Without conscious memory consumption increases that are introduced by a feature like this.

While I think this capability is highly valuable,
As I have learned more about how customers are leaning into using Quality of Service and other K8s platform requirements that force the setting of limits - I am hesitant at introducing something like this due to the feared impact on the system as a whole and a situation where the Gateway is unable to start because the configuration alone is forcing the pod into an OOM state.

mpstefan · 2023-10-24T16:42:17Z

blocked by #929

pleshakov · 2023-11-01T15:14:51Z

@brianehlert

NIC has run into a number of customer situations where customers set their limits so lean as even a back end service scaling event can cause OOM or CPUThrottling situations as a result of the configuration change. Without conscious memory consumption increases that are introduced by a feature like this.

While I think this capability is highly valuable,
As I have learned more about how customers are leaning into using Quality of Service and other K8s platform requirements that force the setting of limits - I am hesitant at introducing something like this due to the feared impact on the system as a whole and a situation where the Gateway is unable to start because the configuration alone is forcing the pod into an OOM state.

The amount of configuration does affect memory consumptions of NGINX - more config you have (including TLS secrets), more memory it will consume.

Also note that our architecture includes running the control plane along the data plane, where the control plane has a cache of resources in the cluster in memory. This means that the number of those resources (including HTTPRoutes, Secrets, Endpoints... ) also directly affect memory consumption of NGF pod, without even considering the data plane.

However, traffic will much greater affect memory -- as each connection requires memory.

Additionally, configuration changes (reloading NGINX) temporarily increases memory consumption, as during a reload both old worker processes and new worker processes coexist.

Supporting dynamic calculation of zone sizes will reduce overall memory of NGINX -- because each upstream will use the amount tuned to the number of upstream servers, not some large value that will hold any amount for most cases.

Considering all that, I think dynamic calculations of zone sizes will be beneficial and it will not lead to OOMs - other things will lead to OOMs first.

kate-osborn · 2024-10-31T15:42:34Z

When possible, configuration updates with NGINX Plus should be made using the NGINX Plus API so NGINX is not reloaded.

zone size
keepalives connections

It doesn't look like it is possible to set zone size or keepalive connections using the N+ API. The API doesn't support updating directives for an upstream group. You can only add/modify/delete servers from upstreams: https://demo.nginx.com/swagger-ui/?_ga=2.44370660.1560926404.1730133990-1687392834.1727393286#/

kate-osborn · 2024-10-31T15:54:15Z

Also note:

When using load balancing methods other than the default round-robin method, it is necessary to activate them before the keepalive directive.

kate-osborn added the enhancement New feature or request label Mar 17, 2023

github-project-automation bot added this to NGINX Gateway Fabric Mar 17, 2023

github-project-automation bot moved this to 🆕 New in NGINX Gateway Fabric Mar 17, 2023

kate-osborn added the area/nginx-configuration Relates to nginx configuration label Mar 21, 2023

kate-osborn added this to the v1.0.0 milestone Mar 21, 2023

mpstefan modified the milestones: v1.0.0, v1.0.1 Aug 11, 2023

mpstefan added refined Requirements are refined and the issue is ready to be implemented. size/small Estimated to be completed within ~2 days labels Sep 11, 2023

mpstefan modified the milestones: v1.0.1, v1.1.0 Sep 26, 2023

mpstefan changed the title ~~Dynamically calculate upstream zone size~~ Ability to set upstream zone size Oct 23, 2023

mpstefan added the blocked Blocked by other issue label Oct 24, 2023

ciarams87 removed the blocked Blocked by other issue label Nov 6, 2023

ciarams87 mentioned this issue Nov 9, 2023

Feat: Support OTel tracing using ngx_otel_module #1238

Closed

6 tasks

bjee19 self-assigned this Nov 13, 2023

bjee19 moved this from 🆕 New to 🏗 In Progress in NGINX Gateway Fabric Nov 13, 2023

bjee19 moved this from 🏗 In Progress to 🆕 New in NGINX Gateway Fabric Nov 15, 2023

kate-osborn added the blocked Blocked by other issue label Nov 16, 2023

bjee19 removed their assignment Nov 20, 2023

ja20222 added the backlog Currently unprioritized work. May change with user feedback or as the product progresses. label Nov 20, 2023

ja20222 removed this from the v1.1.0 milestone Nov 20, 2023

mpstefan mentioned this issue Dec 19, 2023

Native NGINX Configuration in Gateway API Design #1258

Closed

mpstefan removed the backlog Currently unprioritized work. May change with user feedback or as the product progresses. label Feb 6, 2024

mpstefan added this to the v1.2.0 milestone Feb 6, 2024

mpstefan removed refined Requirements are refined and the issue is ready to be implemented. size/small Estimated to be completed within ~2 days labels Feb 6, 2024

mpstefan modified the milestones: v1.2.0, v2.0.0 Feb 21, 2024

mpstefan removed the blocked Blocked by other issue label Mar 11, 2024

mpstefan mentioned this issue Mar 29, 2024

ClientSettingsPolicy for Gateways #1760

Closed

mpstefan modified the milestones: v1.3.0, v2.1.0 Mar 29, 2024

mpstefan mentioned this issue Mar 29, 2024

Client Settings Policy #1758

Closed

mpstefan mentioned this issue Aug 26, 2024

Upstream Settings Policy #2162

Open

mpstefan changed the title ~~Ability to set upstream zone size~~ Ability to set upstream zone size and keepalive settings Sep 3, 2024

mpstefan added refined Requirements are refined and the issue is ready to be implemented. size/large Estimated to be completed within two weeks labels Sep 3, 2024

mpstefan modified the milestones: v1.5.0, v2.0.0 Oct 14, 2024

sindhushiv modified the milestones: v2.0.0, v1.5.0 Oct 30, 2024

kate-osborn self-assigned this Oct 31, 2024

kate-osborn moved this from 🆕 New to 🏗 In Progress in NGINX Gateway Fabric Oct 31, 2024

sjberman modified the milestones: v1.5.0, v1.6.0 Nov 20, 2024

sindhushiv assigned bjee19 Nov 21, 2024

kate-osborn moved this from 🏗 In Progress to 👀 In Review in NGINX Gateway Fabric Dec 11, 2024

kate-osborn moved this from 👀 In Review to ✅ Done in NGINX Gateway Fabric Dec 13, 2024

kate-osborn closed this as completed by moving to ✅ Done in NGINX Gateway Fabric Dec 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ability to set upstream zone size and keepalive settings #483

Ability to set upstream zone size and keepalive settings #483

kate-osborn commented Mar 17, 2023 •

edited

Loading

Tasks

mpstefan commented May 11, 2023

brianehlert commented May 11, 2023

mpstefan commented Sep 11, 2023

brianehlert commented Oct 13, 2023 •

edited

Loading

mpstefan commented Oct 24, 2023

pleshakov commented Nov 1, 2023

kate-osborn commented Oct 31, 2024 •

edited

Loading

kate-osborn commented Oct 31, 2024

Ability to set upstream zone size and keepalive settings #483

Ability to set upstream zone size and keepalive settings #483

Comments

kate-osborn commented Mar 17, 2023 • edited Loading

Acceptance

Dev Notes:

Tasks

mpstefan commented May 11, 2023

brianehlert commented May 11, 2023

mpstefan commented Sep 11, 2023

brianehlert commented Oct 13, 2023 • edited Loading

mpstefan commented Oct 24, 2023

pleshakov commented Nov 1, 2023

kate-osborn commented Oct 31, 2024 • edited Loading

kate-osborn commented Oct 31, 2024

kate-osborn commented Mar 17, 2023 •

edited

Loading

brianehlert commented Oct 13, 2023 •

edited

Loading

kate-osborn commented Oct 31, 2024 •

edited

Loading