Slow start configuration understanding #36961

anupam-meesho · 2024-11-04T07:15:48Z

Hi Team,
Some background: We have our whole infra in kubernetes. We have configured contour as an ingress gateway for the means of routing traffic across the clusters.
Question: We have configured slow start for some of our services. We came across two different behaviours for the different pods. One is honouring the slow start other one is takes a way longer time to ramp up the pods to the full capacity. It would be very helpful if you can point to the certain documentation of code that governs this behaviour predictively. Below is the configuration we are using:

slowStart:
    aggression: 0.2
    minPercent: 1
    window: 150s

The text was updated successfully, but these errors were encountered:

KBaichoo · 2024-11-04T16:39:29Z

documentation for slow start mode: https://www.envoyproxy.io/docs/envoy/latest/intro/arch_overview/upstream/load_balancing/slow_start

original PR adding slow start mode: #13176

anupam-meesho · 2024-11-08T08:21:21Z

@KBaichoo I have checked documentation but nothing seems to explain the different start modes like the above two pod traffics. Any other reference if you could guide.

KBaichoo · 2024-11-12T14:48:46Z

cc @nezdolik might be more familiar with this area

nezdolik · 2024-11-18T09:25:50Z

This is being reported quite frequently by users who operate various service mesh tech or Envoy based ingresses, where the control plane enables locality based routing by default. @anupam-meesho can you confirm that your setup does not have pods spread across multiple localities or priorities? (from slow start docs):

Note in case when multiple priorities are used with slow start and lower priority has just one endpoint A, during cross-priority spillover there will be no progressive increase of traffic to endpoint A, all traffic will shift at once. Same applies to locality weighted loadbalancing, when slow start is enabled for the upstream cluster and traffic is routed cross zone to a zone with one endpoint A, there will be no progressive increase of traffic to endpoint A.

github-actions · 2024-12-18T12:01:41Z

This issue has been automatically marked as stale because it has not had activity in the last 30 days. It will be closed in the next 7 days unless it is tagged "help wanted" or "no stalebot" or other activity occurs. Thank you for your contributions.

github-actions · 2024-12-25T16:01:08Z

This issue has been automatically closed because it has not had activity in the last 37 days. If this issue is still valid, please ping a maintainer and ask them to label it as "help wanted" or "no stalebot". Thank you for your contributions.

anupam-meesho added the triage Issue requires triage label Nov 4, 2024

KBaichoo added question Questions that are neither investigations, bugs, nor enhancements area/cluster and removed triage Issue requires triage labels Nov 4, 2024

github-actions bot added the stale stalebot believes this issue/PR has not been touched recently label Dec 18, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Dec 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Slow start configuration understanding #36961

Slow start configuration understanding #36961

anupam-meesho commented Nov 4, 2024

KBaichoo commented Nov 4, 2024

anupam-meesho commented Nov 8, 2024

KBaichoo commented Nov 12, 2024

nezdolik commented Nov 18, 2024

github-actions bot commented Dec 18, 2024

github-actions bot commented Dec 25, 2024

Slow start configuration understanding #36961

Slow start configuration understanding #36961

Comments

anupam-meesho commented Nov 4, 2024

KBaichoo commented Nov 4, 2024

anupam-meesho commented Nov 8, 2024

KBaichoo commented Nov 12, 2024

nezdolik commented Nov 18, 2024

github-actions bot commented Dec 18, 2024

github-actions bot commented Dec 25, 2024