Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Slow start configuration understanding #36961

Closed
anupam-meesho opened this issue Nov 4, 2024 · 6 comments
Closed

Slow start configuration understanding #36961

anupam-meesho opened this issue Nov 4, 2024 · 6 comments
Labels
area/cluster question Questions that are neither investigations, bugs, nor enhancements stale stalebot believes this issue/PR has not been touched recently

Comments

@anupam-meesho
Copy link

Hi Team,
Some background: We have our whole infra in kubernetes. We have configured contour as an ingress gateway for the means of routing traffic across the clusters.
Question: We have configured slow start for some of our services. We came across two different behaviours for the different pods. One is honouring the slow start other one is takes a way longer time to ramp up the pods to the full capacity. It would be very helpful if you can point to the certain documentation of code that governs this behaviour predictively. Below is the configuration we are using:

slowStart:
    aggression: 0.2
    minPercent: 1
    window: 150s
slowStart
@anupam-meesho anupam-meesho added the triage Issue requires triage label Nov 4, 2024
@KBaichoo
Copy link
Contributor

KBaichoo commented Nov 4, 2024

documentation for slow start mode: https://www.envoyproxy.io/docs/envoy/latest/intro/arch_overview/upstream/load_balancing/slow_start

original PR adding slow start mode: #13176

@KBaichoo KBaichoo added question Questions that are neither investigations, bugs, nor enhancements area/cluster and removed triage Issue requires triage labels Nov 4, 2024
@anupam-meesho
Copy link
Author

@KBaichoo I have checked documentation but nothing seems to explain the different start modes like the above two pod traffics. Any other reference if you could guide.

@KBaichoo
Copy link
Contributor

cc @nezdolik might be more familiar with this area

@nezdolik
Copy link
Member

This is being reported quite frequently by users who operate various service mesh tech or Envoy based ingresses, where the control plane enables locality based routing by default. @anupam-meesho can you confirm that your setup does not have pods spread across multiple localities or priorities? (from slow start docs):

Note in case when multiple priorities are used with slow start and lower priority has just one endpoint A, during cross-priority spillover there will be no progressive increase of traffic to endpoint A, all traffic will shift at once. Same applies to locality weighted loadbalancing, when slow start is enabled for the upstream cluster and traffic is routed cross zone to a zone with one endpoint A, there will be no progressive increase of traffic to endpoint A.

Copy link

This issue has been automatically marked as stale because it has not had activity in the last 30 days. It will be closed in the next 7 days unless it is tagged "help wanted" or "no stalebot" or other activity occurs. Thank you for your contributions.

@github-actions github-actions bot added the stale stalebot believes this issue/PR has not been touched recently label Dec 18, 2024
Copy link

This issue has been automatically closed because it has not had activity in the last 37 days. If this issue is still valid, please ping a maintainer and ask them to label it as "help wanted" or "no stalebot". Thank you for your contributions.

@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Dec 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/cluster question Questions that are neither investigations, bugs, nor enhancements stale stalebot believes this issue/PR has not been touched recently
Projects
None yet
Development

No branches or pull requests

3 participants