-
Notifications
You must be signed in to change notification settings - Fork 716
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
evaluate the retry logic for API calls #1606
Comments
I think that we need to arrange for a code-walkthrough and add retries to everything is accessing api-server/etcd during join and fix it consistently |
In the meantime, I'll give this one a go. /assign |
I'd like to help this. cloud I assign? |
i think @rosti was working on an idea on how to create a generic retry client. |
that sounds cool, ping me if you @rosti need help. |
/kind design feature |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
I don't think we had a POC. Are you seeing any particular problems?
…On Feb 17, 2022 12:05, "Paco Xu" ***@***.***> wrote:
i think @rosti <https://github.com/rosti> was working on an idea on how
to create a generic retry client.
I cannot find new PR from @rosti <https://github.com/rosti> about this.
Any update for this?
https://github.com/kubernetes/kubernetes/blame/
e777f721638cf585b4e9e5d933d27e753a35fabe/cmd/kubeadm/app/
util/apiclient/idempotency.go#L342-L362
—
Reply to this email directly, view it on GitHub
<#1606 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AACRATHIWXJDJBZOV5E6VF3U3TCABANCNFSM4HXKNECQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
You are receiving this because you were assigned.Message ID:
***@***.***>
|
After walking through the thread in #78915, it works well. The discussion focus on whether there are any hidden problems in the apiserver side or proxy/LB side. To summarize it:
Why not log the |
retry does help and IIRC the ideas was to have all API calls behind a client that retries for a per-use-case time.
some of these warnings in polls can spam the logs a lot. i wouldn't mind closing this ticket until further notice. |
If so, I think the priority would be |
moving to 1.30 with priority soon. xref |
WIP PR |
@neolit123 regarding kubernetes/kubernetes#112411, I was able to figure out what the problem was, at least in my case. I had installed containerd using |
thanks for the update. so it's bound to a specific containerd version or config; i'd assume the alternative version is much newer? |
we recommend to users to install containerd using their guide in the containerd repo |
Unfortunately I did not check the version that came as the default Debian package, but I subsequently used https://docs.docker.com/engine/install/debian/ to install containerd.io. |
with the merge of PR:
kubernetes/kubernetes#78915
we added retry logic to kubeadm when fetching ConfigMaps.
The PR also added a TODO to evaluate if this can be done better.
This work should happen in the 1.16 cycle.
update:
we are seeing users having random API server downtime which trips our API calls without retry, such as the ones in uplaod-certs phase of init:
this is tracked for a fix in 1.30, but no backports are planned.
as a first step we should add retries for all calls in idempotency.go
The text was updated successfully, but these errors were encountered: