-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New containers unable to reach the network for a while when using Calico #2308
Comments
Is it only when first bringing up the cluster or an ongoing issue with new containers? The former would probably just be time to spinup calico, the latter is probably something more interesting and might need help from the calico folks (realistically as long as the default is working fine I probably won't be able to dig into this anytime soon, not sure about the other maintainers). |
@BenTheElder it's an ongoing issue with new containers. |
I don't see this behaviour on my machine at all (dell laptop, running Ubuntu 20.04). Here's my session output:
Ah - my kind version is wrong. I'll try again with the same version as OP. |
Same result with kind 0.11.1. Though as before, the big difference is I'm on linux, not mac. Is there something wrong with the way I'm reproing? |
It could be something with how the docker for mac linux VM is setup, e.g. there was an issue with a bad proxy recently. #2208 (comment) Thanks for looking into this and @aojea for reaching out in #sig-network. |
I've tested on Mac here with the same script from @lwr20 and it was almost instantly. Used @bandesz manifest to create the kind cluster, and deployed with Calico v3.19 from docs. @bandesz anything that can direct me to a specific point of problem? Can you take a look into the docker bridges that exists (docker network ls, docker network inspect kind) to check if there's any difference with my env? Thanks |
perhaps more or less resources allocated to the docker VM, or the version of docker desktop? |
I did some additional testing, following the exact steps by @lwr20. I get different results, tried alpine/busybox to see if there is any difference. I mostly see delays around ~10 seconds, but the worst I saw was ~40 seconds.
Docker Desktop version: 3.3.3 (6 CPUs, 6Gb memory)
|
Just upgraded Docker Desktop to 3.4.0 (latest), but no change. |
Additional things I've tried, but no change:
|
Tried on another Macbook, I get the same result. Same kind and Docker for Mac versions. It's interesting that so far the very first time (in a new kind cluster in the first test) I get an immediate response from wget, so if anyone's testing this, you have to repeat the test with multiple new containers to make sure you don't see any significant latency. |
reported today in the slack channel https://kubernetes.slack.com/archives/CEKK1KTN2/p1624167563435000 It seems very related
|
I've had the exact same issue, and had to introduce som waits and extra checks in our setup to compensate. I'm 99% sure I did not upgrade kind in the period where this startet happening, and 100% sure I've updated Docker For Mac multiple times, so for what it is worth it could be explained by an issue with Docker For Mac! |
can we blame docker for mac and close it then 😄 |
First of all I know that Calico is not officially supported, but I'm leaving this question here for others to find. Maybe others have/had the same issue.
When I disable the default CNI and install Calico, new containers are unable to reach the network for at least ~30 seconds.
In my tests I was trying to reach a service on a cluster IP to rule out any DNS resolution issues.
When I remove Calico and use the built-in CNI, new containers can reach the network instantly.
OS: macOS 11.4
Kind version: v0.11.1
Calico versions tested: v3.16.10, v3.19.1
Docker Desktop: 3.3.3 (6 CPUs, 6Gb memory)
Kind config:
Calico manifests used:
https://docs.projectcalico.org/[v3.16|v3.19]/manifests/calico.yaml
The text was updated successfully, but these errors were encountered: