-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New Node creation, pull image from private repo fails "Forbidden" only for first 10-15 minutes of new node creation #3877
Comments
Is the artifactserver a GCR / ECR server? Where are the credentials stored? |
@justinsb the artifactserver is Artifactory. Which is added as insecureRegistry to the |
@mikesplain that's it!!!! have you found a solution? Haven't seen any traction on #3224 in a while @chrislovecnm might know if this is being handled outside that issue? |
@mikesplain should we switch to using something other than calico? |
@cdenneen Glad to hear it! Well looks like we'll have a path forward soon based on #3224 (comment). Anyway, my current workaround is a cleanup script that we schedule a cronjob. Give me a few and I'll open source it. |
@cdenneen Take a look at this, I haven't tested this directly, since I run it via a helm chart, but it should help you out: |
@mikesplain Thanks...
|
So I got the cronjob installed but I'm not able to find it using API server is running with To load it need to do |
@cdenneen it's under the I'm not positive this will solve your issue since you are getting some sort of response... hmm |
Yeah this might not be the same issue...
I know it's not a connectivity and permission issue from the private repo because if I login to the node and do the |
A workaround is to have a hook that does an image pull, but that is a work around. Can you get us kubelet logs? I am guessing that this is an upstream issue btw. Anyone else agree / disagree? |
|
Here is the kubelet info from the node: |
Anyone know how I can add to kops my nodes ig some sort of hook to do the "docker pull" automatically rather than logging in to each of these nodes to get past the delay? |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
/remove-lifecycle stale |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Stale issues rot after 30d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Rotten issues close after 30d of inactivity. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
@fejta-bot: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
New node spins up with the following:
Failed to pull image "artifactserver.example.com/gitlab/gitlab-runner:v1.11.5": rpc error: code = 2 desc = Error response from daemon: {"message":"unknown: Forbidden"}
Error syncing pod
If I wait 10-15 minutes it eventually works.
However if I login to the node and do a
docker pull artifactserver.example.com/gitlab/gitlab-runner:v1.11.5
it pulls down with no issue and then the pod starts within a few seconds upon next retry.Basically what I'm trying to understand is why the 10-15 minute delay with new nodes pulling that image from private registry. Why when I pull it manually it behaves any different than the pod creation does.
The text was updated successfully, but these errors were encountered: