Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ci-kubernetes-conformance-kind-ga-only is experiencing pod scheduling timeout #18507

Closed
hasheddan opened this issue Jul 28, 2020 · 8 comments
Closed
Assignees
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@hasheddan
Copy link
Contributor

What happened:

ci-kubernetes-conformance-kind-ga-only is experiencing pod scheduling timeout since #18471

What you expected to happen:

For the pod to be scheduled.

Please provide links to example occurrences, if any:

https://prow.k8s.io/view/gcs/kubernetes-jenkins/logs/ci-kubernetes-conformance-kind-ga-only/1288101820190691330

Anything else we need to know?:

@spiffxp I am having trouble determining what is causing the scheduling timeout based on looking at the metrics. Do you have any insight?

/assign

@hasheddan hasheddan added the kind/bug Categorizes issue or PR as related to a bug. label Jul 28, 2020
@hasheddan
Copy link
Contributor Author

@BenTheElder
Copy link
Member

#18530

@RobertKielty
Copy link
Member

For ref conformance-all#conformance-ga-only

@spiffxp
Copy link
Member

spiffxp commented Aug 3, 2020

https://prow.k8s.io/?job=ci-kubernetes-conformance-kind-ga-only

Perhaps we should bring the CPU limit down a bit? Or should we somehow be forcing the cluster to be larger?

/assign @BenTheElder

@hasheddan
Copy link
Contributor Author

@spiffxp also seeing this same issue with https://testgrid.k8s.io/sig-release-master-blocking#kind-master-parallel. It appears both of these jobs are requesting a full node (i.e. 7300m CPU). Is this necessary?

@BenTheElder
Copy link
Member

The full node is to avoid noisy neighbors.
We cannot schedule for disk IO
I'm looking at moving some things into memory but for now yes, the goal is an isolated node. This is comparable to the boskos resources created by a cloudy e2e

The cluster should be bigger

@spiffxp
Copy link
Member

spiffxp commented Aug 4, 2020

/close

https://prow.k8s.io/?job=ci-kubernetes-conformance-kind-ga-only
https://prow.k8s.io/?job=ci-kubernetes-kind-e2e-parallel

Neither have had an error status (grey triangle) since #18637 landed

@k8s-ci-robot
Copy link
Contributor

@spiffxp: Closing this issue.

In response to this:

/close

https://prow.k8s.io/?job=ci-kubernetes-conformance-kind-ga-only
https://prow.k8s.io/?job=ci-kubernetes-kind-e2e-parallel

Neither have had an error status (grey triangle) since #18637 landed

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug.
Projects
None yet
Development

No branches or pull requests

5 participants