-
Notifications
You must be signed in to change notification settings - Fork 431
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enable Windows e2e logging #1351
Enable Windows e2e logging #1351
Conversation
Edit: ssh has default connection of 10 so it was dropping connections. I've addressed that issue |
43949db
to
a7a961e
Compare
looks like the node didn't come online at all |
ran into #1328 and got logs from the windows node: https://gcsweb.k8s.io/gcs/kubernetes-jenkins/pr-logs/pull/kubernetes-sigs_cluster-api-provider-azure/1351/pull-cluster-api-provider-azure-e2e-windows/1387946030623887360/artifacts/clusters/capz-e2e-vjj1yu/machines/capz-e2e-vjj1yu-md-win-76d8f65f89-fs7tg/ looking to the failure |
It looks like node didn't come online properly. Kubeproxy and flannel were both not running and kubelet looks to have crashed as well. Some of the powershell output was truncated, Will update this to fix that and add a few more logs to collect |
I also found that logs are not collected for machinepools: #1352 |
acca60f
to
274add5
Compare
Looks like this still has patches from #1350 ? |
yes, added them back for testing. |
Running Windows job again to see if restart of kubelet works /test pull-cluster-api-provider-azure-e2e-windows |
try 2 passed, try 3: /test pull-cluster-api-provider-azure-e2e-windows |
try 3 passed, try 4: /test pull-cluster-api-provider-azure-e2e-windows |
try 4 failed with nodes failing to come online, logged: #1359 try 5: /test pull-cluster-api-provider-azure-e2e-windows |
try 5 passed, try 6: /test pull-cluster-api-provider-azure-e2e-windows |
ab17555
to
c9a8a75
Compare
try 6 passed, I've re-based to remove the patches from #1350 |
If this is actually fixing stuff (besides logging), let's add that in the pr description / title |
c9a8a75
to
aae71ff
Compare
Split out the fix to #1366 |
aae71ff
to
a4d8526
Compare
/assign @marosset |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Few nits, otherwise LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Little fix up for logging.
Outside of that, it would be super cool to see the log collection functionality end up in https://github.com/Azure/azure-capi-cli-extension. It would be so useful to be able to ask a user to gather logs from their cluster for debugging.
a4d8526
to
0e732ef
Compare
/lgtm over to @devigned |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
/approve
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: devigned The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
/kind feature
What this PR does / why we need it:
We can't diagnose windows CI failures with out the logs from the workload clusters.
Which issue(s) this PR fixes (optional, in
fixes #<issue number>(, fixes #<issue_number>, ...)
format, will close the issue(s) when PR gets merged):Fixes #1286
Special notes for your reviewer:
Please confirm that if this PR changes any image versions, then that's the sole change this PR makes.
TODOs:
Release note: