-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug 1835851: Update DeleteNodesTwice test #151
Bug 1835851: Update DeleteNodesTwice test #151
Conversation
return err | ||
} | ||
|
||
// Ensure the update worked |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
curious if there's any particular reason you saw to add this extra check? seems fair to trust the error returned in 879.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This was actually to make sure the update had propagated and the client that is used by the nodegroup could read the deletiontimestamp, I'll update the comment, I'll also try seeing if this is as stable if I remove this.
I had used stress before to try and get it to fail but it didn't, I can retry that
// To make sure we don't run into any flakes in CI | ||
// I've chosen to make this sleep duration 3s. | ||
time.Sleep(3 * time.Second) | ||
expectedSize := len(testConfig.machines) - len(testConfig.machines[7:]) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we take this chance to put the magic number 7 in a constant with a comment?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah will do
@enxebre I reworked this a bit again to make it a bit more obvious and to use the same flow for fetching machines that the NodeGroups does. Just ran stress and seems happy, plus it fails if I remove the deletion timestamp check
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tested this out locally and it passes for me as well. thanks Joel!
/approve
/hold |
Agreed. I'll open a PR upstream and raise that discussion |
0979036
to
135b53f
Compare
Upstream PR: kubernetes#3125 |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: elmiko The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@enxebre This is actually not too bad using |
@JoelSpeed: This pull request references Bugzilla bug 1835851, which is invalid:
Comment In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/bugzilla refresh |
@JoelSpeed: This pull request references Bugzilla bug 1835851, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 3 validation(s) were run on this bug
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
e75294a
to
7189c10
Compare
/hold cancel This is potentially blocking other bugs from being merged. Let's prioritise this and fixup the upstream later |
} | ||
return targetSize == expectedSize, nil | ||
}); err != nil { | ||
t.Fatalf("unexpected error: %v", err) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: I think this error might go meaningless printed this way, the likes of "you got a time out but I'm not telling you where or why".
/lgtm |
7189c10
to
3ec2062
Compare
/lgtm |
/retest Please review the full test history for this PR and help us cut down flakes. |
@JoelSpeed: The following test failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/retest Please review the full test history for this PR and help us cut down flakes. |
@JoelSpeed: All pull requests linked via external trackers have merged: openshift/kubernetes-autoscaler#151. Bugzilla bug 1835851 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
The test was relying on the
TargetSize
to determine if the nodes had been deleted twice. Until #147 was merged, this value would have been the original value from when the nodegroup was constructed and did not reflect the true size of the MachineSet/Deployment underlying the nodegroupI've updated the test to: