fix(#4922): Fix flaky TestHealthTrait #5346

christophd · 2024-04-10T08:47:20Z

Avoid failing assertion on condition status ready=false due to temporary deployment ready condition status

Release Note

NONE

christophd · 2024-04-10T08:48:43Z

Fixes #4922 and #5345

squakez · 2024-04-10T09:07:42Z

e2e/common/traits/health_test.go

@@ -362,6 +362,8 @@ func TestHealthTrait(t *testing.T) {

 			g.Eventually(IntegrationPodPhase(t, ctx, ns, name), TestTimeoutLong).Should(Equal(corev1.PodRunning))
 			g.Eventually(IntegrationPhase(t, ctx, ns, name), TestTimeoutShort).Should(Equal(v1.IntegrationPhaseRunning))
+			// Wait for the integration condition to become ready=false and then check that it remains not ready for some time - fixes some test flakiness


I think that normally, the Ready condition should start with a false value. So, this verification may hide the problem. I think we need to understand the root cause, and, if it's not already the case, initialize a Ready condition as false.

You can find my analysis of the root cause in #5345. It is the Deployment ready state that causes the intermediate Integration condition status ready=true. This is before the health trait is able to set the condition status to false.

My intention in this PR is to fix the flaky E2E test. If we are not happy with the intermediate condition status ready=true, I'd suggest opening a new issue to discuss it.

The point is that, according to your analysis, the test is not wrong. What's wrong is the logic that blindly sets the condition to a value it should not have. If we change this test, we'd be promoting wrong behavior and turning the bug into a feature. I am planning to retake the work of #5096 for the next release, which may intersect with this problem. I'd suggest either working on the root cause or keeping this on hold for the moment.

What you say implies that the logic has changed recently and that the E2E test has become flaky due to that regression. I think the flakiness in this test has existed for quite some time, since #4922 already reports it on Camel K 2.1, and it may have been flaky even before that.

I am fine with working on the root cause and fixing the condition status setting, but let's open a new issue now to track that in particular, and let's not keep the flaky test until this is tackled and completely resolved. In a few weeks nobody will remember that the flakiness in this E2E test points to a misbehavior that should be fixed. We need a new issue for that.

No, this behavior has existed for a long time, probably in the 1.x branches as well. That is why I advocate for working on the root cause. In any case, it's just my opinion.

Created #5351 to track the ready condition status flaw

squakez · 2024-04-11T07:51:18Z

Probably it has to be backported, thanks.

oscerd approved these changes Apr 10, 2024

squakez reviewed Apr 10, 2024

fix(apache#4922): Fix flaky TestHealthTrait

6c6c76f

- Avoid failing assertion on condition status ready=false due to temporary deployment ready condition status

christophd force-pushed the issue/4922/flaky-e2e-test branch from 6033d54 to 6c6c76f on April 10, 2024 09:28

christophd merged commit 28b8d47 into apache:main Apr 10, 2024
14 checks passed

christophd mentioned this pull request Apr 24, 2024

Fix flaky health trait test #4922

Closed
