[Security Solution][Entity Analytics]WIP: determining cypress test flake #169714

rylnd · 2023-10-24T20:02:54Z

Seeing if this is a timing issue, or whether data from another test is to blame.

Relates to #169154.

Seeing if this is a timing issue, or whether data from another test is to blame.

rylnd · 2023-10-24T20:04:50Z

Flaky test run: https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/3712

This should reduce the time/noise in the flaky test runner, but not running other tests means these should definitely pass.

rylnd · 2023-10-25T03:10:16Z

https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/3712 was all green, but upon closer inspection of @jpdjere 's flaky run it looks like the tests only failed legitimately 2/150 times.

I'm going to run this one more time (well, 150 more times) to see if I can't reproduce the failure in isolation like this: follow along here

rylnd · 2023-10-25T14:05:55Z

Previous test run succeeded (with one random failure unrelated to the above issue). HOWEVER, taking an even closer look at @jpdjere 's flaky run it appears that the failing test there is NOT the one that had been skipped 🤷‍♂️ .

I think this invalidates the above run. I'm going to run both tests in this file, and see how the 150 runs behave: https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/3734

rylnd · 2023-10-25T16:23:31Z

No (legit/expected) failures on the isolated EA FTR run; running again with all risk engine cypress tests to see if we can't get a failure: https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/3744

Just to be safe, do it before every test, too.

rylnd · 2023-10-26T18:27:59Z

We had another 2/150 legit failures on the "run all EA cypress tests build".

I'm now adding some data guards to the failing tests and rerunning them. If these pass, it will confirm that it's data from other tests causing the issue. At that point, we'll either just keep the guards (good) or try to track down the contaminating tests (better).

rylnd · 2023-10-26T21:14:59Z

The above tests did not fail, which is a good sign. Since the failure rate is so low, though (1/75), I'm running them another 200 times to try and surface an error: https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/3768

Conflicts: x-pack/test/security_solution_cypress/cypress/e2e/entity_analytics/enrichments.cy.ts x-pack/test/security_solution_cypress/package.json

rylnd · 2023-11-07T21:43:40Z

Tests failed above, so we're not quite there. It occurred to me in the interim, however, that this behavior we're seeing may not just be due to old risk scores, but also due to alerts containing risk enrichments. Based on that theory, I'm going to try another run that additionally deletes alerts. If those pass, I'll probably keep the potentially-unnecessary data guards prior to this as "just in case" test setup.

New run: https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/3952

It was removed in elastic#170636, and appears not to have been replaced.

kibana-ci · 2023-11-08T19:50:55Z

💔 Build Failed

Failed CI Steps

Test Failures

[job] [logs] Serverless Security Cypress Tests #1 / Enrichment Custom query rule from legacy risk scores Should has enrichment fields from legacy risk Should has enrichment fields from legacy risk
[job] [logs] Serverless Security Cypress Tests #1 / Enrichment Custom query rule from legacy risk scores Should has enrichment fields from legacy risk Should has enrichment fields from legacy risk
[job] [logs] FTR Configs #68 / EPM Endpoints Install endpoint package install should have installed the [endpoint.metadata_current-default] transform

Metrics [docs]

✅ unchanged

History

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

rylnd · 2023-11-22T15:42:42Z

Tests continue to fail, seemingly due to the presence of "old" risk score data on alerts. However, after deleting all alerts AND all risk score data before each test, they continue to fail. I'm stumped as to what's going on here, I'm going to have to rope in @nkhristinin for help as the original author.

WIP: giving load/unload a little more time, and run only this test

b8cabf4

Seeing if this is a timing issue, or whether data from another test is to blame.

Run only EA cypress tests

5e545ae

This should reduce the time/noise in the flaky test runner, but not running other tests means these should definitely pass.

rylnd changed the title ~~[Security Solution][Entity Analytics]WIP: giving load/unload a little more time, and run only this test~~ [Security Solution][Entity Analytics]WIP: determining cypress test flake Oct 25, 2023

Run all enrichment tests. Run only enrichment tests.

863dd22

Broaden test pattern to run all risk-related cypress tests

d7d0d18

rylnd added 2 commits October 26, 2023 12:31

Add back missing quote

df39578

Unload all possible risk data from the environment

967bcec

Just to be safe, do it before every test, too.

rylnd added 2 commits November 7, 2023 15:39

Merge branch 'main' into fix_flaky_cypress_test

1a8e7af

Conflicts: x-pack/test/security_solution_cypress/cypress/e2e/entity_analytics/enrichments.cy.ts x-pack/test/security_solution_cypress/package.json

Delete rules before each enrichments test

2ecbd46

Remove usage of removed helper method

9246efa

It was removed in elastic#170636, and appears not to have been replaced.

rylnd mentioned this pull request Dec 22, 2023

Unskip enrichments tests #171983

Merged

rylnd mentioned this pull request Feb 23, 2024

[Entity Analytics] Hopefully fix flaky tests 🤞 #177421

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Security Solution][Entity Analytics]WIP: determining cypress test flake #169714

[Security Solution][Entity Analytics]WIP: determining cypress test flake #169714

rylnd commented Oct 24, 2023

rylnd commented Oct 24, 2023 •

edited

Loading

rylnd commented Oct 25, 2023

rylnd commented Oct 25, 2023

rylnd commented Oct 25, 2023

rylnd commented Oct 26, 2023

rylnd commented Oct 26, 2023

rylnd commented Nov 7, 2023 •

edited

Loading

kibana-ci commented Nov 8, 2023 •

edited

Loading

rylnd commented Nov 22, 2023

[Security Solution][Entity Analytics]WIP: determining cypress test flake #169714

Are you sure you want to change the base?

[Security Solution][Entity Analytics]WIP: determining cypress test flake #169714

Conversation

rylnd commented Oct 24, 2023

rylnd commented Oct 24, 2023 • edited Loading

rylnd commented Oct 25, 2023

rylnd commented Oct 25, 2023

rylnd commented Oct 25, 2023

rylnd commented Oct 26, 2023

rylnd commented Oct 26, 2023

rylnd commented Nov 7, 2023 • edited Loading

kibana-ci commented Nov 8, 2023 • edited Loading

💔 Build Failed

Failed CI Steps

Test Failures

Metrics [docs]

History

rylnd commented Nov 22, 2023

rylnd commented Oct 24, 2023 •

edited

Loading

rylnd commented Nov 7, 2023 •

edited

Loading

kibana-ci commented Nov 8, 2023 •

edited

Loading