[Security Solution][Detections] Fix EQL cypress tests #80440

rylnd · 2020-10-14T00:45:41Z

Summary

This EQL suite was previously skipped. While these were skipped, a bug was introduced in elasticsearch that broke EQL rules. This bug should be fixed in elastic/elasticsearch#63573, which should fix these tests, but let's see if CI disagrees.

Checklist

Unit or functional tests were updated or added to match the most common scenarios

For maintainers

This was checked for breaking API changes and was labeled appropriately

elasticmachine · 2020-10-14T21:36:58Z

Pinging @elastic/siem (Team:SIEM)

rylnd · 2020-10-14T21:47:57Z

Moving back to Draft as it looks like there were some suspicious failures on the 7.x branch. Going to try and repro/fix locally.

These _should_ be fixed with the latest ES on master, but let's see if CI disagrees.

Occasionally our tests hit a scenario where the rule has executed (its status is "succeeded"), but the generated alerts have not populated in the same time frame. In this case the test fails oddly, saying that the "alert count" element is not there when it is. I attempted to improve the error message by using a .should() with a callback, but that lead to even stranger behavior as the .should() would fail once (expected), and then not be able to find the element a second time. :( So we instead focus on fixing the real problem, here: wait until alerts populate (have a non-zero count) before performing the assertion. Because the page will not update automatically, we can't rely on cypress' retryability and must instead assert, click Refresh, and assert again, much like we're doing while waiting for the rule to execute. And like `waitForTheRuleToBeExecuted`, we're using a while loop that has no guarantee of ever exiting :(

* Uses should with a text matcher instead of using invoke('text') * Use of not.equal between a string and an element may have been a false positive

We have a few tasks that require polling for some background work to be completed. The basic form is: assert the byproduct, or refresh the page and try again. We were previously doing this with a while loop, which was not guaranteed to ever complete, leading to cryptic failures if the process ever hung. Instead, this implements a safer polling mechanism with a definite termination similar to the cypress-wait-until plugin.

* Do not automatically refresh the page * This is only necessary if we're not in the state we need. The `waitFor` helper functions automatically reload whatever needs to be reloaded, so we're delegating this task to them. * Ensure we wait for alerts to be nonzero before our assertion * Otherwise we get some strange behavior around this field's availability; see previous commits

rylnd · 2020-10-19T17:32:07Z

@elasticmachine merge upstream

rylnd · 2020-10-19T17:33:32Z

@MadameSheema I ended up implementing a waitUntil function in order to get rid of those unbounded while loops. This may have made some other (non-EQL) tests less flaky as well, but I haven't been able to verify that.

MadameSheema

Lots of thanks for this fix @rylnd! Great work :)

Threat Match Rules introduced an additional query input, causing our CUSTOM_QUERY_INPUT to be ambiguous. However, instead of failing due to the ambiguity, the behavior of cypress seems to be to pass! While I haven't yet tracked down the cause of these false positives, disambiguating these selectors is the immediate fix.

rylnd · 2020-10-19T23:44:41Z

@MadameSheema I think that the behavior causing the above test failures is also present on master; however on master it leads to a passing test! I believe I've fixed the immediate problem in 24748c9, but we need to diagnose and prevent whatever is causing these false positives in the abstract.

rylnd · 2020-10-19T23:47:18Z

How the false positive appears in cypress:open:

in cypress:run:

rylnd · 2020-10-20T01:01:12Z

Ok, I figured out the cause of the false positive: native promises. I found this issue that seemed to describe the behavior we were seeing, and sure enough, commenting out our use of waitForTheRuleToBeExecuted caused the error to propagate into a failure.

Because waitForTheRuleToBeExecuted was an async function, those native promises were causing all this weird behavior. Since I've updated that function on this branch, we saw the expected failure.

kibanamachine · 2020-10-20T01:19:04Z

💚 Build Succeeded

continuous-integration/kibana-ci/pull-request
Commit: 24748c9

Metrics [docs]

✅ unchanged

History

💔 Build #82559 failed 245970d
💔 Build #82532 failed 2242714
💚 Build #81492 succeeded 54cb278361322cae643248b47ac4cc09fb75d587

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

* Unskip EQL tests These _should_ be fixed with the latest ES on master, but let's see if CI disagrees. * Wait until alerts have populated on Rule Details Occasionally our tests hit a scenario where the rule has executed (its status is "succeeded"), but the generated alerts have not populated in the same time frame. In this case the test fails oddly, saying that the "alert count" element is not there when it is. I attempted to improve the error message by using a .should() with a callback, but that lead to even stranger behavior as the .should() would fail once (expected), and then not be able to find the element a second time. :( So we instead focus on fixing the real problem, here: wait until alerts populate (have a non-zero count) before performing the assertion. Because the page will not update automatically, we can't rely on cypress' retryability and must instead assert, click Refresh, and assert again, much like we're doing while waiting for the rule to execute. And like `waitForTheRuleToBeExecuted`, we're using a while loop that has no guarantee of ever exiting :( * More robust cypress assertions * Uses should with a text matcher instead of using invoke('text') * Use of not.equal between a string and an element may have been a false positive * Perform cypress loops in a manner guaranteed to exit We have a few tasks that require polling for some background work to be completed. The basic form is: assert the byproduct, or refresh the page and try again. We were previously doing this with a while loop, which was not guaranteed to ever complete, leading to cryptic failures if the process ever hung. Instead, this implements a safer polling mechanism with a definite termination similar to the cypress-wait-until plugin. * Update other specs that are asserting on alerts * Do not automatically refresh the page * This is only necessary if we're not in the state we need. The `waitFor` helper functions automatically reload whatever needs to be reloaded, so we're delegating this task to them. * Ensure we wait for alerts to be nonzero before our assertion * Otherwise we get some strange behavior around this field's availability; see previous commits * Remove unused import * Fix false positive in Rule Creation specs Threat Match Rules introduced an additional query input, causing our CUSTOM_QUERY_INPUT to be ambiguous. However, instead of failing due to the ambiguity, the behavior of cypress seems to be to pass! While I haven't yet tracked down the cause of these false positives, disambiguating these selectors is the immediate fix. Co-authored-by: Kibana Machine <[email protected]> # Conflicts: # x-pack/plugins/security_solution/cypress/integration/alerts_detection_rules_eql.spec.ts

* Unskip EQL tests These _should_ be fixed with the latest ES on master, but let's see if CI disagrees. * Wait until alerts have populated on Rule Details Occasionally our tests hit a scenario where the rule has executed (its status is "succeeded"), but the generated alerts have not populated in the same time frame. In this case the test fails oddly, saying that the "alert count" element is not there when it is. I attempted to improve the error message by using a .should() with a callback, but that lead to even stranger behavior as the .should() would fail once (expected), and then not be able to find the element a second time. :( So we instead focus on fixing the real problem, here: wait until alerts populate (have a non-zero count) before performing the assertion. Because the page will not update automatically, we can't rely on cypress' retryability and must instead assert, click Refresh, and assert again, much like we're doing while waiting for the rule to execute. And like `waitForTheRuleToBeExecuted`, we're using a while loop that has no guarantee of ever exiting :( * More robust cypress assertions * Uses should with a text matcher instead of using invoke('text') * Use of not.equal between a string and an element may have been a false positive * Perform cypress loops in a manner guaranteed to exit We have a few tasks that require polling for some background work to be completed. The basic form is: assert the byproduct, or refresh the page and try again. We were previously doing this with a while loop, which was not guaranteed to ever complete, leading to cryptic failures if the process ever hung. Instead, this implements a safer polling mechanism with a definite termination similar to the cypress-wait-until plugin. * Update other specs that are asserting on alerts * Do not automatically refresh the page * This is only necessary if we're not in the state we need. The `waitFor` helper functions automatically reload whatever needs to be reloaded, so we're delegating this task to them. * Ensure we wait for alerts to be nonzero before our assertion * Otherwise we get some strange behavior around this field's availability; see previous commits * Remove unused import * Fix false positive in Rule Creation specs Threat Match Rules introduced an additional query input, causing our CUSTOM_QUERY_INPUT to be ambiguous. However, instead of failing due to the ambiguity, the behavior of cypress seems to be to pass! While I haven't yet tracked down the cause of these false positives, disambiguating these selectors is the immediate fix. Co-authored-by: Kibana Machine <[email protected]> Co-authored-by: Kibana Machine <[email protected]>

* Unskip EQL tests These _should_ be fixed with the latest ES on master, but let's see if CI disagrees. * Wait until alerts have populated on Rule Details Occasionally our tests hit a scenario where the rule has executed (its status is "succeeded"), but the generated alerts have not populated in the same time frame. In this case the test fails oddly, saying that the "alert count" element is not there when it is. I attempted to improve the error message by using a .should() with a callback, but that lead to even stranger behavior as the .should() would fail once (expected), and then not be able to find the element a second time. :( So we instead focus on fixing the real problem, here: wait until alerts populate (have a non-zero count) before performing the assertion. Because the page will not update automatically, we can't rely on cypress' retryability and must instead assert, click Refresh, and assert again, much like we're doing while waiting for the rule to execute. And like `waitForTheRuleToBeExecuted`, we're using a while loop that has no guarantee of ever exiting :( * More robust cypress assertions * Uses should with a text matcher instead of using invoke('text') * Use of not.equal between a string and an element may have been a false positive * Perform cypress loops in a manner guaranteed to exit We have a few tasks that require polling for some background work to be completed. The basic form is: assert the byproduct, or refresh the page and try again. We were previously doing this with a while loop, which was not guaranteed to ever complete, leading to cryptic failures if the process ever hung. Instead, this implements a safer polling mechanism with a definite termination similar to the cypress-wait-until plugin. * Update other specs that are asserting on alerts * Do not automatically refresh the page * This is only necessary if we're not in the state we need. The `waitFor` helper functions automatically reload whatever needs to be reloaded, so we're delegating this task to them. * Ensure we wait for alerts to be nonzero before our assertion * Otherwise we get some strange behavior around this field's availability; see previous commits * Remove unused import * Fix false positive in Rule Creation specs Threat Match Rules introduced an additional query input, causing our CUSTOM_QUERY_INPUT to be ambiguous. However, instead of failing due to the ambiguity, the behavior of cypress seems to be to pass! While I haven't yet tracked down the cause of these false positives, disambiguating these selectors is the immediate fix. Co-authored-by: Kibana Machine <[email protected]> # Conflicts: # x-pack/plugins/security_solution/cypress/integration/alerts_detection_rules_eql.spec.ts

* Unskip EQL tests These _should_ be fixed with the latest ES on master, but let's see if CI disagrees. * Wait until alerts have populated on Rule Details Occasionally our tests hit a scenario where the rule has executed (its status is "succeeded"), but the generated alerts have not populated in the same time frame. In this case the test fails oddly, saying that the "alert count" element is not there when it is. I attempted to improve the error message by using a .should() with a callback, but that lead to even stranger behavior as the .should() would fail once (expected), and then not be able to find the element a second time. :( So we instead focus on fixing the real problem, here: wait until alerts populate (have a non-zero count) before performing the assertion. Because the page will not update automatically, we can't rely on cypress' retryability and must instead assert, click Refresh, and assert again, much like we're doing while waiting for the rule to execute. And like `waitForTheRuleToBeExecuted`, we're using a while loop that has no guarantee of ever exiting :( * More robust cypress assertions * Uses should with a text matcher instead of using invoke('text') * Use of not.equal between a string and an element may have been a false positive * Perform cypress loops in a manner guaranteed to exit We have a few tasks that require polling for some background work to be completed. The basic form is: assert the byproduct, or refresh the page and try again. We were previously doing this with a while loop, which was not guaranteed to ever complete, leading to cryptic failures if the process ever hung. Instead, this implements a safer polling mechanism with a definite termination similar to the cypress-wait-until plugin. * Update other specs that are asserting on alerts * Do not automatically refresh the page * This is only necessary if we're not in the state we need. The `waitFor` helper functions automatically reload whatever needs to be reloaded, so we're delegating this task to them. * Ensure we wait for alerts to be nonzero before our assertion * Otherwise we get some strange behavior around this field's availability; see previous commits * Remove unused import * Fix false positive in Rule Creation specs Threat Match Rules introduced an additional query input, causing our CUSTOM_QUERY_INPUT to be ambiguous. However, instead of failing due to the ambiguity, the behavior of cypress seems to be to pass! While I haven't yet tracked down the cause of these false positives, disambiguating these selectors is the immediate fix. Co-authored-by: Kibana Machine <[email protected]> # Conflicts: # x-pack/plugins/security_solution/cypress/integration/alerts_detection_rules_eql.spec.ts (cherry picked from commit 3fc1f8c)

elasticmachine · 2021-09-22T15:35:40Z

Pinging @elastic/security-solution (Team: SecuritySolution)

rylnd added Team:SIEM v8.0.0 release_note:skip Skip the PR/issue when compiling release notes Team:Detections and Resp Security Detection Response Team labels Oct 14, 2020

rylnd self-assigned this Oct 14, 2020

rylnd mentioned this pull request Oct 14, 2020

[Security Solution] [Detections] EQL rule cannot be created #80126

Closed

rylnd added v7.10.0 v7.11.0 labels Oct 14, 2020

rylnd marked this pull request as ready for review October 14, 2020 21:36

rylnd requested review from a team as code owners October 14, 2020 21:36

rylnd marked this pull request as draft October 14, 2020 21:45

rylnd added 5 commits October 16, 2020 19:42

Unskip EQL tests

659c34b

These _should_ be fixed with the latest ES on master, but let's see if CI disagrees.

More robust cypress assertions

9aca928

* Uses should with a text matcher instead of using invoke('text') * Use of not.equal between a string and an element may have been a false positive

rylnd force-pushed the fix_eql_cypress branch from 54cb278 to dff4645 Compare October 19, 2020 17:29

rylnd marked this pull request as ready for review October 19, 2020 17:31

Merge branch 'master' into fix_eql_cypress

2242714

MadameSheema approved these changes Oct 19, 2020

View reviewed changes

rylnd added 2 commits October 19, 2020 13:28

Remove unused import

245970d

rylnd merged commit b7ffefb into elastic:master Oct 20, 2020

rylnd deleted the fix_eql_cypress branch October 20, 2020 16:44

This was referenced Oct 20, 2020

[7.x] [Security Solution][Detections] Fix EQL cypress tests (#80440) #81204

Merged

[7.10] [Security Solution][Detections] Fix EQL cypress tests (#80440) #81211

Merged

rylnd mentioned this pull request Oct 20, 2020

[Security Solution] Adds EQL sequence rule test #79287

Merged

rylnd mentioned this pull request Oct 20, 2020

Failing test: Creates and activates a new EQL rule with a sequence - Detection rules, EQL Creates and activates a new EQL rule with a sequence #79522

Closed

MindyRS added the Team: SecuritySolution Security Solutions Team working on SIEM, Endpoint, Timeline, Resolver, etc. label Sep 22, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Security Solution][Detections] Fix EQL cypress tests #80440

[Security Solution][Detections] Fix EQL cypress tests #80440

rylnd commented Oct 14, 2020

elasticmachine commented Oct 14, 2020

rylnd commented Oct 14, 2020

rylnd commented Oct 19, 2020

rylnd commented Oct 19, 2020

MadameSheema left a comment

rylnd commented Oct 19, 2020

rylnd commented Oct 19, 2020

rylnd commented Oct 20, 2020

kibanamachine commented Oct 20, 2020

elasticmachine commented Sep 22, 2021

[Security Solution][Detections] Fix EQL cypress tests #80440

[Security Solution][Detections] Fix EQL cypress tests #80440

Conversation

rylnd commented Oct 14, 2020

Summary

Checklist

For maintainers

elasticmachine commented Oct 14, 2020

rylnd commented Oct 14, 2020

rylnd commented Oct 19, 2020

rylnd commented Oct 19, 2020

MadameSheema left a comment

Choose a reason for hiding this comment

rylnd commented Oct 19, 2020

rylnd commented Oct 19, 2020

rylnd commented Oct 20, 2020

kibanamachine commented Oct 20, 2020

💚 Build Succeeded

Metrics [docs]

History

elasticmachine commented Sep 22, 2021