
AD model performance benchmark #729

Merged

kaituo merged 1 commit into opensearch-project:2.4 from precision_2.4_2 on Nov 22, 2022

Conversation

@kaituo (Collaborator) commented Nov 14, 2022

Description

This PR adds an AD model performance benchmark so that we can compare model performance across versions.

For the single-stream detector, we refactored the tests in DetectionResultEvalutationIT and moved them to SingleStreamModelPerfIT.

For the HCAD detector, we randomly generated synthetic data with known anomalies inserted throughout the signal. In particular, these are one-, two-, and four-dimensional data sets in which each dimension is a noisy cosine wave. Anomalies are inserted into one dimension with probability 0.003, and anomalies across dimensions can be either independent or dependent. Each data set contains approximately 5,000 observations. The data sets are generated with a fixed random seed so that results are comparable across versions.
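For illustration, below is a minimal Java sketch of a generator along these lines; the class name, wave amplitude, period, and spike magnitude are assumptions for the sketch, not the PR's actual implementation.

import java.util.Random;

// Hypothetical sketch (not the PR's actual generator): each dimension is a
// noisy cosine wave, and with probability 0.003 an anomalous spike is
// injected into one randomly chosen dimension. A fixed seed makes the data
// identical across runs, so results stay comparable across versions.
public class SyntheticDataSketch {

    public static double[][] generate(int observations, int dimensions, long seed) {
        Random random = new Random(seed);
        double[][] data = new double[observations][dimensions];
        for (int t = 0; t < observations; t++) {
            for (int d = 0; d < dimensions; d++) {
                // base signal: cosine wave plus Gaussian noise
                data[t][d] = 10 * Math.cos(2 * Math.PI * t / 50.0) + random.nextGaussian();
            }
            if (random.nextDouble() < 0.003) {
                // known anomaly: a large additive spike in one dimension
                data[t][random.nextInt(dimensions)] += 50;
            }
        }
        return data;
    }

    public static void main(String[] args) {
        // roughly 5,000 observations per data set, generated with a fixed seed
        double[][] dataset = generate(5000, 2, 42L);
        System.out.println("generated " + dataset.length + " observations");
    }
}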

The PR also fixes a Jackson compatibility issue introduced by https://github.com/opensearch-project/OpenSearch/pull/5105/files. The issue caused runtime test failures like:

 2> org.gradle.internal.exceptions.DefaultMultiCauseException: Multiple Failures (2 failures)
        java.lang.NoSuchFieldError: EXACT_FLOATS
        java.lang.NullPointerException: <no message>
        at app//org.junit.vintage.engine.execution.TestRun.getStoredResultOrSuccessful(TestRun.java:196)
        at app//org.junit.vintage.engine.execution.RunListenerAdapter.fireExecutionFinished(RunListenerAdapter.java:226)
        at app//org.junit.vintage.engine.execution.RunListenerAdapter.testFinished(RunListenerAdapter.java:192)
        at app//org.junit.vintage.engine.execution.RunListenerAdapter.testFinished(RunListenerAdapter.java:79)

Testing done:

* added unit tests to run the benchmark.

Signed-off-by: Kaituo Li <[email protected]>

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@kaituo kaituo requested review from a team, amitgalitz and ohltyler November 14, 2022 22:12
@kaituo kaituo force-pushed the precision_2.4_2 branch 2 times, most recently from 71b2d16 to b4acc8a, on November 15, 2022 00:03
This PR adds an AD model performance benchmark so that we can compare model performance across versions.

Regarding benchmark data, we randomly generated synthetic data with known anomalies inserted throughout the signal. In particular, these are one-, two-, and four-dimensional data sets in which each dimension is a noisy cosine wave. Anomalies are inserted into one dimension with probability 0.003, and anomalies across dimensions can be either independent or dependent. Each data set contains approximately 5,000 observations. The data sets are generated with a fixed random seed so that results are comparable across versions.

We also backported opensearch-project#600 so that we can capture the performance data in CI output.

Testing done:
* added unit tests to run the benchmark.

Signed-off-by: Kaituo Li <[email protected]>
@@ -134,7 +134,7 @@ configurations.all {
     if (it.state != Configuration.State.UNRESOLVED) return
     resolutionStrategy {
         force "joda-time:joda-time:${versions.joda}"
-        force "com.fasterxml.jackson.core:jackson-core:2.13.4"
+        force "com.fasterxml.jackson.core:jackson-core:2.14.0"
Member
should we maybe just inherit jackson from core?

Collaborator (Author)
will check if it is feasible in the main branch PR and merge this branch PR.
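For reference, a minimal sketch of what inheriting the Jackson version from core could look like, assuming the opensearch.build plugin exposes core's version catalog as the versions map (the exact property names, e.g. versions.jackson, are assumptions):

// Hypothetical Gradle sketch: reuse the Jackson version OpenSearch core
// ships instead of hard-coding "2.14.0". Assumes the opensearch.build
// plugin populates `versions` with core's jackson entries.
configurations.all {
    resolutionStrategy {
        force "com.fasterxml.jackson.core:jackson-core:${versions.jackson}"
        force "com.fasterxml.jackson.core:jackson-databind:${versions.jackson_databind}"
    }
}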

@kaituo kaituo merged commit c6bf52c into opensearch-project:2.4 Nov 22, 2022
@opensearch-trigger-bot commented

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.x 2.x
# Navigate to the new working tree
cd .worktrees/backport-2.x
# Create a new branch
git switch --create backport/backport-729-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 c6bf52ced608689a0a10f3a81afaa53738aad5a0
# Push it to GitHub
git push --set-upstream origin backport/backport-729-to-2.x
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-729-to-2.x.

kaituo added a commit to kaituo/anomaly-detection-1 that referenced this pull request Nov 23, 2022
kaituo added a commit that referenced this pull request Dec 1, 2022