add test evaluating anomaly results #13

wnbts · 2019-12-19T01:32:53Z

This change adds an anomaly detection test that runs through the REST API against a synthetic dataset with known anomalies. It verifies the machine learning algorithms in the anomaly detection core, along with some basic functionality such as creating detectors.

kaituo · 2019-12-27T01:18:02Z

src/test/java/com/amazon/opendistroforelasticsearch/ad/e2e/DetectionResultEvalutationIT.java

+    private void verifyAnomaly(String datasetName, int intervalMinutes, int trainTestSplit, int shingleSize,
+        double minPrecision, double minRecall, double maxError) throws Exception {
+
+        RestClient client = client();


How many nodes does this cluster have? We want to test both single-node and multi-node clusters.

The default setting is a single node. I will add a multi-node test case in a separate PR.

kaituo · 2019-12-27T01:18:33Z

src/test/java/com/amazon/opendistroforelasticsearch/ad/e2e/DetectionResultEvalutationIT.java

+public class DetectionResultEvalutationIT extends ESRestTestCase {
+
+    public void testDataset() throws Exception {
+        verifyAnomaly("synthetic", 1, 1500, 8, .9, .9, 10);


How long does the test take to run?

about a minute

kaituo · 2019-12-27T01:21:05Z

src/test/java/com/amazon/opendistroforelasticsearch/ad/e2e/DetectionResultEvalutationIT.java

+                Map<String, Object> response = entityAsMap(client.performRequest(request));
+                double anomalyGrade = (double)response.get("anomalyGrade");
+                if (anomalyGrade > 0) {
+                    System.out.println("LLL," + begin + "," + anomalyGrade);


Replace with log?

removed in the update

kaituo · 2019-12-27T23:46:09Z

src/test/java/com/amazon/opendistroforelasticsearch/ad/e2e/DetectionResultEvalutationIT.java

+        double precision = positives > 0 ? truePositives / positives : 1;
+        assertTrue(precision >= minPrecision);
+
+        double recall = anomalies.size() > 0 ? positiveAnomalies / anomalies.size() : 1;


Could you add a comment to explain how you compute precision and recall? It's not easy to understand as precision is computed based on points and recall is based on windows.

added comments showing the definitions in the update
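As a rough illustration of the definitions discussed here, the sketch below computes both metrics from counts, assuming (as the comment above says) that precision is counted over individual data points and recall over labeled anomaly windows. The class, method, and parameter names are hypothetical and are not taken from the PR.

```java
/** Hypothetical helper; names are illustrative and not part of the PR. */
final class DetectionMetrics {
    private DetectionMetrics() {}

    /** Point-based precision: flagged points inside a labeled window / all flagged points. */
    static double precision(int truePositivePoints, int flaggedPoints) {
        // With nothing flagged there are no false alarms, so fall back to 1 (same fallback as the diff).
        return flaggedPoints > 0 ? (double) truePositivePoints / flaggedPoints : 1.0;
    }

    /** Window-based recall: labeled windows containing a flagged point / all labeled windows. */
    static double recall(int detectedWindows, int labeledWindows) {
        // With no labeled windows there is nothing to miss, so fall back to 1.
        return labeledWindows > 0 ? (double) detectedWindows / labeledWindows : 1.0;
    }
}
```

The explicit cast to `double` only matters if the counters are held as integers, where plain division would truncate.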

kaituo · 2019-12-28T00:01:54Z

src/test/java/com/amazon/opendistroforelasticsearch/ad/e2e/DetectionResultEvalutationIT.java

+            + " \"Feature1\": { \"type\": \"double\" }, \"Feature2\": { \"type\": \"double\" } } } }";
+        request.setJsonEntity(requestBody);
+        client.performRequest(request);
+        Thread.sleep(1_000);


Is it possible that sleeping for 1 second is sometimes not enough? Have you tried running the test multiple times (say, 100 times)? If so, does it always pass? I am afraid that concurrency and inter-process communication (if you are running the test in a simulated multi-node cluster) would sometimes cause the test to fail.

Also, would https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-refresh.html help?

I have run it many times (certainly more than 30 but fewer than 100) without failure, and it has passed the GitHub checks multiple times. In my experience, it has been more reliable than some randomized unit tests.
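One common alternative to a fixed sleep is to poll for the expected condition with a timeout. The sketch below is a generic helper, not code from this PR; `Waits` and `waitUntil` are hypothetical names, and the condition passed in (for example, "the index returns the expected number of documents") is left to the caller. The refresh API linked above (`POST /<index>/_refresh`) is a separate option that makes recent writes visible to search.

```java
import java.util.function.BooleanSupplier;

/** Hypothetical polling helper; not part of the PR. */
final class Waits {
    private Waits() {}

    /**
     * Re-checks the condition every pollMillis until it holds or timeoutMillis elapses.
     * Returns true if the condition held, false on timeout or interruption.
     */
    static boolean waitUntil(BooleanSupplier condition, long timeoutMillis, long pollMillis) {
        long deadline = System.nanoTime() + timeoutMillis * 1_000_000L;
        while (true) {
            if (condition.getAsBoolean()) {
                return true;
            }
            if (System.nanoTime() - deadline >= 0) {
                return false;
            }
            try {
                Thread.sleep(pollMillis);
            } catch (InterruptedException e) {
                // Preserve the interrupt flag and give up waiting.
                Thread.currentThread().interrupt();
                return false;
            }
        }
    }
}
```

A test would then assert on the return value, so it fails with a clear timeout instead of proceeding with data that is not yet searchable.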

jmazanec15

Looks good to me. A few minor suggestions.

jmazanec15 · 2020-01-02T23:33:11Z

src/test/java/com/amazon/opendistroforelasticsearch/ad/e2e/DetectionResultEvalutationIT.java

+        for (int i = trainTestSplit; i < data.size(); i++) {
+            Instant begin = Instant.from(DateTimeFormatter.ISO_INSTANT.parse(data.get(i).get("timestamp").getAsString()));
+            Instant end = begin.plus(intervalMinutes, ChronoUnit.MINUTES);
+            String requestBody = String.format("{ \"period_start\": %d, \"period_end\": %d }", begin.toEpochMilli(), end.toEpochMilli());


To remove duplicated code, it may be worth adding a function that runs a detector given a detectorID, period_start, and period_end and returns the response.
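A minimal sketch of what such a helper might look like follows. The request body format is copied from the diff above; everything else (`DetectorRuns`, `runRequestBody`, `runDetector`, and the `transport` function standing in for the REST client call) is hypothetical and only illustrates the suggested refactoring.

```java
import java.time.Instant;
import java.time.temporal.ChronoUnit;
import java.util.Locale;
import java.util.function.BiFunction;

/** Hypothetical refactoring sketch; names are illustrative and not part of the PR. */
final class DetectorRuns {
    private DetectorRuns() {}

    /** Builds the JSON body for one detection interval starting at begin. */
    static String runRequestBody(Instant begin, int intervalMinutes) {
        Instant end = begin.plus(intervalMinutes, ChronoUnit.MINUTES);
        return String.format(Locale.ROOT, "{ \"period_start\": %d, \"period_end\": %d }",
            begin.toEpochMilli(), end.toEpochMilli());
    }

    /**
     * Runs a detector for one interval. The transport function is an assumption:
     * it receives (detectorId, requestBody) and returns the parsed response,
     * for example by wrapping the REST client call used in the test.
     */
    static <R> R runDetector(String detectorId, Instant begin, int intervalMinutes,
                             BiFunction<String, String, R> transport) {
        return transport.apply(detectorId, runRequestBody(begin, intervalMinutes));
    }
}
```

Each loop in the test could then call `runDetector` instead of building the request inline.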

jmazanec15 · 2020-01-02T23:35:18Z

src/test/java/com/amazon/opendistroforelasticsearch/ad/e2e/DetectionResultEvalutationIT.java

+                e.printStackTrace();
+            }
+        }
+        return new double[] {positives, truePositives, positiveAnomalies.size(), errors};


Will truePositives and positiveAnomalies.size() always be the same?

No. truePositives counts correctly flagged data points, while positiveAnomalies counts correctly detected anomaly windows. The former is never less than the latter.
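The distinction between the two counts can be illustrated with a small sketch. It assumes labeled windows are non-overlapping, inclusive `[start, end]` ranges of epoch milliseconds; the `WindowCounts` class and its fields are hypothetical and do not reproduce the PR's implementation.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical counting sketch; assumes non-overlapping labeled windows. */
final class WindowCounts {
    final int truePositivePoints; // flagged points that fall inside some labeled window
    final int detectedWindows;    // labeled windows containing at least one flagged point

    private WindowCounts(int truePositivePoints, int detectedWindows) {
        this.truePositivePoints = truePositivePoints;
        this.detectedWindows = detectedWindows;
    }

    /** Each window is a two-element array {startMillis, endMillis}, both inclusive. */
    static WindowCounts count(List<Long> flaggedTimestamps, List<long[]> windows) {
        int points = 0;
        Set<Integer> hitWindows = new HashSet<>();
        for (long t : flaggedTimestamps) {
            boolean insideAnyWindow = false;
            for (int w = 0; w < windows.size(); w++) {
                long[] window = windows.get(w);
                if (t >= window[0] && t <= window[1]) {
                    insideAnyWindow = true;
                    hitWindows.add(w);
                }
            }
            if (insideAnyWindow) {
                points++;
            }
        }
        return new WindowCounts(points, hitWindows.size());
    }
}
```

Several flagged points inside the same window raise the point count but add only one detected window, which is why the first count cannot be smaller than the second under the non-overlap assumption.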

jmazanec15 · 2020-01-02T23:38:09Z

src/test/java/com/amazon/opendistroforelasticsearch/ad/e2e/DetectionResultEvalutationIT.java

+                } catch (Exception e ) {
+                    throw new RuntimeException(e);
+                } });
+        Thread.sleep(1000);


add test evaluating anomaly results

ed319b9

wnbts marked this pull request as ready for review December 19, 2019 01:37

kaituo reviewed Dec 28, 2019

updates

de02f89

jmazanec15 approved these changes Jan 2, 2020

updates

834b8b0

kaituo approved these changes Jan 11, 2020

wnbts merged commit 2af2ee1 into opendistro-for-elasticsearch:development Jan 11, 2020

wnbts deleted the e2etest branch January 11, 2020 00:24

wnbts restored the e2etest branch January 11, 2020 00:28

wnbts mentioned this pull request Jan 11, 2020

Revert "add test evaluating anomaly results" #27

Closed

wnbts deleted the e2etest branch May 26, 2020 18:23

wnbts commented Dec 19, 2019

kaituo Dec 27, 2019

wnbts Dec 28, 2019

kaituo Dec 27, 2019

wnbts Dec 28, 2019

kaituo Dec 27, 2019

wnbts Dec 28, 2019

kaituo Dec 27, 2019

wnbts Dec 28, 2019

kaituo Dec 28, 2019

wnbts Dec 28, 2019

jmazanec15 left a comment

jmazanec15 Jan 2, 2020

wnbts Jan 3, 2020

jmazanec15 Jan 2, 2020

wnbts Jan 3, 2020

jmazanec15 Jan 2, 2020

wnbts Jan 3, 2020
