This repository has been archived by the owner on Aug 2, 2022. It is now read-only.

Fix long initialization issues in anomaly detection #133

Closed
ylwu-amzn opened this issue May 13, 2020 · 12 comments
Labels
AnomalyDetection Item related to Anomaly Detection and AD Kibana plugin enhancement Enhance current feature for better performance, user experience, etc

Comments

@ylwu-amzn
Contributor

ylwu-amzn commented May 13, 2020

The detector initialization process needs at least 6 data points across 8 consecutive intervals to complete the shingle process. If there is no data, or not enough data, the user may experience a long initialization period. We should tune the error message to show something like "no data found" or "not enough data", so the user knows why initialization is taking a long time.

We can query feature data when creating a detector to make sure there is enough data, and we can add a maximum empty-query limit to stop the detector.
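As a rough illustration of that pre-flight check (the function and its shape are hypothetical, not the plugin's actual API): bucket the raw document timestamps by detector interval and verify that each of the most recent consecutive intervals has data, mirroring the shingle requirement described above.

```python
from collections import Counter

def has_enough_data(timestamps, interval_minutes, now_ms, required=8):
    """Illustrative sketch: return True if each of the last `required`
    detector intervals (ending at now_ms) contains at least one data point.
    `timestamps` are document timestamps in epoch milliseconds."""
    interval_ms = interval_minutes * 60 * 1000
    # Bucket each timestamp by how many intervals back from now_ms it falls.
    counts = Counter((now_ms - t) // interval_ms for t in timestamps if t <= now_ms)
    # All of the most recent `required` intervals must be non-empty.
    return all(counts[i] > 0 for i in range(required))
```

A real implementation would run this as a date-histogram aggregation against the source index rather than pulling raw timestamps, but the pass/fail condition is the same.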

@ylwu-amzn ylwu-amzn added the enhancement Enhance current feature for better performance, user experience, etc label May 13, 2020
@ylwu-amzn ylwu-amzn changed the title Tune callout message for long initialization Long initialization May 13, 2020
@ylwu-amzn
Contributor Author

We should add real auto refresh. Currently the initialization state just shows a loading spinner and does not send requests to query the latest state.

@yizheliu-amazon
Contributor

Currently we just show a message in the UI if long initialization is found, but don't stop the detector. In the long term, I think we should stop the detector from the backend if we identify long initialization, and treat it as an initialization failure if possible.

@epotocko

Could existing data in the Elasticsearch index be used to initialize the detector? From my usage, I've only seen records created while the detector is running used for anomaly detection. I would have expected it to use all data available in the index, or at least a subset of recent data.

@ylwu-amzn
Contributor Author

> Could existing data in the ElasticSearch index be used to initialize the detector? From my usage, I've only seen records created while the detector is running used for anomaly detection. I would have expected it to use all data available in the index or at least a subset of recent data.

Good question. @wnbts , can you help explain?

@wnbts
Contributor

wnbts commented May 20, 2020

Hi @epotocko, the current "initialization" page is misleading/overloaded. The system does use existing data to complete model training on the backend. However, the real-time data stream might be the root cause. If you use the REST API, do you see results being produced?

@epotocko

Which API call are you referring to? Shortly after creating and starting a detector with 4+ months of data points and a 60-minute detector interval:
The _preview API returns: { "anomaly_result" : [ ]..........
The _profile API returns: { "state" : "INIT" }

I can reproduce this consistently with different data sets. The _profile API will always return an INIT state until enough "new" records are received.

@wnbts
Contributor

wnbts commented May 21, 2020

@epotocko I was referring to the get anomaly results API. Do you see anomaly results and feature values since the start of the detector? Also, share maybe 10~20 recent results if you can, so we can see whether there are any errors. That will help us tell whether it's a data stream issue or a system issue. Thanks!

@epotocko

@wnbts Immediately after creating the detector, the get anomaly results API returns 0 hits. I checked about an hour later, and there was one hit with the error "No full shingle in current detection window".

I have the detector interval set to 60 minutes. I checked the Elasticsearch data, and every 60-minute period has at least 50 records. Let me know if that sounds like the expected behavior.

@wnbts
Contributor

wnbts commented May 21, 2020

@epotocko thanks so much, I understand the situation much better now. It is functioning as expected: the system is currently trying to collect 8 points from the real-time stream before it can actually produce results. With your configuration, that will take roughly 6~8 hours. We do have an ongoing discussion about using indexed data to speed up that data collection process; I am going to create an issue to detail this behavior. Also, keep me posted on what the results look like after 8 hours.
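The back-of-envelope arithmetic behind that estimate (illustrative only): one real-time point arrives per detector interval, so the upper bound on the wait is simply points needed times interval length.

```python
def init_wait_hours(interval_minutes, points_needed=8):
    """Upper-bound estimate of initialization wait: the detector collects
    one real-time data point per interval, and needs `points_needed` points
    before it can produce results."""
    return interval_minutes * points_needed / 60

# With a 60-minute interval and 8 required points, the wait is up to 8 hours,
# consistent with the "roughly 6~8 hours" figure above.
print(init_wait_hours(60))
```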

yizheliu-amazon added a commit that referenced this issue May 22, 2020
* Add proper message in case of long initialization. Issue:#133

* Remove 'sufficient' to avoid confusion
ohltyler pushed a commit that referenced this issue May 27, 2020
* Add proper message in case of long initialization. Issue:#133

* Remove 'sufficient' to avoid confusion
@sean-zheng-amazon sean-zheng-amazon added the AnomalyDetection Item related to Anomaly Detection and AD Kibana plugin label Jun 10, 2020
@yizheliu-amazon
Contributor

> We should add real auto refresh. Currently the initialization state just show a loading spinner, will not send request to query latest state.

Added in PR: #232
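For reference, the auto-refresh idea amounts to a polling loop: instead of a static spinner, re-query the detector `_profile` state until it leaves INIT. A minimal sketch (illustrative Python with an injected fetcher, not the actual Kibana plugin code):

```python
import time

def wait_for_init(fetch_state, poll_seconds=30, max_polls=20, sleep=time.sleep):
    """Poll the detector state until it leaves INIT or we give up.
    `fetch_state` is a callable that returns the current _profile state
    string (e.g. "INIT", "RUNNING"); injected so the loop is testable."""
    for _ in range(max_polls):
        state = fetch_state()
        if state != "INIT":
            return state
        sleep(poll_seconds)
    return "INIT"  # still initializing after max_polls attempts
```

In the UI the same shape is implemented with a timer rather than a blocking loop, but the key change is identical: each tick issues a fresh state request instead of showing a spinner indefinitely.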

@sean-zheng-amazon sean-zheng-amazon changed the title Long initialization Fix long initialization Jun 24, 2020
@anirudha anirudha changed the title Fix long initialization Fix long initialization issues in anomaly detection Jul 8, 2020
@ohltyler
Contributor

With PRs #248 and #253 merged, how do we feel about marking this as complete, or closing it and creating a new, updated issue?

@ohltyler
Contributor

Closing this issue because of the different initialization callouts and progress percentage changes that have been added.


6 participants