[ML] Improve messaging and support for datafeed using aggregated and scripted fields #84594

Merged

qn895 merged 35 commits into elastic:master from qn895:ml-messaging-unsupported-aggs-script

Dec 10, 2020

Member

qn895 commented Nov 30, 2020

Summary

This PR adds better messaging for unsupported configurations and improved support for aggregated and scripted fields. Changes include:

Add better messaging on why some jobs cannot be viewed in the Single Metric Viewer

Add the ability to create and view a job when the datafeed's aggregation uses a name other than buckets (e.g. an arbitrary name like 'my_buckets'). Previously, this was only supported via the API. This PR lists the fields correctly in the job creation wizard and makes the charts plot correctly.
Add the ability to plot Anomaly Explorer charts from model plot data for jobs with aggregated or scripted fields when the source data is not chartable.
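
To illustrate the arbitrary bucket name case, here is a minimal sketch of how the date histogram aggregation could be located without assuming it is called buckets. The `AggregationNode` type and the helpers `findDateHistogramAgg` and `listNestedAggNames` are hypothetical names used only for illustration; they are not taken from the Kibana code in this PR.

```typescript
// Hypothetical, simplified shape of a datafeed aggregation node (an assumption,
// not the real Kibana or Elasticsearch type definitions).
interface AggregationNode {
  date_histogram?: { field: string; [key: string]: unknown };
  aggregations?: Record<string, AggregationNode>;
  aggs?: Record<string, AggregationNode>;
  [aggType: string]: unknown;
}

// Finds the first top-level aggregation that contains a date_histogram,
// whatever its name is ('buckets', 'my_buckets', ...).
function findDateHistogramAgg(
  aggs: Record<string, AggregationNode> | undefined
): [string, AggregationNode] | undefined {
  return Object.entries(aggs ?? {}).find(([, node]) => node.date_histogram !== undefined);
}

// Lists the names of the sub-aggregations nested under the date histogram.
// Both the 'aggregations' and the 'aggs' spelling are accepted.
function listNestedAggNames(aggs: Record<string, AggregationNode> | undefined): string[] {
  const found = findDateHistogramAgg(aggs);
  if (found === undefined) return [];
  const [, node] = found;
  return Object.keys(node.aggregations ?? node.aggs ?? {});
}

// Example datafeed aggregation using an arbitrary bucket name.
const datafeedAggs: Record<string, AggregationNode> = {
  my_buckets: {
    date_histogram: { field: '@timestamp', fixed_interval: '15m' },
    aggregations: {
      '@timestamp': { max: { field: '@timestamp' } },
      DiskReadBytesSum: { sum: { field: 'DiskReadBytes' } },
    },
  },
};

console.log(findDateHistogramAgg(datafeedAggs)?.[0]);
console.log(listNestedAggNames(datafeedAggs));
```

Looking the aggregation up by its content rather than by a fixed key is what lets a wizard or chart treat 'my_buckets' the same way as 'buckets'.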

[Screenshots: Before | After]

Add fix so that disabled buttons can no longer be clicked

Checklist

Delete any items that are not applicable to this PR.

Any text added follows EUI's writing guidelines, uses sentence case text and includes i18n support

For maintainers

This was checked for breaking API changes and was labeled appropriately

qn895 added 10 commits

November 17, 2020 14:15


          [ML] Remove nested Link and Button because we don't need it anymore

6e9a007


          [ML] Add isSupportedTimeSeriesViewJob check

4b9b294


          Merge remote-tracking branch 'upstream/master' into ml-messaging-unsupported-aggs-script

7c5110b


          [ML] Add message

17c3d19


          Merge remote-tracking branch 'upstream/master' into ml-messaging-unsupported-aggs-script

be013e9


          [ML] Add support for my_buckets in job creator wizard

5f5860b


          [ML] Add support for my_buckets in job creator wizard

30324ba


          [ML] Add support for my_buckets in job creator wizard

12db65c


          [ML] Fix so button not disabled with model plot & update message

2e48175


          [ML] Update i18n id

8d7af66

qn895 added release_note:enhancement :ml Feature:Anomaly Detection v8.0.0 v7.11.0 labels

qn895 requested review from walterra and peteharverson

November 30, 2020 23:33

qn895 self-assigned this

qn895 requested a review from a team as a code owner

November 30, 2020 23:33

Contributor

elasticmachine commented Nov 30, 2020

Pinging @elastic/ml-ui (:ml)

qn895 requested a review from lcawl

November 30, 2020 23:33

lcawl reviewed

x-pack/plugins/ml/common/util/job_utils.ts Outdated

+                          'xpack.ml.timeSeriesJob.varyingBucketSpanAggregationInterval',
+                          {
+                            defaultMessage:
+                              'bucket span and aggregation interval is not the same for datafeed with aggregation fields',

Contributor

lcawl Dec 1, 2020

I think this could be clarified a bit. For example:

           'the datafeed has aggregation fields and the aggregation interval is not the same as the bucket span',

or:

           'bucket span and aggregation interval are not the same for a datafeed with aggregation fields',

I prefer the first suggestion.

Contributor

peteharverson Dec 1, 2020

My vote goes to the first suggestion from @lcawl

Member Author

qn895 Dec 7, 2020

Updated here 02a579b

peteharverson reviewed

x-pack/plugins/ml/common/util/job_utils.ts

x-pack/plugins/ml/public/application/components/custom_hooks/use_create_ad_links.ts

...rame_analytics/pages/analytics_creation/components/back_to_list_panel/back_to_list_panel.tsx

peteharverson reviewed

x-pack/plugins/ml/common/util/job_utils.ts Outdated

+                        }
+                        // if aggregation interval is different from bucket span
+                        const datetimeBucket = aggs[aggBucketsName].date_histogram;

Contributor

peteharverson Dec 1, 2020

For this use case, I think we should enable access to the Single Metric Viewer, and show the charts in the Anomaly Explorer, but display a Toast / Callout which explains that the numbers on the chart may differ from the values reported for the anomaly. For example, this chart provides a lot of value:

Contributor

peteharverson Dec 3, 2020

Update - as discussed, we should just show the charts in this case, with no Toast / Callout about the differing bucket span and aggregation interval, as otherwise they would show up for jobs created in the Single Metric wizard, where we set the agg interval to 10% of the bucket span. We can address that edge case in a future PR, to show a toast warning only if the function used in the detector is different to that used in the datafeed aggregation.

peteharverson reviewed

.../ml/public/application/timeseriesexplorer/timeseriesexplorer_utils/validate_job_selection.ts

peteharverson reviewed

x-pack/plugins/ml/common/util/job_utils.ts Outdated

               // Returns a flag to indicate whether the job is suitable for viewing
               // in the Time Series dashboard.
-              export function isTimeSeriesViewJob(job: CombinedJob): boolean {
+              export function isTimeSeriesViewableJob(job: CombinedJob): boolean {

Contributor

peteharverson Dec 1, 2020

I noticed that some of my jobs are greyed out in the job selector in the Single Metric Viewer when they shouldn't be. Is this because some of the logic in here is not quite correct? Jobs which don't use partitioning fields are being disabled here:

Member Author

qn895 Dec 7, 2020

Removed here 15cc5ed

qn895 added 4 commits

December 1, 2020 11:04


          Merge remote-tracking branch 'upstream/master' into ml-messaging-unsupported-aggs-script

d4ffb10


          [ML] Update text

02a579b


          [ML] Remove check when agg interval not same as bucket span

b47459d


          [ML] Disable SMV button

8ce8b5f

peteharverson reviewed

.../plugins/ml/public/application/explorer/explorer_charts/explorer_charts_container_service.js Outdated

+                      jobsErrorMessage[record.job_id] = i18n.translate(
+                        'xpack.ml.timeSeriesJob.sourceDataModelPlotNotChartableMessage',
+                        {
+                          defaultMessage: 'both source data and model plot not chartable for this detector',

Contributor

peteharverson Dec 8, 2020

I think this needs 'are' inserted: source data and model plot are not chartable

Member Author

qn895 Dec 8, 2020

Updated here ce964f5

peteharverson reviewed

...ck/plugins/ml/public/application/explorer/explorer_charts/explorer_charts_error_callouts.tsx Outdated

+                  <EuiCallOut color={'warning'} size="s">
+                    <FormattedMessage
+                      id="xpack.ml.explorerCharts.errorCallOutMessage"
+                      defaultMessage="You can't view anomaly records for {jobs} because {reason}."

Contributor

peteharverson Dec 8, 2020

Swap the word records for charts.

Member Author

qn895 Dec 8, 2020

Updated here ce964f5
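
For readers following the two threads above, a rough sketch of how per-job reasons could be turned into callout text. It assumes a plain map from job ID to reason string, similar in spirit to the `jobsErrorMessage` object in the diff, and assumes jobs sharing a reason are grouped into one message. The function name `buildCalloutMessages`, the grouping behavior, and the sample job IDs are illustrative assumptions; the real component uses `FormattedMessage` with i18n rather than string templates.

```typescript
// Hypothetical helper: groups job IDs by their error reason and renders one
// message per reason, using the wording suggested in the review ("charts").
function buildCalloutMessages(jobsErrorMessage: Record<string, string>): string[] {
  const jobsByReason = new Map<string, string[]>();
  for (const [jobId, reason] of Object.entries(jobsErrorMessage)) {
    const jobs = jobsByReason.get(reason) ?? [];
    jobs.push(jobId);
    jobsByReason.set(reason, jobs);
  }
  // Map preserves insertion order, so messages follow the order in which
  // each reason was first seen.
  return Array.from(
    jobsByReason,
    ([reason, jobs]) => `You can't view anomaly charts for ${jobs.join(', ')} because ${reason}.`
  );
}

// Made-up job IDs, for illustration only.
const sampleErrors: Record<string, string> = {
  job_a: 'source data and model plot are not chartable',
  job_b: 'source data and model plot are not chartable',
};

console.log(buildCalloutMessages(sampleErrors));
```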

qn895 added 6 commits

December 8, 2020 11:01


          [ML] Fix source data check


          [ML] Fix callout missing key

bddd17d


          [ML] Remove nested terms check and replace with more specific guard

a42acb3


          Merge remote-tracking branch 'upstream/master' into ml-messaging-unsupported-aggs-script

f8cdf1d


          [ML] Update messages

ce964f5


          [ML] Add functional test for supported and unsupported jobs with aggregated/scripted fields

10ea7e4

qn895 requested a review from pheyos

December 8, 2020 23:47

Member Author

qn895 commented Dec 8, 2020

As discussed, I have updated the logic to be less strict, so that a job is not necessarily disabled when there are nested terms aggregations or when the bucket span != aggregation interval. This is so that examples like the one below will still show up, without callouts, despite the anomaly markers not plotting correctly:

{
  "datafeed_id": "",
  "job_id": "",
  "indices": [
    "cloudwatch-*"
  ],
  "query": {
    "bool": {
      "must": [
        {
          "match_all": {}
        }
      ]
    }
  },
  "aggregations": {
    "buckets": {
      "date_histogram": {
        "field": "@timestamp",
        "interval": "15m",
        "time_zone": "UTC"
      },
      "aggregations": {
        "@timestamp": {
          "max": {
            "field": "@timestamp"
          }
        },
        "instance": {
          "terms": {
            "field": "instance"
          },
          "aggs": {
            "DiskReadBytesSum": {
              "sum": {
                "field": "DiskReadBytes"
              }
            }
          }
        }
      }
    }
  }
}

{
  "analysis_config": {
    "bucket_span": "30m",
    "summary_count_field_name": "doc_count",
    "detectors": [
      {
        "detector_description": "mean DiskReadBytesSum partition instance",
        "function": "mean",
        "field_name": "DiskReadBytesSum",
        "partition_field_name": "instance",
        "detector_index": 0
      }
    ],
    "influencers": [
      "instance"
    ]
  }
}
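
In the example above, the datafeed's date histogram interval is 15m while the job's bucket_span is 30m. A minimal sketch of how such a mismatch could be detected is shown below. `parseIntervalMs` and `intervalsDiffer` are hypothetical helpers written for this illustration; they handle only simple fixed units and are not the interval parser Kibana uses. Per the discussion in this PR, a mismatch like this no longer disables the charts.

```typescript
// Milliseconds per supported unit (simple fixed units only; an assumption
// made for this sketch).
const UNIT_MS: Record<string, number> = {
  ms: 1,
  s: 1000,
  m: 60000,
  h: 3600000,
  d: 86400000,
};

// Hypothetical helper: converts strings such as "15m" or "1h" to milliseconds.
function parseIntervalMs(interval: string): number {
  const match = /^(\d+)(ms|s|m|h|d)$/.exec(interval.trim());
  if (match === null) {
    throw new Error(`Unsupported interval: ${interval}`);
  }
  const [, amount, unit] = match;
  return Number(amount) * UNIT_MS[unit];
}

// True when the job's bucket span and the datafeed's aggregation interval
// cover different lengths of time.
function intervalsDiffer(bucketSpan: string, aggInterval: string): boolean {
  return parseIntervalMs(bucketSpan) !== parseIntervalMs(aggInterval);
}

// Values taken from the example configuration above.
console.log(intervalsDiffer('30m', '15m'));
```

Comparing the parsed durations rather than the raw strings means that equivalent spellings such as 1h and 60m are treated as equal.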

walterra approved these changes

Contributor

walterra left a comment

Latest changes LGTM, great you added some functional tests too! 👍

peteharverson approved these changes

Contributor

peteharverson left a comment

Tested latest edits and LGTM!

pheyos reviewed

x-pack/test/functional/apps/ml/anomaly_detection/aggregated_scripted_job.ts Outdated

x-pack/test/functional/apps/ml/anomaly_detection/aggregated_scripted_job.ts

x-pack/test/functional/apps/ml/anomaly_detection/aggregated_scripted_job.ts Outdated

x-pack/test/functional/services/ml/job_selection.ts Outdated

qn895 added 2 commits

December 9, 2020 10:26


          [ML] Update test

49899e9


          Merge remote-tracking branch 'upstream/master' into ml-messaging-unsupported-aggs-script

53d06b6

Member Author

qn895 commented Dec 9, 2020

Started flaky test suite runner...

pheyos reviewed

x-pack/test/functional/services/ml/job_selection.ts Outdated

x-pack/test/functional/apps/ml/anomaly_detection/aggregated_scripted_job.ts Outdated


          [ML] Move assertDisabledJobReasonWarningToastExist and update assertJobSelectionNotContains

c95a44a

Member Author

qn895 commented Dec 10, 2020

Started flaky test suite runner...

pheyos approved these changes

Member

pheyos left a comment

Functional tests LGTM, thanks for adding them as part of this PR! 🎉
Just one small nit after the latest change, but feel free to merge as is if you want.

x-pack/test/functional/services/ml/job_selection.ts Outdated

qn895 added 2 commits

December 10, 2020 09:44


          [ML] Update actualJobOrGroupLabels

0a30cd3


          Merge remote-tracking branch 'upstream/master' into ml-messaging-unsupported-aggs-script

eb36a3a

Contributor

kibanamachine commented Dec 10, 2020

💚 Build Succeeded

continuous-integration/kibana-ci/pull-request
Commit: eb36a3a

Metrics [docs]

Module Count

Fewer modules leads to a faster build time

id	before	after	diff
`ml`	1592	1593	+1

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id	before	after	diff
`ml`	5.2MB	5.3MB	+2.8KB

Distributable file count

id	before	after	diff
`default`	47010	47770	+760

History

💚 Build #93257 succeeded c95a44a
💚 Build #93161 succeeded 53d06b6
💔 Build #92951 failed 10ea7e4
💚 Build #92801 succeeded 3835583
💚 Build #92408 succeeded 18df04e

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

qn895 merged commit 008a420 into elastic:master

qn895 deleted the ml-messaging-unsupported-aggs-script branch

December 10, 2020 17:35

This was referenced Dec 10, 2020

[7.x] [ML] Improve messaging and support for datafeed using aggregated and scripted fields (#84594) #85613

Merged

[ML] Anomalies chart doesn't plot values for datafeeds comprising aggregations #65223

Closed

qn895 added a commit to qn895/kibana that referenced this pull request


          [ML] Improve messaging and support for datafeed using aggregated and scripted fields (elastic#84594)

8c198ac

qn895 mentioned this pull request

[ML] Support plot of metric data when datafeed uses scripted field #18464

Closed

qn895 added a commit that referenced this pull request


          [7.x] [ML] Improve messaging and support for datafeed using aggregated and scripted fields (#84594) (#85613)

Labels

Feature:Anomaly Detection :ml release_note:enhancement v7.11.0 v8.0.0