[ML] DFA exploration: ROC chart not showing for Classification jobs with non default results field #96603

alvarezmelissa87 · 2021-04-08T16:28:18Z

Found in the latest Kibana.

Describe the bug:
When viewing the exploration page of a DFA classification job with results_field set to something other than the default (ml), the ROC chart fails to load and shows an error callout.

Steps to reproduce:

1. Use the mushroom dataset to create a Classification job via the DFA wizard
2. Set dependent variable to edibility
3. Set results_field to something other than the default value (in my example I set it to "bob")
4. Open the results view and see that the ROC chart doesn't load

Expected behavior:
ROC chart should load correctly

Errors in browser console (if relevant):
Request sent to _evaluate endpoint:

{
   "index":"mushroom-class-01",
   "query":{
      "bool":{
         "must":[
            
         ]
      }
   },
   "evaluation":{
      "classification":{
         "actual_field":"edibility",
         "predicted_field":"bob.edibility_prediction",
         "metrics":{
            "accuracy":{
               
            },
            "recall":{
               
            },
            "auc_roc":{
               "include_curve":true,
               "class_name":"e"
            }
         }
      }
   }
}

Error message returned from _evaluate endpoint:

{
   "statusCode":400,
   "error":"Bad Request",
   "message":"[status_exception]: No documents found containing all the required fields [edibility, bob.edibility_prediction, ml.top_classes.class_name, ml.top_classes.class_probability]",
   "attributes":{
      "body":{
         "error":{
            "root_cause":[
               {
                  "type":"status_exception",
                  "reason":"No documents found containing all the required fields [edibility, bob.edibility_prediction, ml.top_classes.class_name, ml.top_classes.class_probability]"
               }
            ],
            "type":"status_exception",
            "reason":"No documents found containing all the required fields [edibility, bob.edibility_prediction, ml.top_classes.class_name, ml.top_classes.class_probability]"
         },
         "status":400
      }
   }
}


elasticmachine · 2021-04-08T16:28:31Z

Pinging @elastic/ml-ui (:ml)

alvarezmelissa87 · 2021-04-08T16:56:11Z

@dimitris-athanasiou - I took a look at the evaluate docs and I don't see any specific examples for when results_field is not the default 'ml' value.

It also looks like the request we send is the same for Classification jobs whether results_field is the default or a non-default value. Do you have any insight into what we might be missing in the request to _evaluate?

dimitris-athanasiou · 2021-04-09T12:12:29Z

For classification, when the results field is different, you also need to provide the path to the top_classes field. As the _evaluate API is decoupled from the job config (in order to allow usage with indices not created with a DFA job), the API doesn't know of the job's results field. Basically, the request should be:

{
   "index":"mushroom-class-01",
   "query":{
      "bool":{
         "must":[
            
         ]
      }
   },
   "evaluation":{
      "classification":{
         "actual_field":"edibility",
         "predicted_field":"bob.edibility_prediction",
         "top_classes_field": "bob.top_classes",
         "metrics":{
            "accuracy":{
               
            },
            "recall":{
               
            },
            "auc_roc":{
               "include_curve":true,
               "class_name":"e"
            }
         }
      }
   }
}

Note that the UI app could always set top_classes_field to {results_field}.top_classes, which resolves to ml.top_classes when the default results_field is used. That avoids any if-logic.
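
The suggestion above could be sketched roughly as follows. This is only an illustration, not the actual Kibana code: `buildRocEvaluateBody`, its parameter names, and the `ClassificationEvaluateBody` type are hypothetical, and it assumes the prediction field uses the `{dependent_variable}_prediction` name seen in the request above. The point is that `top_classes_field` is always derived from the results field, so the default and non-default cases go through the same path.

```typescript
// Hypothetical helper for illustration only; not the actual Kibana implementation.
const DEFAULT_RESULTS_FIELD = 'ml';

interface ClassificationEvaluateBody {
  index: string;
  query: Record<string, unknown>;
  evaluation: {
    classification: {
      actual_field: string;
      predicted_field: string;
      top_classes_field: string;
      metrics: Record<string, Record<string, unknown>>;
    };
  };
}

function buildRocEvaluateBody(params: {
  index: string;
  dependentVariable: string;
  className: string;
  resultsField?: string; // falls back to 'ml' when the job uses the default
}): ClassificationEvaluateBody {
  const resultsField = params.resultsField ?? DEFAULT_RESULTS_FIELD;
  return {
    index: params.index,
    query: { bool: { must: [] } },
    evaluation: {
      classification: {
        actual_field: params.dependentVariable,
        // Assumption: the job uses the "<dependent_variable>_prediction" field name.
        predicted_field: `${resultsField}.${params.dependentVariable}_prediction`,
        // Always set explicitly, so no branching on default vs. custom results_field.
        top_classes_field: `${resultsField}.top_classes`,
        metrics: {
          accuracy: {},
          recall: {},
          auc_roc: { include_curve: true, class_name: params.className },
        },
      },
    },
  };
}

// Reproduces the request body for the "bob" example in this issue.
const exampleBody = buildRocEvaluateBody({
  index: 'mushroom-class-01',
  dependentVariable: 'edibility',
  className: 'e',
  resultsField: 'bob',
});
```

With `resultsField` omitted, the same call yields `ml.top_classes`, which matches what _evaluate already assumes when `top_classes_field` is not provided.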

alvarezmelissa87 added the bug Fixes for quality problems that affect the customer experience label Apr 8, 2021

botelastic bot added the needs-team Issues missing a team label label Apr 8, 2021

alvarezmelissa87 added :ml Feature:Data Frame Analytics ML data frame analytics features v7.13.0 labels Apr 8, 2021

botelastic bot removed the needs-team Issues missing a team label label Apr 8, 2021

alvarezmelissa87 mentioned this issue Apr 8, 2021

[ML] Data Frame Analytics exploration: ensure training filters work as expected #96500

Merged

2 tasks

alvarezmelissa87 mentioned this issue Apr 12, 2021

[ML] Ensure ROC chart gets loaded correctly #96890

Merged

1 task

alvarezmelissa87 closed this as completed in #96890 Apr 13, 2021
