[ML] Changes to calendars or filters do not update autodetect if handled on non-master node #31803

dimitris-athanasiou · 2018-07-04T17:27:56Z

In a multi-node cluster, any changes to calendars or filters used by a running job will not update the autodetect process if the corresponding actions are run on a non-master node.

The reason for this is because those updates are submitted to the UpdateJobProcessNotifier which is only running on the master node. So, even though it exists on non-master nodes, it's not running and the updates fall through.

The text was updated successfully, but these errors were encountered:

elasticmachine · 2018-07-04T17:27:57Z

Pinging @elastic/ml-core

Job updates or changes to calendars or filters may result into updating the job process if it has been running. To preserve the order of updates, process updates are queued through the UpdateJobProcessNotifier which is only running on the master node. All actions performing such updates must run on the master node. However, the CRUD actions for calendars and filters are not master node actions. They have been submitting the updates to the UpdateJobProcessNotifier even though it might have not been running (given the action was run on a non-master node). When that happens, the update never reaches the process. This commit fixes this problem by ensuring the notifier runs on all nodes and by ensuring the process update action gets the resources again before updating the process (instead of having those resources passed in the request). This ensures that even if the order of the updates gets messed up, the latest update will read the latest state of those resource and the process will get back in sync. This leaves us with 2 types of updates: 1. updates to the job config should happen on the master node. This is because we cannot refetch the entire job and update it. We need to know the parts that have been changed. 2. updates to resources the job uses. Those can be handled on non-master nodes but they should be re-fetched by the update process action. Closes elastic#31803

Job updates or changes to calendars or filters may result into updating the job process if it has been running. To preserve the order of updates, process updates are queued through the UpdateJobProcessNotifier which is only running on the master node. All actions performing such updates must run on the master node. However, the CRUD actions for calendars and filters are not master node actions. They have been submitting the updates to the UpdateJobProcessNotifier even though it might have not been running (given the action was run on a non-master node). When that happens, the update never reaches the process. This commit fixes this problem by ensuring the notifier runs on all nodes and by ensuring the process update action gets the resources again before updating the process (instead of having those resources passed in the request). This ensures that even if the order of the updates gets messed up, the latest update will read the latest state of those resource and the process will get back in sync. This leaves us with 2 types of updates: 1. updates to the job config should happen on the master node. This is because we cannot refetch the entire job and update it. We need to know the parts that have been changed. 2. updates to resources the job uses. Those can be handled on non-master nodes but they should be re-fetched by the update process action. Closes #31803

dimitris-athanasiou added >bug :ml Machine learning labels Jul 4, 2018

dimitris-athanasiou self-assigned this Jul 4, 2018

dimitris-athanasiou mentioned this issue Jul 4, 2018

[ML] Fix calendar and filter updates from non-master nodes #31804

Merged

dimitris-athanasiou closed this as completed in #31804 Jul 5, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Changes to calendars or filters do not update autodetect if handled on non-master node #31803

[ML] Changes to calendars or filters do not update autodetect if handled on non-master node #31803

dimitris-athanasiou commented Jul 4, 2018

elasticmachine commented Jul 4, 2018

[ML] Changes to calendars or filters do not update autodetect if handled on non-master node #31803

[ML] Changes to calendars or filters do not update autodetect if handled on non-master node #31803

Comments

dimitris-athanasiou commented Jul 4, 2018

elasticmachine commented Jul 4, 2018