[Transform] can't delete transform, stop start after rolling upgrade / node role change #69260
Pinging @elastic/ml-core (Team:ML)
Retrospective

7.2 - 7.6
Originally, transforms ran on any node in a cluster. Although the transform task is lightweight (it only coordinates search and index requests), users wished to disable transforms on certain nodes: #52200. A solution was implemented in #52712, released in version 7.7.

7.7 - 7.9
This implementation allowed you to opt certain nodes out of acting as transform nodes. By default, starting from 7.7, transforms only executed on data nodes (which also prevented transforms from running on low-capacity nodes like coordinator nodes or dedicated specialized nodes like ml nodes). To opt out use:

7.9 - 7.last
Instead of e.g.
However, this change basically switches the logic from an
Deleting entries like

8.0 -
It is planned to remove the deprecated

Summary
By switching node roles from an opt-out (implicit) to an opt-in (explicit) model, it can happen that a user accidentally removes the transform role. However, as the old syntax is still supported at least for
allow stop transform to stop a transform task if it is waiting for assignment (e.g. if the cluster lacks a transform node), fixes #69260
Upstream issue: elastic/kibana#91570
Affected versions: 7.7.0 - 7.11.2
After a change to the cluster (this could be a rolling upgrade or a fine-tuning of settings), a formerly running transform reports that it is stopped, e.g.:
However, trying to delete it fails with a claim that it is still running:
Trying to delete it with `force` times out, and trying to start it fails as well. As a result, it is not possible to delete the transform or use it.
Mitigation:
Transform requires a transform node to run on. To verify whether you have a node that can run transforms, check the output of `GET _cat/nodes`:

The above output is not valid, because only data and master node roles are available. You must have at least 1 node that has a `t`, e.g.:

The above shows a data and transform node. Note that you only need 1 node with a `t`, i.e. a transform node.

Solution:
Add a transform node to your cluster, see https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-node.html
At least 1 node should specify the transform role:
Note: if you specify no roles, you automatically use all roles, see the docs for details.
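After adding a transform node, the role check described under Mitigation can be scripted. The following is a minimal sketch, not part of the original report: it assumes the text comes from `GET _cat/nodes?v&h=name,node.role` (header row present, node names without spaces), and the function name `transform_nodes` and both sample outputs are hypothetical.

```python
def transform_nodes(cat_nodes_text):
    """Return the names of nodes whose node.role column contains 't'.

    Assumes whitespace-separated columns with a header row that
    includes 'name' and 'node.role' (e.g. from ?v&h=name,node.role).
    """
    rows = [line.split() for line in cat_nodes_text.splitlines() if line.strip()]
    header, nodes = rows[0], rows[1:]
    name_idx = header.index("name")
    role_idx = header.index("node.role")
    # 't' in the role abbreviation string marks a transform node.
    return [node[name_idx] for node in nodes if "t" in node[role_idx]]


# Hypothetical sample: only data (d) and master (m) roles, no transform node.
NO_TRANSFORM = """\
name   node.role
node-1 dm
node-2 dm
"""

# Hypothetical sample: node-2 is a data and transform node.
WITH_TRANSFORM = """\
name   node.role
node-1 dm
node-2 dt
"""

if __name__ == "__main__":
    print(transform_nodes(NO_TRANSFORM))    # []
    print(transform_nodes(WITH_TRANSFORM))  # ['node-2']
```

An empty result corresponds to the situation in this issue: no node can take the transform task, so it stays unassigned.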
Fix:
The solution of this problem has 2 aspects: