-
Notifications
You must be signed in to change notification settings - Fork 25k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ML] ML nodes autoscaling not down to 0 in stateful and serverless #114930
Comments
Pinging @elastic/ml-core (Team:ML) |
Regarding stateful, the response says:
So it looks like this Who consuming this result? I think the |
BTW, I don't fully understand:
If we need a total of 0 processors and 0 memory, why each node (of the 0) have 4 processors. |
I don't really know the serverless API, but
sounds like the autoscaler should scale down to 0 nodes. |
I have verified this in a stateful cluster.
github commit: https://github.com/elastic/elasticsearch/commits/979710150c133840ef0852edaa4aee02c144fdb2, which should include the fix. Unfortunately the problem is still there after hour wait. the decider now says:
|
Thanks, investigating... Have you also tried serverless? The autoscaling code for that is different. |
This should fix stateful: #115082 |
I can confirm serverless works. our serverless QA environment just had rollout yesterday night, the current commit is
https://github.com/elastic/elasticsearch/commits/d3fceaddefcc32c71321768d05f268bce2374634 it includes the fix I tried the below steps:
|
Reopen this, after the #115082, not scaling up event is broken (Classic stateful environment). After
model stats:
but
it is not right. because no ml nodes scaling up events will be triggered. |
Next fix: #115189 |
Environment
build
Step to reproduce
GET _ml/trained_models/elser-endpoint/_stats
Observed:
After hours wait, ml nodes autoscaling (down to 0) didnt happen
GET /_autoscaling/capacity/
returns:GET /_internal/serverless/autoscaling
returns:The text was updated successfully, but these errors were encountered: