-
Notifications
You must be signed in to change notification settings - Fork 138
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cluster restart model auto redeploy #1627
Changes from all commits
bc39114
9a50c61
a8c71f1
1b8a75c
459c438
226fae3
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -468,7 +468,8 @@ public Collection<Object> createComponents( | |
threadPool, | ||
nodeHelper, | ||
mlIndicesHandler, | ||
encryptor | ||
encryptor, | ||
mlModelAutoRedeployer | ||
); | ||
|
||
// TODO move this into MLFeatureEnabledSetting | ||
|
@@ -685,6 +686,7 @@ public List<Setting<?>> getSettings() { | |
MLCommonsSettings.ML_COMMONS_ENABLE_INHOUSE_PYTHON_MODEL, | ||
MLCommonsSettings.ML_COMMONS_MODEL_AUTO_REDEPLOY_ENABLE, | ||
MLCommonsSettings.ML_COMMONS_MODEL_AUTO_REDEPLOY_LIFETIME_RETRY_TIMES, | ||
MLCommonsSettings.ML_COMMONS_MODEL_AUTO_REDEPLOY_SUCCESS_RATIO, | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This setting is not used anywhere in this PR? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Didn't expose this setting when implementing the auto redeploy feature, so exposing it now to enable customer to change this ratio value. |
||
MLCommonsSettings.ML_COMMONS_ALLOW_MODEL_URL, | ||
MLCommonsSettings.ML_COMMONS_ALLOW_LOCAL_FILE_UPLOAD, | ||
MLCommonsSettings.ML_COMMONS_MODEL_ACCESS_CONTROL_ENABLED, | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we start auto reload immediately ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No, cluster manager ready doesn't mean cluster is ready, if we start auto reload immediately we can get a
blocked by: [SERVICE_UNAVAILABLE/1/state not recovered / initialized];
exception when querying needs reload models.