Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] ML Model stuck in deploying status #2808

Open
Zhangxunmt opened this issue Aug 6, 2024 · 0 comments
Open

[BUG] ML Model stuck in deploying status #2808

Zhangxunmt opened this issue Aug 6, 2024 · 0 comments
Assignees
Labels
bug Something isn't working

Comments

@Zhangxunmt
Copy link
Collaborator

What is the bug?
It's reported from several cases that a ML model stuck in the "DEPLOYING" status and never refresh/update the status, when the cluster is under 1)heavy load or 2) scale up and downs. We need to revisit this function for edge cases that may cause the system into this condition. https://github.com/opensearch-project/ml-commons/blob/main/plugin/src/main/java/org/opensearch/ml/cluster/MLSyncUpCron.java#L363.

Currently this issue hasn't been reported by any open source customers so it's not duplicated in any open source environment. Developers will need to duplicate this error and propose a fix.

How can one reproduce the bug?
Steps to reproduce the behavior:

  1. Go to '...'
  2. Click on '....'
  3. Scroll down to '....'
  4. See error

What is the expected behavior?
A clear and concise description of what you expected to happen.

What is your host/environment?

  • OS: [e.g. iOS]
  • Version [e.g. 22]
  • Plugins

Do you have any screenshots?
If applicable, add screenshots to help explain your problem.

Do you have any additional context?
Add any other context about the problem.

@Zhangxunmt Zhangxunmt added bug Something isn't working untriaged and removed untriaged labels Aug 6, 2024
@ylwu-amzn ylwu-amzn moved this to Backlog in ml-commons projects Aug 27, 2024
@dhrubo-os dhrubo-os moved this from Backlog to In Progress in ml-commons projects Oct 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: In Progress
Development

No branches or pull requests

2 participants