[BUG] Local models failed to be auto redeployed. #3272

Open
jngz-es opened this issue Dec 12, 2024 · 3 comments
Assignees
Labels
bug Something isn't working

Comments

@jngz-es
Collaborator

jngz-es commented Dec 12, 2024

What is the bug?
When nodes are added to or removed from the cluster, local models sometimes fail to be auto-redeployed.

How can one reproduce the bug?
TBD

What is the expected behavior?
Local models should be auto-redeployed.

What is your host/environment?
OpenSearch 2.15

Do you have any screenshots?
N/A

Do you have any additional context?
N/A

@jngz-es added the bug and untriaged labels on Dec 12, 2024
@brianf-aws
Contributor

Hey Jing, I'm certain this bug is fixed by this merged PR: https://github.com/opensearch-project/ml-commons/pull/3241/files

But I think @rbhavna can say more about whether this issue can be closed with that PR.

@rbhavna
Collaborator

rbhavna commented Dec 16, 2024

@brianf-aws that PR is for a different bug, specific to 2.17, where the sync-up job was not working. This issue appears to have occurred on a 2.15 cluster, which needs further analysis.

@dhrubo-os
Collaborator

@jngz-es are you going to look into this?

Projects
Status: In Progress
Development

No branches or pull requests

4 participants