Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Model is getting stuck in deploying state #2970

Open
gaurav7830 opened this issue Sep 18, 2024 · 1 comment
Open

[BUG] Model is getting stuck in deploying state #2970

gaurav7830 opened this issue Sep 18, 2024 · 1 comment
Labels
bug Something isn't working untriaged

Comments

@gaurav7830
Copy link

gaurav7830 commented Sep 18, 2024

What is the bug?
Model is getting stuck in deploying state while registering it on the cluster. We have seen cases where the model is not found on the few nodes.

Scenario

  1. Model stuck in DEPLOYING state.
  2. Call model undeploy api on the cluster returning the following response.
    "NodeId": {
        "stats": {
            "ModelId": "not_found"
        }
    },
    "NodeId": {
        "stats": {
            "ModelId": "not_found"
        }
    },
    "NodeId": {
        "stats": {
            "ModelId": "undeployed"
        }
    },
    "NodeId": {
        "stats": {
            "ModelId": "not_found"
        }
    },
    "NodeId": {
        "stats": {
            "ModelId": "undeployed"
        }
    },
    "NodeId": {
        "stats": {
            "ModelId": "not_found"
        }
    },
    "NodeId": {
        "stats": {
            "ModelId": "not_found"
        }
    },
    "NodeId": {
        "stats": {
            "ModelId": "undeployed"
        }
    },
    "NodeId": {
        "stats": {
            "ModelId": "not_found"
        }
    },
    "NodeId": {
        "stats": {
            "ModelId": "not_found"
        }
    }
}
  1. Called GetModel api and it returning model state as DEPLOYING.

What is the expected behavior?
Model should be undeployed.

@gaurav7830 gaurav7830 added bug Something isn't working untriaged labels Sep 18, 2024
@ylwu-amzn
Copy link
Collaborator

@Zhangxunmt I know you have some suggestion to enhance this part. Please help take a look.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working untriaged
Projects
None yet
Development

No branches or pull requests

3 participants