-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Zombie process exception #165
Comments
Hi! Any luck solving this? I am in the same situation for the |
@rauldiaz this was the fix, but it wasn't properly propagated to the instances. The recent new releases would have solved all the issues, if they updated Can see also the ongoing conversation here. |
I am using |
Hi there, I wanted to report that after a night, the issue seemed to resolve itself, and everything was working fine. However, when I updated the endpoint, the same error occurred again. Is this a known issue that tends to happen shortly after deployment? Thanks! |
Describe the bug
Getting zombie process exception as already reported for the sagemaker-inference-toolkit
To reproduce
Using
763104351884.dkr.ecr.eu-central-1.amazonaws.com/pytorch-inference:2.2.0-gpu-py310-cu118-ubuntu20.04-sagemaker
and custom inference script in a batch-transform causes to trigger such error. Even a simple initialtime.sleep(60)
in the inference.py script can be used to trigger the error.A custom requirements.txt file also needs to be provided with custom inference script.
Here the full traceback:
System information
A description of your system. Please provide:
763104351884.dkr.ecr.eu-central-1.amazonaws.com/pytorch-inference:2.2.0-gpu-py310-cu118-ubuntu20.04-sagemaker
The text was updated successfully, but these errors were encountered: