-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cannot change the CloudWatch log level #83
Comments
related: aws/sagemaker-inference-toolkit#70 |
I think the easiest way of implementing this would be allowing the customer to provide their own log4j config file through the dependencies arg here. The file should follow a naming convention, something like |
+1, this would be very useful |
Any news on this? This is very problematic for anything using PySpark (both training and inference), which outputs a lot of logs, and 99% are totally useless |
+1, any update? |
Seems, I'm experiencing the same issue: |
Describe the bug
The PyTorch SageMaker endpoint cloudwatch log level is INFO only which cannot be changed without creating a BYO container.
Hence all the access including /ping besides the /invocations are generating logs that clutters the cloudwarch log stream making it difficult to go directly to the errors for troubleshooting. In my understanding, this will incur the cloudwatch cost as well.
The AWS support case 7309023801 was opened and it was indicated the log level cannot be changed, or need to build our own container to control the log level.
To reproduce
Deploy a PyTorch Model where Python log level is set to logging.ERROR via SageMaker SDK and refer to the cloudwach log for /aws/sagemaker/Endpoints/<endpoint_name>.
Expected behavior
The log level configuration is reflected and only ERROR will be logged in the cloudwatch.
System information
SageMaker endpoint in us-east-1.
Toolkit version:
Not sure
Framework version:
PyTorch 1.4.0, 1.5.1
Python version:
Python 3.6
CPU or GPU:
GPU
Custom Docker image (Y/N):
N
Additional context
Endpoint startup message in the cloudwatch.
The text was updated successfully, but these errors were encountered: