-
Notifications
You must be signed in to change notification settings - Fork 716
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training Operator V2 Installation - Certificate error #2404
Comments
Thank you for testing the Kubeflow Trainer V2 @Sharathmk99!
It looks like that the cert-rotator can't update the Secret. |
Thank you @andreyvelich for quick response. I made sure no secret was there in cluster,
And applied using,
I see below output,
I see new secret created,
Still I see some errors related to CA rotation,
When I try running,
I still get error,
In operator logs I see,
I tried to restart the operator and I don't see any error related to CA rotation,
Still same error,
I'm pretty sure something related to Certificate and not sure how to fix it. |
Do you have any specific configuration in your cluster ? |
What happened?
Tried to install training operator v2 using,
kustomize build overlays/standalone | k apply --server-side -f -
But I see few errors in operator logs,
When I run example mnist script I get below error,
What did you expect to happen?
Run example script
Environment
Kubernetes version:
Training Operator version:
$ kubectl get pods -n kubeflow -l control-plane=kubeflow-training-operator -o jsonpath="{.items[*].spec.containers[*].image}"
Training Operator Python SDK version:
Impacted by this bug?
Give it a 👍 We prioritize the issues with most 👍
The text was updated successfully, but these errors were encountered: