-
Notifications
You must be signed in to change notification settings - Fork 549
Pure K8S Alpha Release Plan #3382
Comments
Details about AAD in #3378. |
Job keep retry and cannot be stopped. |
@yqwang-ms can you take a look at the "job keeps retrying" issue? |
This is caused by someone else's test, not related to the code, the whole bed is down. |
All nodes becomes unknown, so controller and scheduler in statefulset is down. |
Seems kubelet are removed in all worker nodes. (probably caused by someone cleaned the k8s cluster) Even no kubelet history in 10.151.41.21 (15 bed worker)
|
Synced with Hanyu, seems it is caused by his paictl operations. |
15 is recovered. |
Alpha release for k8s based PAI
Code complete date: Nov. 12
Plan Items
Deferred
Finished
completing, retry pending
torunning, waiting
#3636 [Web Portal] Seperate 'waiting' and 'running' states on task role's statistics #3727 [Web Portal] display stopped task count in task role's header #3840Backlogs
The text was updated successfully, but these errors were encountered: