-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tolerate missing detached EC2 instances #10319
Tolerate missing detached EC2 instances #10319
Conversation
Welcome @hwoarang! |
Hi @hwoarang. Thanks for your PR. I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Sometimes we see the following error during a rolling update: I1125 18:12:46.467059 165 instancegroups.go:340] Draining the node: "ip-X-X-X-X.X.compute.internal". I1125 18:12:46.473365 165 instancegroups.go:359] deleting node "ip-X-X-X-X.X.compute.internal" from kubernetes I1125 18:12:46.476756 165 instancegroups.go:486] Stopping instance "i-XXXXXXXX", node "ip-X-X-X-X.X.compute.internal", in group "X" (this may take a while). E1125 18:12:46.523269 165 instancegroups.go:367] error deleting instance "i-XXXXXXXX", node "ip-X-X-X-X.X.compute.internal": error deleting instance "i-XXXXXXXX", node "ip-X-X-X-X.X.compute.internal": error deleting instance "i-XXXXXXXX": InvalidInstanceID.NotFound: The instance ID 'i-XXXXXXXXX' does not exist status code: 400, request id: 91238c21-1caf-41eb-91d7-534d4ca67ed0 It's possible that the EC2 instance to have disappeared by the time it was detached (it may have been a spot instance for example) In any case, we can't do much when we do not find an instance id, and throwing this error during the update is not very user friendly. As such, we can simply report and tolerate this problem instead of exiting with non-zero code. This is similar to how we handle missing EC2 when updating an IG[1] [1] kubernetes#594
a66e180
to
0eb114d
Compare
Thanks @hwoarang! |
Hey @hwoarang! So we contribute to the same project again :) Thanks /lgtm |
@olemarkus it's a small world :) |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: hakman, hwoarang The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
…-upstream-release-1.19 Automated cherry pick of #10319: Tolerate missing detached EC2 instances
…-upstream-release-1.18 Automated cherry pick of #10319: Tolerate missing detached EC2 instances
Sometimes we see the following error during a rolling update:
It's possible that the EC2 instance to have disappeared by the time it
was detached (it may have been a spot instance for example)
In any case, we can't do much when we do not find an instance id, and
throwing this error during the update is not very user friendly.
As such, we can simply report and tolerate this problem instead of
exiting with non-zero code. This is similar to how we handle missing
EC2 when updating an IG[1]
[1] #594