
Activator endpoint probing can race with pre-stop hook #9355

Closed
julz opened this issue Sep 10, 2020 · 5 comments
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@julz
Member

julz commented Sep 10, 2020

(writing up some Slack conversations).

wordy version: the activator has logic to probe the queue-proxy (QP) when it sees a pod go into the notReady state; if the probe succeeds it treats the pod as ready regardless of its reported status. This is particularly good at startup because it means we can see the pod is ready and start forwarding traffic before etcd and the Kubernetes Rube Goldberg machine have caught up. However, during termination it is possible for the activator to see the pod go notReady before the QP has received the pre-stop signal (the endpoints controller and the kubelet each watch for the Terminating state independently, so the pre-stop signal is not guaranteed to reach the QP before the pod goes notReady in the activator's endpoints lister, see https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle/#container-probes). In that case we'll never actually consider the pod notReady (because we saw a good probe), even though it's now terminating and will at some point start failing the requests we forward to it.

tl;dr: activator probing of notReady pods should be resilient to the case where the QP hasn't yet received the pre-stop hook.

potential fix: the probing logic is only really needed at startup. When a pod goes from never-seen-before to notReady we should probe to keep cold start times as low as possible, but when it transitions ready->notReady we can't distinguish readiness blips from terminating pods, and should err on the side of avoiding the race by waiting for the k8s API to report it ready.
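To illustrate, here's a minimal sketch of that gating (hypothetical names, not the actual activator code; it assumes endpoints are keyed by pod IP and glosses over the bookkeeping concerns raised in the comments below):

```go
package activator

import "sync"

// probeGate sketches the proposed behaviour: only notReady endpoints we have
// never observed as ready get probed, so a ready->notReady transition (e.g. a
// terminating pod whose QP hasn't received the pre-stop hook yet) falls back
// to whatever the k8s API reports.
type probeGate struct {
	mu        sync.Mutex
	seenReady map[string]struct{} // keyed by endpoint IP (assumed)
}

func newProbeGate() *probeGate {
	return &probeGate{seenReady: map[string]struct{}{}}
}

// markReady records that an endpoint has been observed ready at least once.
func (g *probeGate) markReady(ip string) {
	g.mu.Lock()
	defer g.mu.Unlock()
	g.seenReady[ip] = struct{}{}
}

// shouldProbe reports whether a notReady endpoint is worth probing:
// true only for endpoints never seen ready (cold start), false otherwise.
func (g *probeGate) shouldProbe(ip string) bool {
	g.mu.Lock()
	defer g.mu.Unlock()
	_, wasReady := g.seenReady[ip]
	return !wasReady
}
```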

note: I found this while looking at a bug where, intermittently, sending a lot of curls in a row produces errors while things are scaling down. I haven't verified yet that this race is the problem there -- but it does seem like a problem :).

@julz julz added the kind/bug label Sep 10, 2020
@vagababov
Contributor

For the full picture -- we can skip re-probing those, but:

  1. there's always a race anyway -- you've just moved it a bit
  2. after the second iteration of the endpoints changing we already have no idea whether this is a new endpoint that isn't ready yet or one that was ready and became notReady (sure, we could try to bookkeep that, but think of activator restart/re-subsetting)

@markusthoemmes
Contributor

We had a bit of a discussion in Slack today, so here are more datapoints:

When a pod is deleted (i.e. scaled down) it isn't actually moved into the notReady addresses but rather removed from the endpoints completely, without that intermediate step (see https://github.com/kubernetes/kubernetes/blob/119c94214c8b11a9f585557bff49bef26faf88b1/pkg/controller/endpoint/endpoints_controller.go#L415-L418).
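To make the distinction concrete, here's a small illustrative helper (standard corev1.Endpoints types, not Knative code) showing the two address lists a lister handler sees; per the controller code linked above, a scaled-down pod simply vanishes from both rather than moving into NotReadyAddresses first:

```go
package activator

import corev1 "k8s.io/api/core/v1"

// readyAndNotReady splits an Endpoints object into the ready and notReady
// pod IPs an endpoints lister would observe.
func readyAndNotReady(ep *corev1.Endpoints) (ready, notReady []string) {
	for _, subset := range ep.Subsets {
		for _, addr := range subset.Addresses {
			ready = append(ready, addr.IP)
		}
		for _, addr := range subset.NotReadyAddresses {
			notReady = append(notReady, addr.IP)
		}
	}
	return ready, notReady
}
```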

With that in mind, is this even still a problem? There is somewhat of a race, sure, but since we wouldn't be re-probing and reconsidering such pods, I'd think the race is no better (or worse) than the race seen in any load balancer when removing a pod, and our shutdown grace period should absorb any errors here.

@vagababov
Contributor

Yeah, and there's always a race due to informers not being instantaneous anyway...

@julz
Member Author

julz commented Sep 14, 2020

yah, thinking about this there's a race, but given terminating pods exit the endpoints immediately without hitting notReady, I don't think it's nearly so important a race (and, selfishly, I don't think it's the race causing the problem I was initially investigating :D). I'm going to keep digging a little into this code path, but gonna close this for now since I don't think this would be a worthwhile change, given our current understanding.

/close

@knative-prow-robot
Contributor

@julz: Closing this issue.

In response to this:

yah, thinking about this there's a race, but given terminating pods exit the endpoints immediately without hitting notReady, I don't think it's nearly so important a race (and, selfishly, I don't think it's the race causing the problem I was initially investigating :D). I'm going to keep digging a little into this code path, but gonna close this for now since I don't think this would be a worthwhile change, given our current understanding.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
