[feature request] Sidecar lifecycle control #658

miaojianwei · 2021-07-02T09:33:32Z

What would you like to be added:
I want to be able to configure the sidecar's lifecycle so that the sidecar stops running when the main container stops, without requiring the main container to provide an additional interface for status detection.

Why is this needed:
I add a sidecar container to the pod that runs the user's computing task (such as AI training); the sidecar provides some necessary services for smart cards. However, when the computing container stops, the sidecar keeps running, so the pod status stays Running. Users are not aware that the sidecar container exists, and they expect the pod status to be completed.
So I hope the sidecar container can stop itself when the computing container stops.

FillZpp · 2021-07-07T11:29:33Z

/unassign @jian-he
/assign @FillZpp

FillZpp · 2021-07-08T06:22:52Z

@miaojianwei Understood. For pods with restartPolicy Never or OnFailure, the sidecar container remains running even after the main container has stopped. We plan to solve this problem in future releases.
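To illustrate the condition described above, here is a minimal Python sketch; it is not Kruise code. It assumes a plain dict shaped like the Pod JSON returned by the Kubernetes API (`spec.restartPolicy`, `status.containerStatuses[].state.terminated.exitCode`). The function name `sidecars_should_stop` and the `main_containers` argument are hypothetical and exist only for this example.

```python
from typing import Iterable


def sidecars_should_stop(pod: dict, main_containers: Iterable[str]) -> bool:
    """Return True when every main container has exited for good.

    `pod` is a dict shaped like the Pod JSON from the Kubernetes API.
    `main_containers` (hypothetical input) names the non-sidecar containers.
    """
    names = list(main_containers)
    if not names:
        return False  # nothing to wait for, so make no decision

    policy = pod.get("spec", {}).get("restartPolicy", "Always")
    if policy == "Always":
        # kubelet restarts exited containers, so the pod never completes.
        return False

    statuses = {
        s["name"]: s
        for s in pod.get("status", {}).get("containerStatuses", [])
    }
    for name in names:
        terminated = statuses.get(name, {}).get("state", {}).get("terminated")
        if terminated is None:
            return False  # still running, waiting, or not reported yet
        if policy == "OnFailure" and terminated.get("exitCode", 1) != 0:
            return False  # kubelet will restart a failed container
    return True
```

With restartPolicy Never, a main container in the `terminated` state is enough; with OnFailure, only a zero exit code counts, because a failed container would be restarted.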

stale · 2021-10-06T07:07:47Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

FillZpp · 2021-10-08T02:53:37Z

/pinned

FillZpp · 2021-11-10T15:26:27Z

/unassign @FillZpp
/assign @veophi

trondhindenes · 2021-12-28T13:03:29Z

We have a similar but not identical need: We're trying to find a way to control the lifecycle of sidecar containers related to the "main" container during shutdown.
We need to guarantee that the sidecar never shuts down before the main container. This is problematic in cases where the sidecar is needed to perform some operation, and the main container takes a few seconds to finish off some semi-long-running task after having received the SIGTERM signal. If the sidecar is able to exit right away, the main container will not be able to gracefully finish its task before shutting down.

It would be very valuable to have something like an apps.kruise.io/container-termination-priority annotation to control the order in which containers are stopped, similar to apps.kruise.io/container-launch-priority. (Sidecar termination control is a well-known problem in Kubernetes, so if OpenKruise solved this, it would be very beneficial for the community.)

FillZpp · 2021-12-29T03:07:51Z

> it would be very valuable to have something like an apps.kruise.io/container-termination-priority annotation to control the order in which containers are stopped, similar to apps.kruise.io/container-launch-priority

Yeah, we know that. Unfortunately, there is no way to control the order of container termination other than the preStop hook, which has to be configured by users. That means you have to customize the preStop hook script of your sidecar container so that it keeps waiting until the process in the main container has exited (for example, by checking for a marker file in a shared volume), which makes sure the sidecar terminates after the main container.
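As a rough sketch of the waiting logic such a preStop hook could run (an illustration only, assuming the sidecar image ships Python and that the main container creates a marker file on a shared volume when it exits; the function name `wait_for_marker` and the path in the usage note are hypothetical):

```python
import os
import time


def wait_for_marker(path: str, timeout_s: float, poll_s: float = 0.5) -> bool:
    """Block until `path` exists or `timeout_s` elapses.

    Returns True if the marker file appeared, False on timeout.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        if os.path.exists(path):
            return True  # main container signalled that it has exited
        if time.monotonic() >= deadline:
            return False  # give up so the sidecar can still stop
        time.sleep(poll_s)
```

A preStop hook could call, for example, `wait_for_marker("/shared/main-exited", timeout_s=25)`. The timeout should stay below the pod's terminationGracePeriodSeconds, because the time spent in a preStop hook counts against the grace period.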

What's more, there is now a new Keystone containers KEP that aims to terminate pods based on the completion status of keystone (main/app/essential) containers. You can join its discussion, although I think it is unlikely to be accepted by the K8s community.

Hope this can help you.

trondhindenes · 2021-12-29T08:37:33Z

Thanks for the detailed explanation @FillZpp. My thinking was that it would be possible to use the same patch operation to remove a container from a running pod as (I'm assuming) OpenKruise uses to add containers in order, and then have an admission controller intercept pod delete commands to trigger the "ordered deletion". It might not be feasible though, so thanks again for the reply.

FillZpp · 2021-12-29T09:14:32Z

@trondhindenes Aha, I understand your idea, but it does not work like that. In fact, an admission webhook cannot intercept the container start/stop operations executed by kubelet; it can only intercept updates or deletions of pod objects.

So we have to control the container launch priority using a ConfigMap reference, which acts as a kind of 'hook' in the process; you can see the details in this doc.

But when you intercept a pod deletion request via an admission webhook, the pod is not terminating yet (deletionTimestamp is nil), and kubelet will start the container again if you trigger a container to stop at this point. Once the pod deletion request has passed, kubelet begins to stop the containers immediately, and there is no 'hook' here to affect the order.

Not sure if I made it clear.

trondhindenes · 2021-12-29T14:11:34Z

Thanks. I played a bit with the Kubernetes API today, and realized that patching a pod is a lot stricter than I'd previously thought. It seems like updating the image of a container is pretty much the only allowed mutating operation (as well as adding more containers to a running pod, of course).

Thanks again for the feedback, appreciate it!

furykerry · 2022-01-04T10:06:14Z

> We need to guarantee that the sidecar never shuts down before the main container. This is problematic in cases where the sidecar is needed to perform some operation, and the main container takes a few seconds to finish off some semi-long-running task after having received the SIGTERM signal. If the sidecar is able to exit right away, the main container will not be able to gracefully finish its task before shutting down.

K8s lacks native support for a container termination sequence, yet that does not mean it is impossible. It is just hacky. One solution that comes to mind is that OpenKruise could take over container lifecycle management such as the preStop probe, and inject a dummy preStop probe for kubelet (sleep N seconds). One problem with this solution is that when you inspect the pod YAML, the preStop probe will not be replaced, which may be confusing for some people.
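The injection half of this idea can be sketched in a few lines of Python. This is only an illustration, not how OpenKruise works: it operates on a plain dict shaped like the Pod JSON, uses the `lifecycle.preStop.exec.command` field from the Pod spec, and assumes the sidecar image contains a `sleep` binary. The function name `inject_prestop_sleep` and its parameters are hypothetical.

```python
import copy


def inject_prestop_sleep(pod: dict, sidecar_names, sleep_seconds: int) -> dict:
    """Return a copy of `pod` whose listed sidecars get a `sleep` preStop hook.

    Containers that already define a preStop hook are left untouched.
    """
    patched = copy.deepcopy(pod)  # never mutate the caller's object
    for container in patched.get("spec", {}).get("containers", []):
        if container.get("name") not in sidecar_names:
            continue  # only sidecars are delayed
        lifecycle = container.setdefault("lifecycle", {})
        if "preStop" in lifecycle:
            continue  # respect a user-defined hook
        lifecycle["preStop"] = {
            "exec": {"command": ["sleep", str(sleep_seconds)]}
        }
    return patched
```

A fixed sleep only delays the sidecar; it does not observe whether the main container has actually exited, and the delay still has to fit within terminationGracePeriodSeconds.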

trondhindenes · 2022-01-04T11:23:43Z

Thanks! We ended up with a custom mutating webhook that does what we need, and like you say we're injecting a prestop hook to coordinate shutdown between containers in the pod - it works surprisingly well (https://github.com/trondhindenes/daps)

stale · 2022-04-04T12:10:24Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

FillZpp · 2022-04-05T02:11:44Z

/pinned

stale · 2022-07-04T06:53:24Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

FillZpp · 2022-07-12T02:57:21Z

/reopen

kruise-bot · 2022-07-12T02:57:24Z

@FillZpp: Reopened this issue.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

stale · 2022-10-10T03:31:57Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

miaojianwei added the kind/feature-request label Jul 2, 2021

miaojianwei assigned jian-he Jul 2, 2021

miaojianwei changed the title ~~[feature request] sidecar lifecycle control~~ [feature request] Sidecar lifecycle control Jul 2, 2021

kruise-bot assigned FillZpp and unassigned jian-he Jul 7, 2021

stale bot added the wontfix label Oct 6, 2021

stale bot removed the wontfix label Oct 8, 2021

kruise-bot assigned veophi and unassigned FillZpp Nov 10, 2021

stale bot added the wontfix label Apr 4, 2022

stale bot removed the wontfix label Apr 5, 2022

stale bot added the wontfix label Jul 4, 2022

stale bot closed this as completed Jul 11, 2022

kruise-bot reopened this Jul 12, 2022

stale bot removed the wontfix label Jul 12, 2022

stale bot added the wontfix label Oct 10, 2022

stale bot closed this as completed Oct 17, 2022
