Move some lifecycle management from doTask -> shutdown for the mm-less task runner #14895

Merged · 9 commits · Aug 25, 2023

Conversation

georgew5656
Contributor

Description

The mm-less task runner differs in behavior from the other task runners that run on the overlord: it tries to handle the cleanup lifecycle on its own by immediately deleting the K8s job and clearing its tasks map as soon as it has finished running a task.

The HttpRemoteTaskRunner and RemoteTaskRunner don't do this. Instead, they rely on the TaskQueue's handlers for the futures returned by the task runners to call shutdown on a task once it has completed.
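
For reference, that handoff looks roughly like the sketch below (a simplified illustration of the future-callback pattern, not the actual TaskQueue code; notifyStatus here stands in for TaskQueue's real status handling):

    // Simplified sketch: the TaskQueue attaches a callback to the future returned by
    // TaskRunner.run() and only triggers cleanup (via notifyStatus -> shutdown) once
    // the task has completed.
    ListenableFuture<TaskStatus> statusFuture = taskRunner.run(task);
    Futures.addCallback(
        statusFuture,
        new FutureCallback<TaskStatus>()
        {
          @Override
          public void onSuccess(TaskStatus status)
          {
            // Persists the final status and location, then asks the runner to shut the task down.
            notifyStatus(task, status, "Task completed");
          }

          @Override
          public void onFailure(Throwable t)
          {
            notifyStatus(task, TaskStatus.failure(task.getId(), t.getMessage()), "Task failed");
          }
        },
        MoreExecutors.directExecutor()
    );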

Updating the mm-less task runner to use logic more similar to the other task runners has a couple of benefits:

  • The task location (including the k8sPodName) is successfully persisted to taskStorage in TaskQueue.notifyStatus. Currently the mm-less task runner reports no location in this call, because its run lifecycle will have already cleaned up the K8s Job and its tasks map.
  • Currently, when a task completes, the TaskQueue handler tries to call shutdown on the K8s task runner after the runner has already shut the task down. This produces a stream of "Ignoring request to cancel unknown task" logs and in general seems likely to cause unexpected behavior in the future. Changing the logic removes this duplication.

Changes

  • In KubernetesPeonLifecycle, stop calling shutdown (to delete the K8s job) after a job has finished.
  • In KubernetesTaskRunner.doTask, stop removing the taskId from tasks in the logic of the run future.
  • In KubernetesTaskRunner.shutdown, remove the taskId from tasks in addition to calling shutdown on the job. When the TaskQueue calls this shutdown function, everything associated with the task is cleaned up as expected (see the sketch after this list).
  • Remove the shutdownRequested flag from KubernetesWorkItem, since we can now treat the presence (or absence) of the taskId in the tasks map as an indicator of whether the task has been shut down.
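
Taken together, the shutdown path now looks roughly like the following (a hypothetical sketch of the consolidated logic, not the exact diff; names and log messages are simplified):

    @Override
    public void shutdown(String taskid, String reason)
    {
      log.info("Shutdown [%s] because [%s]", taskid, reason);

      // Removing the entry from the tasks map replaces the old shutdownRequested flag:
      // a missing entry now means the task has already been shut down.
      KubernetesWorkItem workItem;
      synchronized (tasks) {
        workItem = tasks.remove(taskid);
      }

      if (workItem == null) {
        log.info("Ignoring request to cancel unknown task [%s]", taskid);
        return;
      }

      // Delegates to the peon lifecycle, which deletes the underlying K8s job.
      workItem.shutdown();
    }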

Release note

Update mm-less task runner lifecycle logic to better match the logic in the HTTP and Zookeeper worker task runners.

Key changed/added classes in this PR
  • KubernetesPeonLifecycle
  • KubernetesTaskRunner
  • KubernetesWorkItem

This PR has:

  • been self-reviewed.
  • added documentation for new or modified features or behaviors.
  • a release note entry in the PR description.
  • added Javadocs for most classes and all non-trivial methods. Linked related entities via Javadoc links.
  • added or updated version, license, or notice information in licenses.yaml
  • added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
  • added unit tests or modified existing tests to cover new code paths, ensuring the threshold for code coverage is met.
  • added integration tests.
  • been tested in a test Druid cluster.


@kfaraz kfaraz left a comment


Looks good. Left a minor comment.

@georgew5656 , what would be the side effect of this if the TaskQueue is hypothetically slow in cleaning up finished tasks?

@@ -188,7 +186,7 @@ protected synchronized TaskStatus join(long timeout) throws IllegalStateExceptio
*/
protected void shutdown()
{
if (State.PENDING.equals(state.get()) || State.RUNNING.equals(state.get())) {
if (State.PENDING.equals(state.get()) || State.RUNNING.equals(state.get()) || State.STOPPED.equals(state.get())) {
Contributor


Why do we need to handle STOPPED state here? Wouldn't the job have already finished?

Contributor Author


This is saying that it's okay to shut down if the state is STOPPED.

@georgew5656
Contributor Author

Looks good. Left a minor comment.

@georgew5656 , what would be the side effect of this if the TaskQueue is hypothetically slow in cleaning up finished tasks?

Two things will hang around for a while:

  • The completed K8s job/pod. I don't think this is a huge issue since it's not actually consuming any resources (basically a key/value entry in etcd).
  • The entry in the tasks map. The API call for listing tasks will still return this value, and some of the task slot metrics will report the task as still running. New tasks will still be able to run, since the thread executing the future for the task will have completed.


@YongGang YongGang left a comment


Nice cleanup, left one minor comment

@@ -271,6 +262,10 @@ public void shutdown(String taskid, String reason)
return;
}

synchronized (tasks) {
Contributor


nit: the code can be simplified:

    KubernetesWorkItem workItem;
    synchronized (tasks) {
      workItem = tasks.remove(taskid);
    }
    if (workItem == null) {
      log.info("Ignoring request to cancel unknown task [%s]", taskid);
      return;
    }

@dclim dclim merged commit 95b0de6 into apache:master Aug 25, 2023
@LakshSingla LakshSingla added this to the 28.0 milestone Oct 12, 2023