[SPARK-41525][K8S] Improve `onNewSnapshots` to use unique lists of known executor IDs and PVC names #39070

dongjoon-hyun · 2022-12-15T07:24:36Z

What changes were proposed in this pull request?

This PR improves ExecutorPodsAllocator.onNewSnapshots by removing duplicate entries from k8sKnownExecIds and k8sKnownPVCNames. In a large cluster, these duplicates make the computation inefficient.

Why are the changes needed?

The existing variables contain many duplicates because snapshots is a Seq[ExecutorPodsSnapshot], and the same executor pod can appear in more than one snapshot.

val k8sKnownExecIds = snapshots.flatMap(_.executorPods.keys)

For example, printing the values gives output like the following.

22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 3
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 3
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 3
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 3
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 1
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 2
22/12/15 07:09:37 INFO ExecutorPodsAllocator: 3
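
The duplication can be reproduced without a Kubernetes cluster. The following is a minimal sketch, assuming a hypothetical FakeSnapshot case class that stands in for ExecutorPodsSnapshot and models only a map from executor ID to a pod name; the key type, the class, and the object name are illustrative assumptions, not Spark's API.

```scala
// Hypothetical stand-in for ExecutorPodsSnapshot: only the executorPods map
// is modeled, with a plain String in place of the real pod state.
final case class FakeSnapshot(executorPods: Map[Long, String])

object KnownExecIdsSketch {
  // Mirrors the original expression: one entry per (snapshot, executor) pair,
  // so an executor that appears in several snapshots is listed several times.
  def knownIds(snapshots: Seq[FakeSnapshot]): Seq[Long] =
    snapshots.flatMap(_.executorPods.keys)

  // Mirrors the patched expression: each executor ID is kept once.
  def knownIdsDistinct(snapshots: Seq[FakeSnapshot]): Seq[Long] =
    snapshots.flatMap(_.executorPods.keys).distinct

  def main(args: Array[String]): Unit = {
    val twoPods = FakeSnapshot(Map(1L -> "exec-1", 2L -> "exec-2"))
    val threePods = FakeSnapshot(Map(1L -> "exec-1", 2L -> "exec-2", 3L -> "exec-3"))
    // A short snapshot history in which executors 1 and 2 appear three times.
    val history = Seq(twoPods, twoPods, threePods)

    println(s"without distinct: ${knownIds(history).size} entries")
    println(s"with distinct: ${knownIdsDistinct(history).size} entries")
  }
}
```

With this history, the first expression yields seven entries and the second yields three, one per executor.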

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Manual review because this is an improvement on the local variable computation.

dongjoon-hyun · 2022-12-15T07:27:59Z

cc @viirya and @attilapiros

dongjoon-hyun · 2022-12-15T07:37:31Z

...netes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/ExecutorPodsAllocator.scala

@@ -152,7 +152,7 @@ class ExecutorPodsAllocator(
      applicationId: String,
      schedulerBackend: KubernetesClusterSchedulerBackend,
      snapshots: Seq[ExecutorPodsSnapshot]): Unit = {
-    val k8sKnownExecIds = snapshots.flatMap(_.executorPods.keys)
+    val k8sKnownExecIds = snapshots.flatMap(_.executorPods.keys).distinct


The original code is in Spark 3.1.2 and Spark 3.2.0.

So snapshots may contain duplicates too?

snapshots was originally defined as a sequence of snapshot history. Logically, a few snapshots can have the same contents, but in most cases they differ in object metadata or status.

Thanks for clarifying it.

dongjoon-hyun · 2022-12-15T07:38:10Z

...netes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/ExecutorPodsAllocator.scala

@@ -162,7 +162,7 @@ class ExecutorPodsAllocator(
    val k8sKnownPVCNames = snapshots.flatMap(_.executorPods.values.map(_.pod)).flatMap { pod =>
      pod.getSpec.getVolumes.asScala
        .flatMap { v => Option(v.getPersistentVolumeClaim).map(_.getClaimName) }
-    }
+    }.distinct


This code has been present since Spark 3.2.0.
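
The same pattern applies to the PVC names: every pod in every snapshot contributes its claim names, so a claim is listed once per snapshot unless the result is deduplicated. Below is a minimal sketch, assuming hypothetical FakeVolume and FakePod case classes in place of the fabric8 model classes used by the real code; a null claimName models a volume that is not backed by a PersistentVolumeClaim.

```scala
// Hypothetical stand-ins for the Kubernetes client model classes.
// claimName is null when the volume has no PersistentVolumeClaim.
final case class FakeVolume(name: String, claimName: String)
final case class FakePod(volumes: Seq[FakeVolume])

object KnownPvcNamesSketch {
  // pods holds every pod from every snapshot, so the same pod may repeat.
  def knownPvcNames(pods: Seq[FakePod]): Seq[String] =
    pods.flatMap { pod =>
      // Option(null) is None, so volumes without a claim are skipped.
      pod.volumes.flatMap(v => Option(v.claimName))
    }.distinct // keep the first occurrence of each claim name

  def main(args: Array[String]): Unit = {
    val pod1 = FakePod(Seq(FakeVolume("data", "pvc-1"), FakeVolume("tmp", null)))
    val pod2 = FakePod(Seq(FakeVolume("data", "pvc-2")))
    // pod1 appears twice, as it would across two snapshots.
    println(knownPvcNames(Seq(pod1, pod1, pod2)))
  }
}
```

Because distinct preserves the order of first occurrence, the result stays a Seq and downstream code that iterates over it does not need to change.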

dongjoon-hyun · 2022-12-15T07:45:23Z

Thank you, @viirya !

dongjoon-hyun · 2022-12-15T08:13:31Z

All tests passed except the master branch dependency test failure. Merged to master.

[SPARK-41525][K8S] Use unique list of known executor IDs and PVC names in onNewSnapshots

295cf3b

dongjoon-hyun changed the title ~~[SPARK-41525][K8S] Use unique list of known executor IDs and PVC names in onNewSnapshots~~ [SPARK-41525][K8S] Use unique list of known executor IDs and PVC names in onNewSnapshots Dec 15, 2022

github-actions bot added the KUBERNETES label Dec 15, 2022

dongjoon-hyun changed the title ~~[SPARK-41525][K8S] Use unique list of known executor IDs and PVC names in onNewSnapshots~~ [SPARK-41525][K8S] Improve onNewSnapshots to use unique list of known executor IDs and PVC names Dec 15, 2022

dongjoon-hyun changed the title ~~[SPARK-41525][K8S] Improve onNewSnapshots to use unique list of known executor IDs and PVC names~~ [SPARK-41525][K8S] Improve onNewSnapshots to use unique list of known executor IDs and PVC names Dec 15, 2022

dongjoon-hyun changed the title ~~[SPARK-41525][K8S] Improve onNewSnapshots to use unique list of known executor IDs and PVC names~~ [SPARK-41525][K8S] Improve onNewSnapshots to use unique lists of known executor IDs and PVC names Dec 15, 2022

viirya approved these changes Dec 15, 2022

dongjoon-hyun closed this in 8b2a2d1 Dec 15, 2022

dongjoon-hyun deleted the SPARK-41525 branch December 15, 2022 08:16

dcoliversun mentioned this pull request Jan 21, 2023

[SPARK-41781][K8S] Add the ability to create pvc before creating driver/executor pod #39306

Closed
