
[SPARK-23637][YARN] Yarn might allocate more resources if the same executor is killed multiple times. #20781

Closed
jinxing64 wants to merge 3 commits into apache:master from jinxing64:SPARK-23637

Conversation

jinxing64

What changes were proposed in this pull request?

`YarnAllocator` uses `numExecutorsRunning` to track the number of running executors. `numExecutorsRunning` is used to check whether executors are missing and more need to be allocated.

In the current code, `numExecutorsRunning` can go negative when the driver asks to kill the same idle executor multiple times.
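
For illustration, a minimal, self-contained sketch of the failure mode (simplified, hypothetical names; not the actual `YarnAllocator` code): an unconditional decrement has no memory of which executors were already killed, so a duplicate kill request drives the counter below zero.

```scala
import java.util.concurrent.atomic.AtomicInteger

// Simplified stand-in for the counter-based accounting; names are illustrative only.
object CounterSketch {
  private val numExecutorsRunning = new AtomicInteger(0)

  // An unconditional decrement has no memory of which executors were already killed.
  def onExecutorKilled(executorId: String): Unit = {
    numExecutorsRunning.decrementAndGet()
  }

  def main(args: Array[String]): Unit = {
    numExecutorsRunning.set(1)            // one executor running
    onExecutorKilled("1")                 // first kill request
    onExecutorKilled("1")                 // duplicate kill request for the same executor
    println(numExecutorsRunning.get())    // prints -1: the allocator now thinks it is short an executor
  }
}
```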

How was this patch tested?

UT added

@SparkQA

SparkQA commented Mar 9, 2018

Test build #88116 has finished for PR 20781 at commit bd6f8a1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@jinxing64
Author

cc @vanzin @tgravescs @cloud-fan @djvulee
Could you please help review this?

@jerryshao
Contributor

jerryshao commented Mar 9, 2018

Does it happen only when dynamic allocation is enabled?

NVM.

> can be negative when the driver asks to kill the same idle executor multiple times.

Can you please describe how this happens?

@jinxing64
Author

jinxing64 commented Mar 9, 2018

@jerryshao Thanks for taking a look.

Yes, it does happen. We have jobs that have already finished all their tasks but are still holding 40~100 executors.

I'm not sure whether it also exists in the non-dynamic-allocation scenario.

@jerryshao
Contributor

This basically means that the driver sends the same kill request to the AM multiple times, right? I'm wondering how this would happen; should we also guard against this on the driver side?

@jinxing64
Author

jinxing64 commented Mar 9, 2018

@jerryshao
Thanks for the advice. I spent some time digging into why multiple kill requests are sent from the driver to the AM, but didn't figure out a way to reproduce it.

I did find that it's possible for `YarnAllocator` to process the same completed container multiple times (https://github.com/apache/spark/blob/master/resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocator.scala#L573; this log is printed multiple times for the same container), which can also make `numExecutorsRunning` negative. I made another change (see the last commit in this PR) to propose my idea: replace `numExecutorsRunning` with a set.
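
Roughly, the idea looks like the following sketch (hypothetical names, mirroring the `ConcurrentHashMap[String, Unit]` in the diff below, but not the actual patch): removing the same executor ID from a set twice is a no-op, so the tracked count can never go negative.

```scala
import java.util.concurrent.ConcurrentHashMap

// Illustrative stand-in for tracking running executors by ID instead of by counter.
object SetSketch {
  // Unit values: the map is used purely as a concurrent set of executor IDs.
  private val runningExecutors = new ConcurrentHashMap[String, Unit]()

  def onExecutorLaunched(executorId: String): Unit =
    runningExecutors.put(executorId, ())

  def onExecutorKilled(executorId: String): Unit =
    runningExecutors.remove(executorId)   // idempotent: a second remove changes nothing

  def main(args: Array[String]): Unit = {
    onExecutorLaunched("1")
    onExecutorKilled("1")
    onExecutorKilled("1")                 // duplicate kill is harmless
    println(runningExecutors.size())      // prints 0, never negative
  }
}
```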

@jinxing64
Author

Since the change to `YarnAllocator.killExecutor` is easy, do you think it's worth having this defense?
Thanks again for the review.

@SparkQA

SparkQA commented Mar 9, 2018

Test build #88127 has finished for PR 20781 at commit a177a63.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@@ -81,7 +81,7 @@ private[yarn] class YarnAllocator(
   private val releasedContainers = Collections.newSetFromMap[ContainerId](
     new ConcurrentHashMap[ContainerId, java.lang.Boolean])
 
-  private val numExecutorsRunning = new AtomicInteger(0)
+  private val runningExecutors = new java.util.concurrent.ConcurrentHashMap[String, Unit]()
Contributor


This can be changed to `Collections.newSetFromMap`, since we only need a `Set` instead of a `Map`.
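
For reference, a small sketch of that suggestion (illustrative names only), following the same pattern already used for `releasedContainers` above:

```scala
import java.util.Collections
import java.util.concurrent.ConcurrentHashMap

// A concurrent Set[String] backed by a ConcurrentHashMap, as suggested above.
object NewSetFromMapSketch {
  private val runningExecutors = Collections.newSetFromMap[String](
    new ConcurrentHashMap[String, java.lang.Boolean]())

  def main(args: Array[String]): Unit = {
    runningExecutors.add("1")
    runningExecutors.remove("1")
    runningExecutors.remove("1")          // removing again is a no-op
    println(runningExecutors.size())      // prints 0
  }
}
```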

@jerryshao
Contributor

I'm still not so sure about the root cause, but adding defensive code seems harmless.

@jinxing64
Author

jinxing64 commented Mar 12, 2018

@jerryshao
Thanks again for the review.
It does happen in my cluster that the same completed container is processed multiple times, which makes `numExecutorsRunning` negative. I think I've seen such an issue in another Spark JIRA, but I cannot find it now.

@SparkQA

SparkQA commented Mar 12, 2018

Test build #88178 has finished for PR 20781 at commit 049ed49.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin
Contributor

vanzin commented Mar 12, 2018

The change looks good, but did you look at why the code is trying to kill the same executor multiple times? That sounds like it could be a bug in the scheduler backend, which should be keeping track of these things.

@jinxing64
Author

@vanzin
Thanks for the review~

  1. I spent some time but didn't find the reason why the same executor is killed multiple times, and I couldn't reproduce it either.
  2. I found that the same completed container can be processed multiple times; it happens now and then. It seems YARN doesn't promise that a completed container is returned in only one response (https://github.com/apache/spark/blob/master/resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocator.scala#L268); see the sketch below.
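
A rough sketch of that defense (hypothetical names; not the actual patch): remember which completed containers have already been handled, so a container reported again in a later response is counted only once.

```scala
import java.util.Collections
import java.util.concurrent.ConcurrentHashMap
import java.util.concurrent.atomic.AtomicInteger

// Illustrative sketch: de-duplicate completed-container reports before updating counts.
object CompletedContainerSketch {
  private val numExecutorsRunning = new AtomicInteger(1)
  private val processedContainers = Collections.newSetFromMap[String](
    new ConcurrentHashMap[String, java.lang.Boolean]())

  def processCompleted(containerId: String): Unit = {
    if (processedContainers.add(containerId)) {   // add returns false if already seen
      numExecutorsRunning.decrementAndGet()       // decrement only on the first report
    }
  }

  def main(args: Array[String]): Unit = {
    processCompleted("container_01")
    processCompleted("container_01")              // duplicate report from YARN, ignored
    println(numExecutorsRunning.get())            // prints 0, not -1
  }
}
```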

@vanzin
Contributor

vanzin commented Apr 2, 2018

retest this please

@SparkQA

SparkQA commented Apr 2, 2018

Test build #88837 has finished for PR 20781 at commit 049ed49.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin
Contributor

vanzin commented Apr 4, 2018

Merging to master / 2.3.

asfgit pushed a commit that referenced this pull request Apr 4, 2018
asfgit closed this in d3bd043 Apr 4, 2018
mshtelma pushed a commit to mshtelma/spark that referenced this pull request Apr 5, 2018
robert3005 pushed a commit to palantir/spark that referenced this pull request Apr 7, 2018
@jinxing64
Author

@vanzin Thanks for merging.

peter-toth pushed a commit to peter-toth/spark that referenced this pull request Oct 6, 2018