
[SPARK-23640][CORE] Fix hadoop config may override spark config #20785

Closed
wants to merge 4 commits

Conversation

@wangyum (Member) commented Mar 9, 2018

What changes were proposed in this pull request?

spark.shuffle.service.port may be read from the Hadoop Configuration created by

val hadoopConf = new Configuration()

Therefore, the client configuration spark.shuffle.service.port does not work unless it is set as spark.hadoop.spark.shuffle.service.port.

  • This configuration does not take effect:
bin/spark-sql --master yarn --conf spark.shuffle.service.port=7338
  • This configuration works:
bin/spark-sql --master yarn --conf spark.hadoop.spark.shuffle.service.port=7338

This PR fixes this issue.
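
For background, this is roughly how the spark.hadoop.* prefix ends up in a Hadoop Configuration (a simplified sketch of the copy Spark performs, with an illustrative helper name; not the exact code in SparkHadoopUtil):

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.spark.SparkConf

// Sketch only: every "spark.hadoop.foo" entry in SparkConf is copied into the
// Hadoop Configuration as "foo", so a lookup that goes through the Hadoop
// Configuration only sees the spark.hadoop.-prefixed form.
def copySparkHadoopConfigs(conf: SparkConf, hadoopConf: Configuration): Unit = {
  conf.getAll.foreach { case (key, value) =>
    if (key.startsWith("spark.hadoop.")) {
      hadoopConf.set(key.stripPrefix("spark.hadoop."), value)
    }
  }
}
```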

How was this patch tested?

It's difficult to carry out unit testing, but I've tested it manually.

@SparkQA commented Mar 9, 2018

Test build #88123 has finished for PR 20785 at commit 9745ec3.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@wangyum (Member, Author) commented Mar 9, 2018

retest this please

@vanzin (Contributor) commented Mar 9, 2018

The fix is not correct. If you want to change the order of precedence of these configs, you need to change Utils.getSparkOrYarnConfig.

Just to double check, this has always worked like this, right? I checked both 2.2 and 2.3 and both choose the YARN configuration over the Spark configuration when it's set.

@SparkQA commented Mar 9, 2018

Test build #88130 has finished for PR 20785 at commit 9745ec3.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA commented Mar 10, 2018

Test build #88152 has finished for PR 20785 at commit 0034a58.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@wangyum (Member, Author) commented Mar 10, 2018

retest this please

@SparkQA commented Mar 11, 2018

Test build #88155 has finished for PR 20785 at commit 0034a58.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@jerryshao (Contributor) commented:

I think if you're running on yarn, semantically spark.shuffle.service.port is a yarn configuration specified in yarn-site.xml. So it seems correct from a semantic point of view.

@wangyum (Member, Author) commented Mar 12, 2018

You are right.
In fact, our cluster has two shuffle services, one for production and one for development. We configure spark.shuffle.service.port to decide which shuffle service to use.

@@ -2434,7 +2434,8 @@ private[spark] object Utils extends Logging {
*/
def getSparkOrYarnConfig(conf: SparkConf, key: String, default: String): String = {
val sparkValue = conf.get(key, default)
if (conf.get(SparkLauncher.SPARK_MASTER, null) == "yarn") {
if (conf.get(SparkLauncher.SPARK_MASTER, null) == "yarn"
Review comment (Contributor):

No.

The logic you want here is the equivalent of:

if conf.contains(key)
  get spark conf
elif is_running_on_yarn()
  get conf from yarn
else
  return default
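
In Scala terms, a sketch of that logic applied to Utils.getSparkOrYarnConfig might look like this (assuming the imports already present in Utils.scala; an illustration, not necessarily the exact change that was merged):

```scala
def getSparkOrYarnConfig(conf: SparkConf, key: String, default: String): String = {
  if (conf.contains(key)) {
    // The explicitly set Spark configuration wins.
    conf.get(key, default)
  } else if (conf.get(SparkLauncher.SPARK_MASTER, null) == "yarn") {
    // On YARN, fall back to the Hadoop/YARN configuration.
    new YarnConfiguration(SparkHadoopUtil.get.newConfiguration(conf)).get(key, default)
  } else {
    default
  }
}
```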

Review comment (Member, Author):

Assuming that --conf spark.shuffle.service.port=7338 is configured, 7338 is displayed on the Environment tab, but 7337 is actually used.
So my idea is to get the value from SparkConf when the key starts with spark., except for keys under the spark.hadoop. prefix.

Review comment (Member, Author):

YarnConfiguration can only configure one spark.shuffle.service.port value.
If we get the spark.shuffle.service.port value from SparkConf, we can gradually upgrade the shuffle service, because we can set different values for different applications.

Review comment (Contributor):

I'm not sure I follow what you're saying, but let me explain how the configuration is expected to work.

"spark." options are set in "SparkConf". "spark.hadoop.*" options, on top of those, should also be reflected in any Hadoop Configuration objects that are created.

So you should never need to directly reference "spark.hadoop." properties in Spark code. They are not meant to be used by Spark, they are meant to be Hadoop configs. That's why I'm saying your code should not be doing what it is doing.

From what I understand of what you're trying to do, you want "spark.shuffle.service.port" to have precedence over the YARN configuration. For that, you just do what I suggested above. Check whether it's set in the Spark configuration before you even look at any Hadoop configuration.

The current order of precedence should be:

  • spark.hadoop.spark.shuffle.service.port (since it overrides Hadoop config)
  • hadoop config (spark.shuffle.service.port set in xml files)
  • spark.shuffle.service.port

You're proposing moving the lowest one to the top. That's a simple change. If you're trying to also fix something else, then it means there's a problem in another place.
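
A hypothetical usage illustrating that proposed precedence (not from the PR; Utils is private[spark], so this only compiles inside Spark's own packages):

```scala
import org.apache.spark.SparkConf
import org.apache.spark.util.Utils

val conf = new SparkConf()
  .setMaster("yarn")
  .set("spark.shuffle.service.port", "7338")

// With the proposed ordering, the explicit SparkConf value ("7338") is returned
// even if yarn-site.xml or spark.hadoop.spark.shuffle.service.port says 7337;
// previously the Hadoop/YARN value took precedence.
val port = Utils.getSparkOrYarnConfig(conf, "spark.shuffle.service.port", "7337")
```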

@SparkQA commented Mar 17, 2018

Test build #88331 has finished for PR 20785 at commit 06bb6f8.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@wangyum (Member, Author) commented Mar 29, 2018

Ping @vanzin

val sparkValue = conf.get(key, default)
if (conf.get(SparkLauncher.SPARK_MASTER, null) == "yarn") {
new YarnConfiguration(SparkHadoopUtil.get.newConfiguration(conf)).get(key, sparkValue)
if (conf.contains(key)) {
Review comment (Contributor):

The scaladoc above is now wrong, since it still refers to the old order of precedence. Otherwise looks ok.
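
For instance, the scaladoc could be reworded along these lines (a suggestion sketch, not the exact wording that landed):

```scala
/**
 * Return the value of the given config, preferring an explicitly set SparkConf value.
 * If the key is not set in SparkConf and the application runs on YARN, look it up in
 * the Hadoop/YARN configuration; otherwise return the supplied default.
 */
```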

@SparkQA commented Mar 30, 2018

Test build #88750 has finished for PR 20785 at commit a1eb874.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin (Contributor) commented Mar 30, 2018

Merging to master.

@asfgit asfgit closed this in ae91720 Mar 30, 2018
mshtelma pushed a commit to mshtelma/spark that referenced this pull request Apr 5, 2018
## What changes were proposed in this pull request?

`spark.shuffle.service.port` may be read from the Hadoop `Configuration` created at https://github.com/apache/spark/blob/9745ec3a61c99be59ef6a9d5eebd445e8af65b7a/core/src/main/scala/org/apache/spark/deploy/SparkHadoopUtil.scala#L459

Therefore, the client configuration `spark.shuffle.service.port` does not work unless it is set as `spark.hadoop.spark.shuffle.service.port`.

- This configuration does not take effect:
```
bin/spark-sql --master yarn --conf spark.shuffle.service.port=7338
```
- This configuration works:
```
bin/spark-sql --master yarn --conf spark.hadoop.spark.shuffle.service.port=7338
```

This PR fixes this issue.

## How was this patch tested?

It's difficult to carry out unit testing, but I've tested it manually.

Author: Yuming Wang <[email protected]>

Closes apache#20785 from wangyum/SPARK-23640.