[SPARK-2524] missing document about spark.deploy.retainedDrivers #1443
Conversation
The configuration spark.deploy.retainedDrivers is undocumented but is actually used in the standalone Master: https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/deploy/master/Master.scala#L60
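For context, here is a minimal sketch of how a standalone-mode setting like this is read through SparkConf. This is illustrative only, not the exact Master.scala code (see the link above); the object and variable names are hypothetical, and the default of 200 is assumed to mirror spark.deploy.retainedApplications.

    // Illustrative sketch only; the real code lives in
    // org.apache.spark.deploy.master.Master (linked above).
    import org.apache.spark.SparkConf

    object RetainedConfigSketch {
      def main(args: Array[String]): Unit = {
        val conf = new SparkConf()
        // Assumed defaults of 200, matching the documented value for
        // spark.deploy.retainedApplications.
        val retainedApplications = conf.getInt("spark.deploy.retainedApplications", 200)
        val retainedDrivers = conf.getInt("spark.deploy.retainedDrivers", 200)
        println(s"retainedApplications=$retainedApplications, retainedDrivers=$retainedDrivers")
      }
    }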
QA tests have started for PR 1443. This patch merges cleanly.
QA results for PR 1443:
<td><code>spark.deploy.retainedApplications</code></td>
<td>200</td>
<td>
The number of completedApps to retain. If this cap is exceeded, then the oldest completedApps will be removed. <br/>
This language is specific to internal naming. It might be better to say something like this:
The maximum number of completed applications to display. Older applications will be dropped from the UI to maintain this limit.
And likewise below.
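To make the retention behavior being documented concrete, here is a rough sketch of cap-based retention: once the number of completed entries exceeds the configured limit, the oldest ones are dropped. This is not the Master's actual implementation; the class and method names are hypothetical.

    // Hypothetical sketch of cap-based retention for completed applications/drivers.
    import scala.collection.mutable.ArrayBuffer

    class CompletedLog[T](retained: Int) {
      private val completed = new ArrayBuffer[T]()

      def add(entry: T): Unit = {
        if (retained > 0 && completed.size >= retained) {
          // Drop the oldest entries so that, after adding, at most `retained` remain.
          completed.remove(0, completed.size - retained + 1)
        }
        completed += entry
      }

      def entries: Seq[T] = completed.toSeq
    }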
@pwendell thanks. I addressed your comments.
QA tests have started for PR 1443. This patch merges cleanly.
QA results for PR 1443:
Thanks - I can merge this.
https://issues.apache.org/jira/browse/SPARK-2524

The configuration on spark.deploy.retainedDrivers is undocumented but actually used
https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/deploy/master/Master.scala#L60

Author: lianhuiwang <[email protected]>
Author: Wang Lianhui <[email protected]>
Author: unknown <[email protected]>

Closes #1443 from lianhuiwang/SPARK-2524 and squashes the following commits:

64660fd [Wang Lianhui] address pwendell's comments
5f6bbb7 [Wang Lianhui] missing document about spark.deploy.retainedDrivers
44a3f50 [unknown] Merge remote-tracking branch 'upstream/master'
eacf933 [lianhuiwang] Merge remote-tracking branch 'upstream/master'
8bbfe76 [lianhuiwang] Merge remote-tracking branch 'upstream/master'
480ce94 [lianhuiwang] address aarondav comments
f2b5970 [lianhuiwang] bugfix worker DriverStateChanged state should match DriverState.FAILED

(cherry picked from commit 4da01e3)
Signed-off-by: Patrick Wendell <[email protected]>
…id unnecessary sort operations

### What changes were proposed in this pull request?

This pull request tries to normalize the SortOrder properly to prevent unnecessary sort operators. Currently the sameOrderExpressions are not normalized as part of AliasAwareOutputOrdering.

Example: consider this join of three tables:

    SELECT t2id, t3.id as t3id
    FROM (
      SELECT t1.id as t1id, t2.id as t2id
      FROM t1, t2
      WHERE t1.id = t2.id
    ) t12, t3
    WHERE t1id = t3.id

The plan for this looks like:

    *(8) Project [t2id#1059L, id#1004L AS t3id#1060L]
    +- *(8) SortMergeJoin [t2id#1059L], [id#1004L], Inner
       :- *(5) Sort [t2id#1059L ASC NULLS FIRST], false, 0   <-----------------------------
       :  +- *(5) Project [id#1000L AS t2id#1059L]
       :     +- *(5) SortMergeJoin [id#996L], [id#1000L], Inner
       :        :- *(2) Sort [id#996L ASC NULLS FIRST], false, 0
       :        :  +- Exchange hashpartitioning(id#996L, 5), true, [id=#1426]
       :        :     +- *(1) Range (0, 10, step=1, splits=2)
       :        +- *(4) Sort [id#1000L ASC NULLS FIRST], false, 0
       :           +- Exchange hashpartitioning(id#1000L, 5), true, [id=#1432]
       :              +- *(3) Range (0, 20, step=1, splits=2)
       +- *(7) Sort [id#1004L ASC NULLS FIRST], false, 0
          +- Exchange hashpartitioning(id#1004L, 5), true, [id=#1443]
             +- *(6) Range (0, 30, step=1, splits=2)

In this plan, the marked Sort node could have been avoided as the data is already sorted on "t2.id" by the lower SortMergeJoin.

### Why are the changes needed?

To remove unneeded Sort operators.

### Does this PR introduce any user-facing change?

No

### How was this patch tested?

New UT added.

Closes #30302 from prakharjain09/SPARK-33400-sortorder.

Authored-by: Prakhar Jain <[email protected]>
Signed-off-by: Wenchen Fan <[email protected]>
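A minimal, hypothetical reproduction of the example above (not the PR's test code). Assumptions: a local SparkSession, shuffle partitions set to 5 to match the hashpartitioning(..., 5) in the plan, and broadcast joins disabled so the joins are planned as SortMergeJoin; the temp views t1/t2/t3 mirror the Range operators in the plan.

    import org.apache.spark.sql.SparkSession

    object SortOrderRepro {
      def main(args: Array[String]): Unit = {
        val spark = SparkSession.builder()
          .master("local[2]")
          .appName("SPARK-33400 repro")
          .config("spark.sql.shuffle.partitions", "5")            // matches the plan's 5 partitions
          .config("spark.sql.autoBroadcastJoinThreshold", "-1")   // force SortMergeJoin
          .getOrCreate()

        // t1, t2, t3 mirror the Range operators in the plan above (2 splits each).
        spark.range(0, 10, 1, 2).createOrReplaceTempView("t1")
        spark.range(0, 20, 1, 2).createOrReplaceTempView("t2")
        spark.range(0, 30, 1, 2).createOrReplaceTempView("t3")

        spark.sql(
          """SELECT t2id, t3.id AS t3id
            |FROM (
            |  SELECT t1.id AS t1id, t2.id AS t2id
            |  FROM t1, t2
            |  WHERE t1.id = t2.id
            |) t12, t3
            |WHERE t1id = t3.id""".stripMargin).explain()

        spark.stop()
      }
    }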