[SPARK-13916][SQL] Add a metric to WholeStageCodegen to measure duration. #11741

nongli · 2016-03-15T20:17:21Z

What changes were proposed in this pull request?

WholeStageCodegen naturally breaks the execution into pipelines that are easier to
measure duration. This is more granular than the task timings (a task can be multiple
pipelines) and is integrated with the web ui.

We currently report total time (across all tasks), min/mask/median to get a sense of how long each is taking.

How was this patch tested?

Manually tested looking at the web ui.

…ion. WholeStageCodegen naturally breaks the execution into pipelines that are easier to measure duration. This is more granular than the task timings (a task can be multiple pipelines) and is integrated with the web ui.

SparkQA · 2016-03-15T20:24:08Z

Test build #53220 has finished for PR 11741 at commit 76958d8.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-03-15T22:30:10Z

Test build #53222 has finished for PR 11741 at commit 28a85f6.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-03-17T22:03:47Z

Test build #53465 has finished for PR 11741 at commit 81c6a47.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-03-20T19:59:37Z

sql/core/src/main/scala/org/apache/spark/sql/execution/BufferedRowIterator.java

@@ -34,6 +34,7 @@
  protected LinkedList<InternalRow> currentRows = new LinkedList<>();
  // used when there is no column in output
  protected UnsafeRow unsafeRow = new UnsafeRow(0);
+  private long startTimeMs = System.currentTimeMillis();


we should use nanoTime. See https://github.com/databricks/scala-style-guide#misc_currentTimeMillis_vs_nanoTime

SparkQA · 2016-03-21T23:39:15Z

Test build #53716 has finished for PR 11741 at commit 1435d2a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-03-21T23:46:16Z

LGTM

SparkQA · 2016-03-21T23:53:05Z

Test build #53717 has finished for PR 11741 at commit b02e189.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-03-21T23:56:24Z

Merging in master. Thanks.

…ion. ## What changes were proposed in this pull request? WholeStageCodegen naturally breaks the execution into pipelines that are easier to measure duration. This is more granular than the task timings (a task can be multiple pipelines) and is integrated with the web ui. We currently report total time (across all tasks), min/mask/median to get a sense of how long each is taking. ## How was this patch tested? Manually tested looking at the web ui. Author: Nong Li <[email protected]> Closes apache#11741 from nongli/spark-13916.

Import order.

28a85f6

Fix tests.

81c6a47

rxin reviewed Mar 20, 2016
View reviewed changes

nongli added 2 commits March 21, 2016 15:05

CR

1435d2a

Fix division.

b02e189

asfgit closed this in 5e86e92 Mar 21, 2016

nongli deleted the spark-13916 branch March 22, 2016 20:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-13916][SQL] Add a metric to WholeStageCodegen to measure duration. #11741

[SPARK-13916][SQL] Add a metric to WholeStageCodegen to measure duration. #11741

nongli commented Mar 15, 2016

SparkQA commented Mar 15, 2016

SparkQA commented Mar 15, 2016

SparkQA commented Mar 17, 2016

rxin Mar 20, 2016

SparkQA commented Mar 21, 2016

rxin commented Mar 21, 2016

SparkQA commented Mar 21, 2016

rxin commented Mar 21, 2016

[SPARK-13916][SQL] Add a metric to WholeStageCodegen to measure duration. #11741

[SPARK-13916][SQL] Add a metric to WholeStageCodegen to measure duration. #11741

Conversation

nongli commented Mar 15, 2016

What changes were proposed in this pull request?

How was this patch tested?

SparkQA commented Mar 15, 2016

SparkQA commented Mar 15, 2016

SparkQA commented Mar 17, 2016

rxin Mar 20, 2016

Choose a reason for hiding this comment

SparkQA commented Mar 21, 2016

rxin commented Mar 21, 2016

SparkQA commented Mar 21, 2016

rxin commented Mar 21, 2016