-
Notifications
You must be signed in to change notification settings - Fork 28.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-13916][SQL] Add a metric to WholeStageCodegen to measure duration. #11741
Conversation
…ion. WholeStageCodegen naturally breaks the execution into pipelines that are easier to measure duration. This is more granular than the task timings (a task can be multiple pipelines) and is integrated with the web ui.
Test build #53220 has finished for PR 11741 at commit
|
Test build #53222 has finished for PR 11741 at commit
|
Test build #53465 has finished for PR 11741 at commit
|
@@ -34,6 +34,7 @@ | |||
protected LinkedList<InternalRow> currentRows = new LinkedList<>(); | |||
// used when there is no column in output | |||
protected UnsafeRow unsafeRow = new UnsafeRow(0); | |||
private long startTimeMs = System.currentTimeMillis(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we should use nanoTime. See https://github.com/databricks/scala-style-guide#misc_currentTimeMillis_vs_nanoTime
Test build #53716 has finished for PR 11741 at commit
|
LGTM |
Test build #53717 has finished for PR 11741 at commit
|
Merging in master. Thanks. |
…ion. ## What changes were proposed in this pull request? WholeStageCodegen naturally breaks the execution into pipelines that are easier to measure duration. This is more granular than the task timings (a task can be multiple pipelines) and is integrated with the web ui. We currently report total time (across all tasks), min/mask/median to get a sense of how long each is taking. ## How was this patch tested? Manually tested looking at the web ui. Author: Nong Li <[email protected]> Closes apache#11741 from nongli/spark-13916.
What changes were proposed in this pull request?
WholeStageCodegen naturally breaks the execution into pipelines that are easier to
measure duration. This is more granular than the task timings (a task can be multiple
pipelines) and is integrated with the web ui.
We currently report total time (across all tasks), min/mask/median to get a sense of how long each is taking.
How was this patch tested?
Manually tested looking at the web ui.