Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-2952] Enable logging actor messages at DEBUG level #1870

Closed
wants to merge 2 commits into from

Conversation

rxin
Copy link
Contributor

@rxin rxin commented Aug 10, 2014

Example messages:

14/08/09 21:37:01 DEBUG BlockManagerMasterActor: [actor] received message RegisterBlockManager(BlockManagerId(0, rxin-mbp, 58092, 0),278302556,Actor[akka.tcp://spark@rxin-mbp:58088/user/BlockManagerActor1#-63596539]) from Actor[akka.tcp://spark@rxin-mbp:58088/temp/$c]

14/08/09 21:37:01 DEBUG BlockManagerMasterActor: [actor] handled message (0.279 ms) RegisterBlockManager(BlockManagerId(0, rxin-mbp, 58092, 0),278302556,Actor[akka.tcp://spark@rxin-mbp:58088/user/BlockManagerActor1#-63596539]) from Actor[akka.tcp://spark@rxin-mbp:58088/temp/$c]

cc @mengxr @tdas @pwendell

@SparkQA
Copy link

SparkQA commented Aug 10, 2014

QA tests have started for PR 1870. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18268/consoleFull

@SparkQA
Copy link

SparkQA commented Aug 10, 2014

QA results for PR 1870:
- This patch FAILED unit tests.
- This patch merges cleanly
- This patch adds the following public classes (experimental):
class ClientActor extends Actor with ActorLogReceive with Logging {
class DriverActor(sparkProperties: Seq[(String, String)]) extends Actor with ActorLogReceive {
* class BlockManagerMasterActor extends Actor with ActorLogReceive with Logging {

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18268/consoleFull

@SparkQA
Copy link

SparkQA commented Aug 10, 2014

QA tests have started for PR 1870. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18270/consoleFull

@SparkQA
Copy link

SparkQA commented Aug 10, 2014

QA results for PR 1870:
- This patch FAILED unit tests.
- This patch merges cleanly
- This patch adds the following public classes (experimental):
class ClientActor extends Actor with ActorLogReceive with Logging {
class DriverActor(sparkProperties: Seq[(String, String)]) extends Actor with ActorLogReceive {
* class BlockManagerMasterActor extends Actor with ActorLogReceive with Logging {

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18270/consoleFull

@rxin
Copy link
Contributor Author

rxin commented Aug 10, 2014

@andrewor14 is the SparkSubmit test flaky? I've seen it fail a few times already.

[info] - launch simple application with spark-submit *** FAILED ***
[info]   org.apache.spark.SparkException: Process List(./bin/spark-submit, --class, org.apache.spark.deploy.SimpleApplicationTest, --name, testApp, --master, local, file:/tmp/1407649354658-0/testJar-1407649354658.jar) exited with code 1
[info]   at org.apache.spark.util.Utils$.executeAndGetOutput(Utils.scala:795)
[info]   at org.apache.spark.deploy.SparkSubmitSuite.runSparkSubmit(SparkSubmitSuite.scala:311)
[info]   at org.apache.spark.deploy.SparkSubmitSuite$$anonfun$14.apply$mcV$sp(SparkSubmitSuite.scala:291)
[info]   at org.apache.spark.deploy.SparkSubmitSuite$$anonfun$14.apply(SparkSubmitSuite.scala:284)
[info]   at org.apache.spark.deploy.SparkSubmitSuite$$anonfun$14.apply(SparkSubmitSuite.scala:284)
[info]   at org.scalatest.Transformer$$anonfun$apply$1.apply(Transformer.scala:22)
[info]   at org.scalatest.Transformer$$anonfun$apply$1.apply(Transformer.scala:22)
[info]   at org.scalatest.OutcomeOf$class.outcomeOf(OutcomeOf.scala:85)
[info]   at org.scalatest.OutcomeOf$.outcomeOf(OutcomeOf.scala:104)
[info]   at org.scalatest.Transformer.apply(Transformer.scala:22)
[info]   ...
[info] - spark submit includes jars passed in through --jar *** FAILED ***
[info]   org.apache.spark.SparkException: Process List(./bin/spark-submit, --class, org.apache.spark.deploy.JarCreationTest, --name, testApp, --master, local-cluster[2,1,512], --jars, file:/tmp/1407649358023-0/testJar-1407649358532.jar,file:/tmp/1407649358533-0/testJar-1407649358635.jar, file:/tmp/1407649358022-0/testJar-1407649358023.jar) exited with code 1
[info]   at org.apache.spark.util.Utils$.executeAndGetOutput(Utils.scala:795)
[info]   at org.apache.spark.deploy.SparkSubmitSuite.runSparkSubmit(SparkSubmitSuite.scala:311)
[info]   at org.apache.spark.deploy.SparkSubmitSuite$$anonfun$15.apply$mcV$sp(SparkSubmitSuite.scala:305)
[info]   at org.apache.spark.deploy.SparkSubmitSuite$$anonfun$15.apply(SparkSubmitSuite.scala:294)
[info]   at org.apache.spark.deploy.SparkSubmitSuite$$anonfun$15.apply(SparkSubmitSuite.scala:294)
[info]   at org.scalatest.Transformer$$anonfun$apply$1.apply(Transformer.scala:22)
[info]   at org.scalatest.Transformer$$anonfun$apply$1.apply(Transformer.scala:22)
[info]   at org.scalatest.OutcomeOf$class.outcomeOf(OutcomeOf.scala:85)
[info]   at org.scalatest.OutcomeOf$.outcomeOf(OutcomeOf.scala:104)
[info]   at org.scalatest.Transformer.apply(Transformer.scala:22)
[info]   ...

@rxin
Copy link
Contributor Author

rxin commented Aug 10, 2014

Jenkins, retest this please.

asfgit pushed a commit that referenced this pull request Aug 10, 2014
This helps us log UpdateBlockInfo properly once #1870 is merged.

Author: Reynold Xin <[email protected]>

Closes #1872 from rxin/UpdateBlockInfo and squashes the following commits:

0cee1c2 [Reynold Xin] Turn UpdateBlockInfo into case class.

(cherry picked from commit 482c5af)
Signed-off-by: Reynold Xin <[email protected]>
@andrewor14
Copy link
Contributor

Yeah that and the DriverSuite. Not sure what the reason is yet, but I noticed that it started happening after #1777 went in...

@pwendell
Copy link
Contributor

LGTM - thanks Reynold!

@rxin
Copy link
Contributor Author

rxin commented Aug 11, 2014

Thanks. Merging in master & branch-1.1

asfgit pushed a commit that referenced this pull request Aug 11, 2014
Example messages:
```
14/08/09 21:37:01 DEBUG BlockManagerMasterActor: [actor] received message RegisterBlockManager(BlockManagerId(0, rxin-mbp, 58092, 0),278302556,Actor[akka.tcp://spark@rxin-mbp:58088/user/BlockManagerActor1#-63596539]) from Actor[akka.tcp://spark@rxin-mbp:58088/temp/$c]

14/08/09 21:37:01 DEBUG BlockManagerMasterActor: [actor] handled message (0.279 ms) RegisterBlockManager(BlockManagerId(0, rxin-mbp, 58092, 0),278302556,Actor[akka.tcp://spark@rxin-mbp:58088/user/BlockManagerActor1#-63596539]) from Actor[akka.tcp://spark@rxin-mbp:58088/temp/$c]
```

cc @mengxr @tdas @pwendell

Author: Reynold Xin <[email protected]>

Closes #1870 from rxin/actorLogging and squashes the following commits:

c531ee5 [Reynold Xin] Added license header for ActorLogReceive.
f6b1ebe [Reynold Xin] [SPARK-2952] Enable logging actor messages at DEBUG level

(cherry picked from commit 3733866)
Signed-off-by: Reynold Xin <[email protected]>
@asfgit asfgit closed this in 3733866 Aug 11, 2014
@mridulm
Copy link
Contributor

mridulm commented Aug 11, 2014

Just saw this as part of the close, sorry for the late comment.

Also, some of the INFO messages which are useful have now become DEBUG ? Makes it slightly harder to pinpoint an issue (ex: "Stopping BlockManagerMaster").
Spark logs are already hitting 60gig for some of our jobs in DEBUG mode !

Btw, if we are logging all messages, would it not be better to use TRACE instead of DEBUG ?

@rxin
Copy link
Contributor Author

rxin commented Aug 11, 2014

I thought about that in this PR actually. Since in log4j you can configure what you want, it seems pretty easy to just enable debug logging for a specific package.

@mridulm
Copy link
Contributor

mridulm commented Aug 11, 2014

Unfortunately, in most cases, we wont know what the issue is other than bug hunting in the logs.
So debug logging gets enabled for a wide swathe of packages.

@rxin rxin deleted the actorLogging branch August 11, 2014 23:12
@rxin
Copy link
Contributor Author

rxin commented Aug 12, 2014

Yea I'm fine turning that into trace. Even better, if we can configure what level to log those on, that'd be great. Do you want to submit a pull request?

xiliu82 pushed a commit to xiliu82/spark that referenced this pull request Sep 4, 2014
This helps us log UpdateBlockInfo properly once apache#1870 is merged.

Author: Reynold Xin <[email protected]>

Closes apache#1872 from rxin/UpdateBlockInfo and squashes the following commits:

0cee1c2 [Reynold Xin] Turn UpdateBlockInfo into case class.
xiliu82 pushed a commit to xiliu82/spark that referenced this pull request Sep 4, 2014
Example messages:
```
14/08/09 21:37:01 DEBUG BlockManagerMasterActor: [actor] received message RegisterBlockManager(BlockManagerId(0, rxin-mbp, 58092, 0),278302556,Actor[akka.tcp://spark@rxin-mbp:58088/user/BlockManagerActor1#-63596539]) from Actor[akka.tcp://spark@rxin-mbp:58088/temp/$c]

14/08/09 21:37:01 DEBUG BlockManagerMasterActor: [actor] handled message (0.279 ms) RegisterBlockManager(BlockManagerId(0, rxin-mbp, 58092, 0),278302556,Actor[akka.tcp://spark@rxin-mbp:58088/user/BlockManagerActor1#-63596539]) from Actor[akka.tcp://spark@rxin-mbp:58088/temp/$c]
```

cc @mengxr @tdas @pwendell

Author: Reynold Xin <[email protected]>

Closes apache#1870 from rxin/actorLogging and squashes the following commits:

c531ee5 [Reynold Xin] Added license header for ActorLogReceive.
f6b1ebe [Reynold Xin] [SPARK-2952] Enable logging actor messages at DEBUG level
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants