Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-3825] Log more detail when unrolling a block fails #2688

Closed
wants to merge 3 commits into from

Conversation

andrewor14
Copy link
Contributor

Before:

14/10/06 16:45:42 WARN CacheManager: Not enough space to cache partition rdd_0_2
in memory! Free memory is 481861527 bytes.

After:

14/10/07 11:08:24 WARN MemoryStore: Not enough space to cache rdd_2_0 in memory!
(computed 68.8 MB so far)
14/10/07 11:08:24 INFO MemoryStore: Memory use = 1088.0 B (blocks) + 445.1 MB
(scratch space shared across 8 thread(s)) = 445.1 MB. Storage limit = 459.5 MB.

@andrewor14
Copy link
Contributor Author

@pwendell @tdas

@SparkQA
Copy link

SparkQA commented Oct 7, 2014

QA tests have started for PR 2688 at commit 5638c49.

  • This patch merges cleanly.

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/21364/Test FAILed.

@andrewor14
Copy link
Contributor Author

retest this please

@SparkQA
Copy link

SparkQA commented Oct 7, 2014

QA tests have started for PR 2688 at commit 5638c49.

  • This patch merges cleanly.

@SparkQA
Copy link

SparkQA commented Oct 7, 2014

QA tests have finished for PR 2688 at commit 5638c49.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/21365/Test PASSed.

@SparkQA
Copy link

SparkQA commented Oct 7, 2014

QA tests have finished for PR 2688 at commit 5638c49.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/21366/Test PASSed.

@pwendell
Copy link
Contributor

pwendell commented Oct 7, 2014

This is a little wordy. Throughout Spark the log messages are intentionally terse. This matters a lot when you are trying to read through hundreds of lines of logs. Something like below cuts down on words a lot but has just as much information.

4/10/06 21:40:36 WARN MemoryStore: Gave up caching rdd_0_2 after 68.8 MB!
4/10/06 21:40:36 INFO MemoryStore: Memory use: 1088.0 B (blocks)
+ 445.1 MB (scratch space for 8 thread(s)) = XX. Storage limit is 459.5 MB.

@andrewor14
Copy link
Contributor Author

How about:

14/10/07 10:55:13 WARN MemoryStore: Not enough space to cache rdd_0_2 in memory!
(gave up after unrolling 38.7 MB)
14/10/07 10:55:13 INFO MemoryStore: Memory use = 1088.0 B (blocks) + 445.1 MB
(unrolling space shared across 8 thread(s)) = 445.1 MB. Storage limit = 459.5 MB.

@pwendell
Copy link
Contributor

pwendell commented Oct 7, 2014

I'd prefer to use "unrolling" less but don't really mind. LGTM with whatever you want.

@andrewor14
Copy link
Contributor Author

Ok, I updated it. Refer to the PR description for the latest state.

@SparkQA
Copy link

SparkQA commented Oct 7, 2014

QA tests have started for PR 2688 at commit 28e33d6.

  • This patch merges cleanly.

@SparkQA
Copy link

SparkQA commented Oct 7, 2014

QA tests have finished for PR 2688 at commit 28e33d6.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/21389/Test PASSed.

@andrewor14
Copy link
Contributor Author

Got a verbal LGTM from @pwendell. This is going into master and 1.1!

@asfgit asfgit closed this in 553737c Oct 7, 2014
asfgit pushed a commit that referenced this pull request Oct 7, 2014
Before:
```
14/10/06 16:45:42 WARN CacheManager: Not enough space to cache partition rdd_0_2
in memory! Free memory is 481861527 bytes.
```
After:
```
14/10/07 11:08:24 WARN MemoryStore: Not enough space to cache rdd_2_0 in memory!
(computed 68.8 MB so far)
14/10/07 11:08:24 INFO MemoryStore: Memory use = 1088.0 B (blocks) + 445.1 MB
(scratch space shared across 8 thread(s)) = 445.1 MB. Storage limit = 459.5 MB.
```

Author: Andrew Or <[email protected]>

Closes #2688 from andrewor14/cache-log-message and squashes the following commits:

28e33d6 [Andrew Or] Shy away from "unrolling"
5638c49 [Andrew Or] Grammar
39a0c28 [Andrew Or] Log more detail when unrolling a block fails

(cherry picked from commit 553737c)
Signed-off-by: Andrew Or <[email protected]>
@andrewor14 andrewor14 deleted the cache-log-message branch October 7, 2014 20:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants