-
Notifications
You must be signed in to change notification settings - Fork 28.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-6963][CORE]Flaky test: o.a.s.ContextCleanerSuite automatically cleanup checkpoint #5548
Conversation
Test build #30453 has finished for PR 5548 at commit
|
I'd like to start pushing back on PRs without explanation, here or in the JIRA. Please add a short explanation of why this is the right change and what the problem is. Otherwise I think we need to start rejecting these. |
@@ -245,7 +245,7 @@ class ContextCleanerSuite extends ContextCleanerSuiteBase { | |||
assert(fs.exists(RDDCheckpointData.rddCheckpointDataPath(sc, rddId).get)) | |||
|
|||
// Test that GC causes checkpoint data cleanup after dereferencing the RDD | |||
postGCTester = new CleanerTester(sc, Seq(rddId), Nil, Nil) | |||
postGCTester = new CleanerTester(sc, Seq(rddId), Nil, Nil, Seq(rddId)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@srowen This is just a test code bug. new CleanerTester(sc, Seq(rddId), Nil, Nil)
This code is only to ensure that the RDD is cleaned, but checkpoint. rdd and checkpoint almost simultaneously be cleaned, But there are exceptions, depending on the GC. The checkpointCleaned
ensure the checkpoint is cleaned.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How does this fix the assertion that failed? you add a new set of IDs that must now be counted down by a new callback, and you assert it ends up empty, which is good. But the assertion that failed was earlier:
assert(fs.exists(RDDCheckpointData.rddCheckpointDataPath(sc, rddId).get))
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The tips are more confusing.
The jira display the error in: ContextCleanerSuite.scala#L252.
The code in #L252 is assert(!fs.exists(RDDCheckpointData.rddCheckpointDataPath(sc, rddId).get))
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah right, it is the following one. Still, how does this affect whether the checkpoint files exist?
Test build #30476 has finished for PR 5548 at commit
|
cc @andrewor14