-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
808 Checkpoint/Restart collection #809
Conversation
Codecov Report
@@ Coverage Diff @@
## develop #809 +/- ##
===========================================
+ Coverage 80.07% 80.41% +0.34%
===========================================
Files 343 347 +4
Lines 10724 10933 +209
===========================================
+ Hits 8587 8792 +205
- Misses 2137 2141 +4
|
bad9807
to
a4021ea
Compare
This generally looks good, and pretty straightforward. Something we may want to consider in |
This does make an assumption of the array being dense in its index space. Looking ahead a little bit, will that be a problem if we wanted to use this in NimbleSM? Does that even care about preserving a collection instance across a hypothetical checkpoint? |
As we look toward large-scale runs, we're going to have a lot of objects. We may want to create subdirectories to hold limited numbers of objects each. I was going to suggest per-index-component, but that falls over if one of the index components is simply process rank, since even that gets too big. Instead, maybe we should hash the indices, and use a few bytes from the hash to set subdirectories? |
I can fix that easily by just not failing if the file is not found during restart---assume it never existed. |
That seems a bit risky - How about each process recording the objects it knows it's the home process for, in an 'indices.$node' file? |
a4021ea
to
88e3e02
Compare
88e3e02
to
94ea92c
Compare
@PhilMiller I've addressed all your concerns and written a test for it! |
Clones added
============
- tests/unit/collection/test_checkpoint.cc 2
See the complete overview on Codacy |
All lgtm |
@PhilMiller Good for beta.9? |
Yeah, good for the stable branch
…On Fri, Jun 5, 2020, 17:00 Jonathan Lifflander ***@***.***> wrote:
@PhilMiller <https://github.com/PhilMiller> Good for beta.9?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#809 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAA64AD6HLYK6J7BXCPRZXDRVFMGPANCNFSM4M5R2W3A>
.
|
Fixes #808
Has a test, working.