-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Archive the 4 seasons runs to HPSS #2131
Comments
Scratch and HPSS locations for each season: DYAMOND2:/gpfs/alpine/cli115/proj-shared/donahue/e3sm_scratch/ne1024pg2_ne1024pg2.F2010-SCREAMv1-DYAMOND2.20221220_dyamond2.24ab0b8bdbdebccd2fc717e55d3f8a8a16cebfff DYAMOND1:/gpfs/alpine/cli115/proj-shared/donahue/e3sm_scratch/ne1024pg2_ne1024pg2.F2010-SCREAMv1-DYAMOND1.20221216_dyamond1.081538b5e95a5ae54533e533176fd29c2a1b98ab Oct 1:/gpfs/alpine/cli115/proj-shared/donahue/e3sm_scratch/ne1024pg2_ne1024pg2.F2010-SCREAMv1.20221014_production_run.27604ccf3f1aaa88ea3413b774ef3817cad7343a/ Apr 1:/gpfs/alpine/cli115/proj-shared/donahue/e3sm_scratch/ne1024pg2_ne1024pg2.F2010-SCREAMv1.20221208_April_2013.ea484ef5161699ffd0192e23972d6858e5cd9be0 |
Issues with zstash: |
This ended up not really being a git issue, but rather a scratch pad for me to figure out what to do. I would like feedback on whether my strategy seems sound from @golaz , @AaronDonahue , and @crterai just to make sure we don't discover 3 months from now that we're missing some important datasets... |
This has been done and is documented in the 4-Seasons paper and on e3sm-docs |
My current strategy for archiving 4 seasons runs output isn't working. I've spent months trying to use zstash to upload our data to HPSS. In contrast, I was able to move all of our output data to NERSC (for each season) with ~10 min of work and ~3 hrs of wait time. It's time to give up on my half-done zstash archives and try something new. The critical problem seems to be tarring the files. Verifying file integrity is also problematic.
Plan:
zstash create --hpss /hpss/prod/cli115/world-shared/EAMxx/V1FourSeasons/DYAMOND2/case_scripts case_scripts/ --exclude="cmake_macros/*"
zstash create --hpss /hpss/prod/cli115/world-shared/EAMxx/V1FourSeasons/DYAMOND2/run run/ --exclude="*.nc,core.*"
zstash check
the 2 tar files I've created? We don't care a ton about these files (I think) so maybe not? But they're so small that checking should be easy?Note that to check whether zstash actually grabbed the data desired, you need to use
zstash ls -l --hpss="/hpss/prod/cli115/world-shared/EAMxx/V1FourSeasons/DYAMOND2/run"
. A bunch of variants I tried made it look like the archive was empty.Does this plan seem workable? Am I missing any directories or files that we care about? Does not tarring things seem evil?
Globus transfers are still in flight, but otherwise I think I've finished this task.
The text was updated successfully, but these errors were encountered: