Add Hercules support #1733
Change Hercules queue to windfall.
…of default_vars.sh.
revert to develop + add hercules
fix formatting mess-up
…ufs-weather-model into feature/add_hercules
err and out files from the hercules_cpld_control_p8_err.txt
based on efforts from @BijuThomas-NOAA and @christopherwharrop-noaa, it has been identified that
some updates: non-coupled test case control_c48 compiles and runs successfully with the following changes:
FYI - I got a response from the Hercules team about installation of Ruby packages on other nodes. The team will take a look early next week. I don't anticipate any issues; I think they just forgot to follow up with the final step of propagating the packages to other nodes after I had verified my testing of Rocoto was successful.
I got the same. Trying again now after Cameron's adjustment.
@ulmononian Can you fix the BM_IC-20220207 sym-link also?
done.
@ulmononian Thanks for fixing the input directories. I was able to run two tests and they passed against the baseline. I'm assuming that was created on Hercules yesterday, not copied over from Orion.
yes, as far as i am aware, those are the baselines created on hercules.
Yes, they were created on Hercules.
@ulmononian I realized that when I ran the cpld_control ORTs manually I used intel. I tried it again with gnu and received the below error. However, after following your recommendation (removing the line list(APPEND CDEPS_SHARE_DEFS "CPRGNU") in CDEPS-interface/CMakeLists.txt and CMEPS-interface/CMakeLists.txt, then re-adding it immediately below if(CMAKE_Fortran_COMPILER_ID MATCHES "GNU")), the tests pass. Can you update this PR to include the change? I think Fernando has had some regression tests fail with gnu on Hera as well, so this may resolve those too.
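For reference, a minimal sketch of what the relocated line might look like in CDEPS-interface/CMakeLists.txt and CMEPS-interface/CMakeLists.txt. This is an illustration only, not the actual diff: it assumes CDEPS_SHARE_DEFS is set up and consumed elsewhere in the file, and it omits whatever other GNU-specific settings already live in that block.

```cmake
# Sketch only: assumes the surrounding CMakeLists.txt already sets up
# CDEPS_SHARE_DEFS and consumes it later (e.g. as compile definitions).
if(CMAKE_Fortran_COMPILER_ID MATCHES "GNU")
  # Placed directly under the compiler-ID check so that CPRGNU is defined
  # for every GNU version, not only those matching a narrower condition.
  list(APPEND CDEPS_SHARE_DEFS "CPRGNU")
  # ... any existing GNU-specific settings would follow here (not shown) ...
endif()
```

With the append sitting at the top of the GNU branch, the flag no longer depends on any nested condition, which matches the intent of the follow-up commits below.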
move CPRGNU CPP flag to apply to all gnu versions (fixes hera failures)
apply CPRGNU cpp flag to all gnu versions to fix hera issue
Looks like we're mostly done. I have Gaea running right now, since it was down yesterday. Should be finished soon.
Hey, @ulmononian. Can you update the PR template here .github/pull_request_template.md to add Hercules as a machine name? I can make the change as well if you're ok with it.
@ulmononian Thank you! We're finished with testing. If you can please resolve the two conversations above, then we can begin the final review/merge process.
done. thanks for testing, everyone!
Description
MSU Hercules was recently made available for NOAA R&D use. Though it shares a file system with Orion, its system specs and software stack are significantly different. Therefore, to enable running the UFS-WM on Hercules, several of the RT configs/scripts need to be updated. A new ufs_hercules.intel.lua file is also required.
UPDATE: rocoto has been installed under contrib; cron services are available on hercules-login-1.
rocoto is not yet available on Hercules (at least not as a system default module), though some software is still being installed. An inquiry has been sent as to whether there is a plan to install rocoto in the near-term. Thus, for initial testing, the RTs can just be run directly (without a workflow manager) or with ecflow.
The spack-stack/1.4.0 unified-env is already installed adjacent to the Orion spack-stack installations (/work/noaa/epic-ps/role-epic-ps/spack-stack/spack-stack-1.4.0-hercules/envs/unified-env-v2/install/), so testing on Hercules can parallel spack-stack testing in PR #1707.
Note that windfall is currently the only QOS available, so SBATCH settings will need to be adjusted once all QOS options are opened.
Top of commit queue on: TBD
Input data additions/changes
Anticipated changes to regression tests:
There are not currently standardized RTs for Hercules to compare against.
Subcomponents involved:
Combined with PR's (If Applicable):
Commit Queue Checklist:
Linked PR's and Issues:
#1707
Will close #1732
Testing Day Checklist:
Testing Log (for CM's):