-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
make crs bigger for new pars file for jump unit tests #1356
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #1356 +/- ##
=======================================
Coverage 78.57% 78.57%
=======================================
Files 117 117
Lines 7861 7861
=======================================
Hits 6177 6177
Misses 1684 1684
*This pull request uses carry forward flags. Click here to find out more. ☔ View full report in Codecov by Sentry. |
Do we think that we if we just add CRDS_CONTEXT to the list of parameters that this will go away? I think we don't want every time a new context comes along for us to have to update all of the regression test files? |
Do you mean define For the latter, It does appear that jwst ignores the crds context for fitsdiff comparisons of generated to truth files: |
I have no clue at the moment why one regtest failed with the following 🤷
|
This is the one where I need the second okify. The change to tweakwcs defaults led a different star to be included in the match, making for directly different fit results. Sorry, I can okay that tomorrow. |
I okified the wcs_fit_results issue. I was suggesting adding CRDS_CONTEXT=roman_0061.pmap, as in this run: https://github.com/spacetelescope/RegressionTests/actions/runs/10319437845 I am just skeptical in general that for the regression tests we can use whatever context is the latest; it seems to me like that guarantees that regtests will fail if they use the new context, and if they don't use the new context then having the latest context isn't buying us anything? |
Thanks! I reran the regtests and they all pass with this PR:
What do you think is a good forum for discussing pinning the
Once things have been established (reference file formats are finalized, etc) I would expect that a new context should only produce expected differences in some data arrays (so if a new dark is added and put in use the regtests that compare the output from the dark currant step will fail until the new results are oked). I believe these are expected differences that benefit users since presumably the new results are more accurate. This does make the regression tests serve at least 2 purposes (and can lead to confusion):
And the second one sort of randomly pops up whenever a new context happens to be published. Is there any other place where a new context is tested (by running the romancal regtests) before it's made "live"? As @nden mentioned this morning, some of this will change when there is a crds ops server for roman but that sounds like it's a ways off. |
roman.meta.ref_file.crds.context_used
to list of ignored paths for compare_asdf
I agree with you that the "2 purposes" are the problem. It's good to know that we are sensitive to changes to reference files, but in practice, at least I'm pretty removed from those updates and I'm not sure I want to be sensitive to every time a new reference file goes in. Were we to move in that direction we'd want to coordinate better with CRDS about the deliveries. I worry that at present it's going to be more like "tests are failing again, let's just update them, maybe CRDS changed," which would not be good. I could imagine another set of tests (like the Mac tests, or something) where we use the latest context and observe failures but which doesn't drive development. But those would have to live in a separate artifactory directory if we ever wanted to okify them. @ddavis-stsci , do you have a preference here? I think my sense is that as a temporary expedient let's go ahead and ignore context_used as you do here, but longer term I think I would lean towards our old model (fixing a context for the regression tests) and then changing from that later once we work out a new approach. We can take this up again at next week's romancal tag up. |
I would not ignore the CRDS context. We tie each release to a specific context and should test against that. Do the regression tests have a variety of purposes - yes. Should you okify tests because you "think" you have a crds change, No. If the regression tests fail we need to understand why. If the code or the reference files have changed and the new results are more accurate the running okify is a convenient way to update the files. If we do not understand why the results do not match the truth files the truth files should not be updated just the make them match. |
Okay, I take your message as saying that we should find a way to force the regression tests to use a specific context. Right now they are using the latest context, which brings in context changes that lead to regression test failures. I don't know how to force GA to do that in the short term, so I'd propose using this PR of Brett's to ignore the context in the short term, while planning to revert that change after we get the GA to always use a particular context. |
I'd suggest that we keep the crs changes and punt on the crds context issue. I think the crds context check needs more discussion. |
roman.meta.ref_file.crds.context_used
to list of ignored paths for compare_asdf
I dropped the crds context ignore. The regression tests will continue to fail due to the new crds context. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Make CRs bigger than the threshold used for the new pars files.
Fixes the unit test failures seen in:
#1355
due to the new pars file:
https://roman-crds.stsci.edu/browse/roman_wfi_pars-jumpstep_0001.asdf
Regression tests: https://github.com/spacetelescope/RegressionTests/actions/runs/10308257135
show only expected differences due to the new crds context:
Checklist
CHANGES.rst
under the corresponding subsection