-
-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
JDK22: zLinux: Extended.OpenJDK regularly times out and aborts #852
Comments
Some data for comparison:
Ok, so maybe it's not that this platform+version are particularly slow. Hmm. @Haroon-Khel - What do you think. Should we just increase timeout to 15 hours and move on? |
Id recommend 20hrs |
iirc there was another issue in the infra repo documenting extended openjdk timeouts. I cant find it at the moment. But this information should be added to that issue before closing this issue |
We've got adoptium/infrastructure#2662 regarding general machine-specific issues. I'm not sure what that table comparing differnet platforms is showing us, but it doesn't seem to demonstrate the problem - was that data not from the slow runs we initially documented? Was there a reason for choosing jvm_compiler_? as the comparison? @smlambert Was there a reason to transfer this from the tests to pipelines repository? I didn't think anything in here could have an influence over this, or does this control the timeout values somewhere for the extended.openjdk jobs? The answer to that will potential answer my slack question on how configurable the default TIME_LIMIT options for the jobs are. |
Transferred here because it is the right place to set TIME_LIMIT parameter so that it doesn't become 'unset' when/if test jobs are regenerated. |
My last comment on here seems to have got lost. I thoght the jobs were created via Test_Job_Auto_Gen which runs aqatests -> testJobTemplate. Where could we make a change in ci-jenkins-pipelines that would affect the |
Since we automated the generation of new test jobs out of this pipeline code base when they do not exist (here), it is good to set TIME_LIMIT from this code base, so JDK23+ test jobs that are generated also get the new value for TIME_LIMIT. While there are several places where it could be done, a good spot to set this would be getCommonTestJobParams Adding a few lines of code after L187:
Alternatively, although a larger endeavour would be to add a mechanism to hold testArgs in the configuration files (much like there are buildArgs), but I would rather not go that route for 1 use case of 1 parameter. If later we see the need to be modifying many other test parameters, we could look at such an approach. |
Noting that this should also be applied for riscv64 where sanity.openjdk can take around 17 hours on some of the systems. |
Ok, I've put this together to resolve this issue as advised, and to cover Stewart's riscv case. Here's a test run for zlinux. Note: Both are queued, and will not exist until the previous job finishes. Update: |
@adamfarley It looks like you've done the PR for s390x but you've said it'll cover the riscv use case too. Have I missed something (quite possible - I'm reading this on my phone!) |
@sxa - It should cover the riscv case as well. if-then-elseif. |
Yeah LGTM - that didn't seem to show on my phone yesterday :-) |
Ok, the change is in and proven to work, plus a timeout extension PR here. |
Summary
Tests run very slow on this platform, and abort on most machines due to timeout.
Details
Apparently this is par for the course for this test set. Past timeouts occurred on:
And the sole pass happened on:
Next steps
I'm not seeing signs of a hang, just slow performance. Perhaps we could extend the timeout in this case?
The text was updated successfully, but these errors were encountered: