sanity.functional cmdLineTester_pltest_0 j9vmem_test_numa (docker?) test failure #2143
Comments
The API code is https://github.ibm.com/runtimes/openj9-omr/blob/ibm_sdk/port/linux/omrvmem.c#L2081
The API code reads from /sys/devices/system/node, looking for
The API sets the state of each node based on the global memory policy and on whether the node has memory.
The test code is https://github.com/eclipse/openj9/blob/master/runtime/tests/port/j9vmemTest.c#L1781-L1810
It looks for nodes which have the PREFERRED or ALLOWED policy. If there are none, the test fails.
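
To make the description above concrete, here is a minimal Python sketch of the two steps: enumerating the `nodeN` directories under /sys/devices/system/node, and the pass/fail rule the test applies. It is an illustration only, not the OMR or OpenJ9 code. The function names `list_numa_nodes` and `has_usable_node` are hypothetical, and the policy names are plain strings standing in for whatever constants the port library actually uses.

```python
import os
import re

# Path named in the comment above; everything else here is illustrative.
NODE_DIR = "/sys/devices/system/node"
_NODE_RE = re.compile(r"^node(\d+)$")


def list_numa_nodes(root=NODE_DIR):
    """Return the sorted node numbers found as nodeN directories under root."""
    if not os.path.isdir(root):
        return []
    nodes = []
    for entry in os.listdir(root):
        match = _NODE_RE.match(entry)
        # Only directories named node<digits> count as NUMA nodes.
        if match and os.path.isdir(os.path.join(root, entry)):
            nodes.append(int(match.group(1)))
    return sorted(nodes)


def has_usable_node(policies):
    """Model the test's rule: some node must be PREFERRED or ALLOWED.

    `policies` maps node number -> policy name. The names are assumed
    placeholders, not the port library's real constants.
    """
    return any(p in ("PREFERRED", "ALLOWED") for p in policies.values())


if __name__ == "__main__":
    print("nodes found:", list_numa_nodes())
```

Running the script inside and outside the container and comparing the node lists is one way to see whether the container exposes more than one node.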
Is this intermittent? I see that recent builds running on testc-packet-fedora33-amd-2 or on non-docker machines passed. https://ci.adoptopenjdk.net/job/Test_openjdk8_j9_sanity.functional_x86-64_linux_xl/73/
but I do not know if it's intermittent on that static docker instance.
I don't see a machine named docker-packet-ubuntu2004-x64-1f1. I thought all testc* machines were docker instances?
@sxa - you are the most likely to know the answer to #2143 (comment)
I moved it to another host - replace x64 with amd in the name and you have effectively the same system
I've created a pltest binary containing debug and sent it to Stewart to try on the docker instance.
On naming - yes it's a docker container. I've been experimenting with formats so there are several that can indicate it's docker adoptium/infrastructure#1809 (comment) ... Open to suggestions/comments on the preferred format that I'll convert them all to later 😊
The test is failing because there are 2 or more nodes detected (in Note the behavior is explicit in the VM code. It sets a variable
OK The two AMD EPYC systems I have had slightly different CPUs. NUMA isn't available on the EPYC 7402P but is (and so the test fails) on the EPYC 7401P (Memo to self: Here is the strace from Peter's standalone It includes:
It can be resolved by starting the container with
This can also be resolved with Reference: https://docs.docker.com/config/containers/resource_constraints/
I think I only have one NUMA-capable host system (It's a 2.2GHz Intel Xeon Gold 5120). It currently has six docker images on it. I've confirmed (By starting another instance of the Fedora33 docker image with I'd suggest there's still a question over whether this is the right thing to do - in the |
Although there were more containers deployed on the machine, only three have been active in Jenkins, so I have restarted and tested those: #87 #88 #89. The outstanding discussion is over whether we want to leave the machines with the
Subtest j9vmem_test_numa within the test target cmdLineTester_pltest_0 fails with:
jdk version under test:
Failing test machine: docker-packet-ubuntu2004-x64-1f1.
Test passed on test-ibmcloud-rhel6-x64-1 - maybe related to running on docker? (https://ci.adoptopenjdk.net/job/Test_openjdk8_j9_sanity.functional_x86-64_linux/63/)
Link to failing job: https://ci.adoptopenjdk.net/job/Test_openjdk8_j9_sanity.functional_x86-64_linux_xl/68/