-
-
Notifications
You must be signed in to change notification settings - Fork 30.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gh-117657: Enable TSAN check on macOS with free-threads build #120502
base: main
Are you sure you want to change the base?
Conversation
@@ -500,6 +500,7 @@ jobs: | |||
options: ./configure --config-cache --disable-gil --with-thread-sanitizer --with-pydebug | |||
suppressions_path: Tools/tsan/suppressions_free_threading.txt | |||
tsan_logs_artifact_name: tsan-logs-free-threading | |||
os-matrix: '["ubuntu-22.04", "macos-14"]' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To avoid queuing up too many jobs for the limited number of macOS builders, we'd want to use macos-14
for forks, but Cirrus for upstream builds.
See elsewhere in this file for examples, also needs changes in the reusable workflow.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm ambivalent about adding another TSan job. I'm not sure it's currently worth the extra CI resources and #120500 doesn't really seem like it uncovered a new race -- it might be due to the different version of clang or just a slightly different call stack that wasn't covered by the suppressions file.
if support.check_sanitizer(thread=True) and sys.platform == 'darwin': | ||
# See: gh-120696 | ||
raise unittest.SkipTest("this test will hang on macOS with TSAN") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If this is hanging in faulthandler, that probably means the test is crashing.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Faulthandler hanging is definitely not great, but even if faulthandler is fixed I expect there will still be a problem with this test.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These tests are just test the mallocator without the GIL, and intented to be crashed, so I think they are ok for now?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh, that makes sense -- I missed that.
I have another idea. We can just modified the code to temporarily skip these two problematic tests and run the TSAN checks on macOS locally. What do you think? |
Yeah, I think we should be able to successfully run TSan tests locally on macOS. There still seems to be some underlying bugs that we should fix. As you pointed out above, the test (in a subprocess) is supposed to crash because it calls
(I found it easier to run that directly under gdb or lldb than running the test case)
|
Hi @colesbury, I opened #120900 just to skip these two tests on macOS. |
There are some macOS only races, for example #120500, so I think maybe we should enable the test for macOS on GHA.
Another thing is that two TSAN tests will hang: #120696, so we should skip them with TSAN on macOS.