Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stacktrace handler revisited #11329

Merged
merged 3 commits into from
Sep 18, 2015

Conversation

bbockelm
Copy link
Contributor

This is the second version of the improved stacktrace. The first version had improper handling around forked processes (the helper thread would communicate with the wrong process).

  • Ran unit tests in FWCore/Modules successfully (these failed in the first round).
  • Verified stack traces are printed appropriately on SIGSEGV.
  • Verified detached thread causes no problem for the non-failure case.
  • Managed to send a SIGSEGV while one thread was holding the jemalloc lock; process did not deadlock.

We've been unable to figure out why GDB sometimes does not print
out the traceback of the thread invoking clone().

Accordingly, this patch launches a helper thread at the beginning
of the process; the signal handler will notify helper thread to
print out a traceback and wait for completion.

(cherry picked from commit 99b873e)
(cherry picked from commit 240c5b0)
Make sure that the child thread only talks to pipes in the *current*
process; before, the forked child process would unexpectedly talk
to the child thread of the parent process, triggering an exception.
@cmsbuild
Copy link
Contributor

A new Pull Request was created by @bbockelm (Brian Bockelman) for CMSSW_7_6_X.

Stacktrace handler revisited

It involves the following packages:

FWCore/Services

@cmsbuild, @smuzaffar, @Dr15Jones can you please review it and eventually sign? Thanks.
@Martin-Grunewald, @wddgit, @wmtan this is something you requested to watch as well.
You can sign-off by replying to this message having '+1' in the first line of your reply.
You can reject by replying to this message having '-1' in the first line of your reply.
If you are a L2 or a release manager you can ask for tests by saying 'please test' in the first line of a comment.
@Degano you are the release manager for this.
You can merge this pull request by typing 'merge' in the first line of your comment.

@Dr15Jones
Copy link
Contributor

please test

@Dr15Jones
Copy link
Contributor

@bbockelm did you test when forking that a crash in a child process would give a traceback?

@cmsbuild
Copy link
Contributor

The tests are being triggered in jenkins.

@bbockelm
Copy link
Contributor Author

Just tested this. stacktrace printer works fine in the child process.

However, it seems that the parent gets wedged in Xrootd-land when the child SEGV's. Doesn't appear to be related to this, however.

@cmsbuild
Copy link
Contributor

@Dr15Jones
Copy link
Contributor

+1

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next CMSSW_7_6_X IBs (tests are also fine). This pull request requires discussion in the ORP meeting before it's merged. @davidlange6, @Degano, @smuzaffar

@cmsbuild
Copy link
Contributor

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next CMSSW_7_6_X IBs (tests are also fine). This pull request requires discussion in the ORP meeting before it's merged. @davidlange6, @Degano, @smuzaffar

@davidlange6
Copy link
Contributor

+1

cmsbuild added a commit that referenced this pull request Sep 18, 2015
@cmsbuild cmsbuild merged commit c519d63 into cms-sw:CMSSW_7_6_X Sep 18, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants