Terminate with signal in case of an assert #134

akosthekiss · 2015-05-29T13:53:07Z

Automated debugging is easier if the process terminates with a
signal instead of a regular exit (code); call, since in this
latter case the cause of error cannot be automatically backtraced.

Care has been taken to ensure that the exit code after the signal
is the same as what we would get with exit.

JerryScript-DCO-1.0-Signed-off-by: Akos Kiss [email protected]

zherczeg · 2015-06-01T07:26:11Z

I had this problem with Jerry before. I suspect the reason for not using abort() is to avoid compilation issues on dev boards. Perhaps we could add an abort() to jerry libc, and call the normal abort on Linux.

akosthekiss · 2015-06-01T08:08:27Z

Going for abort (); is another, almost identical option. __builtin_trap (); was another -- quicker -- way.

From the proposed patch's point of view, the abort-approach would need the change of 2 lines only: replacing __builtin_trap with abort and re-defining ERR_FAILED_INTERNAL_ASSERTION to 134 (to keep internal error codes and externally visible exit codes aligned -- at least on x86-64/linux). However, this would require some more work on jerry-libc side.

We could also have a 2-step approach:

1. go with the current PR first,
2. add abort () (with all implementations) to jerry-libc and adapt jerry-core (as described above) as second.
IMHO
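
To make the 134 figure concrete, here is a minimal sketch (POSIX only, not part of the patch) of how a parent process sees the two termination styles. The helper names are hypothetical. The value 134 assumes that SIGABRT is signal 6 and that the shell reports 128 + signal number, which is the case on x86-64/linux.

```c
#include <signal.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Run `fail` in a child process and return the raw wait status, or -1 on error. */
static int run_in_child (void (*fail) (void))
{
  pid_t pid = fork ();
  if (pid < 0)
  {
    return -1;
  }
  if (pid == 0)
  {
    fail ();
    _exit (0); /* only reached if `fail` returns */
  }

  int status = 0;
  if (waitpid (pid, &status, 0) < 0)
  {
    return -1;
  }
  return status;
}

/* Hypothetical stand-ins for the two termination styles discussed above. */
static void fail_with_exit (void)
{
  _exit (120); /* regular exit with the internal error code */
}

static void fail_with_abort (void)
{
  abort (); /* raises SIGABRT */
}

/* The code a POSIX shell would typically report in $? for this wait status. */
static int shell_exit_code (int status)
{
  if (WIFSIGNALED (status))
  {
    return 128 + WTERMSIG (status);
  }
  return WEXITSTATUS (status);
}
```

A test harness can call run_in_child with either stand-in and then check WIFSIGNALED to tell a signal termination apart from a regular exit, which is exactly the distinction this PR is after.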

ruben-ayrapetyan · 2015-06-01T09:04:45Z

@akiss77, @zherczeg, there is also a third option that seems to be more convenient for automatic testing systems, and maybe, for manual debugging too.

As far as I understand, with SIGILL we trigger a core dump so that a backtrace can be extracted later.

While the SIGILL approach works and seems simple, it has some disadvantages. Using SIGILL:

- can confuse, because SIGILL means 'execution of invalid instruction', while the real termination cause is quite different;
- doesn't provide a way to introduce and distinguish various abnormal termination cases (with different exit codes);
- performs a core dump only for the fixed termination cause 'failed assertion' and cannot be used to backtrace, for example, an out-of-memory failure (if the exit code used for that cause is a different one).

There is another way to perform a core dump. Under Linux we could use gdb's gcore operation.
For example, when the exit function is reached with an exit code indicating an error, gdb could be invoked (using execve or something similar) with a command line instructing it to perform the gcore operation on the engine's process.

When using this way, we:

- don't confuse the termination cause with SIGILL;
- can use any exit codes (and can even perform core dumps without actually terminating the engine's process);
- can enable or disable core dumps for any situation (probably using an option like --enable-core-dump);
- can replace core dumping with just attaching gdb to the engine's process without terminating (--attach-gdb-on-fail ?).

Furthermore, this way we can get backtrace without allocating much storage space for core dumps during automatic testing.
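
A rough sketch of the idea, to make the proposal concrete. It assumes a POSIX system and the gcore script shipped with gdb, which accepts `-o <prefix> <pid>`. The function name and the return convention are hypothetical, not existing Jerry code. Note that on systems that restrict ptrace, a child process may not be allowed to attach to its parent, so this may need extra setup to work.

```c
#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/*
 * Ask an external tool to dump a core of the calling process.
 * `tool` is expected to accept gcore-style arguments: -o <prefix> <pid>.
 * Returns the tool's exit status (127 if it could not be executed),
 * or -1 if the tool could not be run to completion.
 */
static int dump_core_of_self (const char *tool, const char *prefix)
{
  char pid_str[32];
  snprintf (pid_str, sizeof (pid_str), "%ld", (long) getpid ());

  pid_t child = fork ();
  if (child < 0)
  {
    return -1;
  }
  if (child == 0)
  {
    /* In the child: replace ourselves with the dumping tool. */
    execlp (tool, tool, "-o", prefix, pid_str, (char *) NULL);
    _exit (127); /* exec failed: tool missing or not executable */
  }

  int status = 0;
  if (waitpid (child, &status, 0) < 0 || !WIFEXITED (status))
  {
    return -1;
  }
  return WEXITSTATUS (status);
}
```

The exit path could call something like dump_core_of_self ("gcore", "jerry-core") before exit (code) whenever the code indicates an error and core dumps are enabled; the process then still terminates with its own exit code.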

zherczeg · 2015-06-01T09:26:13Z

I feel we would just overcomplicate things. We need to capture abnormal program termination in testing environments, regardless of how we fix this. So there is no simplification here. When we are debugging, gdb has been started, and creating another one would be overkill. And abort() raises SIGABRT, not SIGILL; the latter is for invalid instructions.

ruben-ayrapetyan · 2015-06-01T09:32:55Z

@zherczeg, using abort upon failure seems to be a much better option than raising SIGILL.

ruben-ayrapetyan · 2015-06-01T09:35:32Z

> When we are debugging, gdb has been started, and creating another one would be overkill.

This is just a secondary use case, while the main proposed use case was to self-attach gdb upon failure during automatic testing for dumping backtrace, core, or whatever we would add to the dumping script.

akosthekiss · 2015-06-01T09:43:58Z

@ruben-ayrapetyan, tying gdb so tightly to jerry does not feel right to me. That would add a very strong dependency to the project. Also, usually, an automatic testing framework does start whatever debugger it wants from outside. Starting it up from inside of the tested app is normally unexpected.

Furthermore, the question/point (for me, at least) is not really which signal to raise, but just to raise one and not exit in a "normal" way. (As seen in all projects using assertions I had a chance to stumble upon.) SIGILL comes into play only because __builtin_trap is implemented that way in gcc 4.8.2 for x86-64/linux.

From the docs of gcc: "This function causes the program to exit abnormally. GCC implements this function by using a target-dependent mechanism (such as intentionally executing an illegal instruction) or by calling abort."

So, eventually, it may be the same as calling abort elsewhere.

In my view, the advantage of __builtin_trap is only that it's already there (being a builtin), while abort is still to be implemented in jerry-libc (AFAIK).
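
For anyone who wants to check what __builtin_trap does on their own toolchain, here is a small sketch (POSIX plus a GCC-compatible compiler assumed; the function name is made up for illustration). It reports which signal ends a child that executes the builtin. The result is target-dependent, as the gcc docs quoted above say.

```c
#include <signal.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/*
 * Returns the number of the signal that killed a child which executed
 * __builtin_trap (), 0 if the child was not killed by a signal,
 * or -1 on error.
 */
static int signal_raised_by_trap (void)
{
  pid_t pid = fork ();
  if (pid < 0)
  {
    return -1;
  }
  if (pid == 0)
  {
    __builtin_trap (); /* GCC/Clang builtin; does not return */
  }

  int status = 0;
  if (waitpid (pid, &status, 0) < 0)
  {
    return -1;
  }
  return WIFSIGNALED (status) ? WTERMSIG (status) : 0;
}
```

Comparing the returned value against SIGILL and SIGABRT shows which of the two mechanisms the compiler picked for the target.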

seanshpark · 2015-06-01T10:14:04Z

Hope this fix causes little or no problem on embedded RTOSes.

akosthekiss · 2015-06-01T11:02:35Z

@seanshpark, I'm not too familiar with embedded RTOSes so I have to rely on existing examples: iotjs also uses assert (), from assert.h, which eventually translates to an abort () call (on my box). The above discussed approaches (calling __builtin_trap () or abort () directly) are not exactly the same but may be close enough - unless the definition of assert () is significantly different on the RTOS.

seanshpark · 2015-06-01T23:15:01Z

@akiss77 , thank you for your kind explanation. :)

akosthekiss · 2015-06-02T15:27:01Z

Updated patch to use abort instead of __builtin_trap. Works as soon as #141 gets landed.

egavrin · 2015-06-02T15:30:32Z

jerry-core/jerry.h

@@ -50,7 +50,7 @@ typedef enum
  ERR_SYSCALL = 11,
  ERR_PARSER = 12,
  ERR_UNIMPLEMENTED_CASE = 118,
-  ERR_FAILED_INTERNAL_ASSERTION = 120
+  ERR_FAILED_INTERNAL_ASSERTION = 134


In case of abort usage we don't need this change anymore.

egavrin · 2015-06-02T15:43:30Z

Blocked by #141.

akosthekiss · 2015-06-02T16:22:26Z

ERR_FAILED_INTERNAL_ASSERTION can be left untouched, of course. However, this will result in an internal error code of 120 and an externally visible exit code of 134. (On x86-64/linux, at least.) Is that OK?

ruben-ayrapetyan · 2015-06-02T17:09:52Z

Using ERR_FAILED_INTERNAL_ASSERTION as an exceptional case, connected to SIGABRT, could lead to some difficulties in future. For example, if we would need core dumps for ERR_OUT_OF_MEMORY cases, it would be necessary to change its internal exit code to 134 too, but maybe some testing scripts would already use current value by that time, so they would need to be updated too, and also we wouldn't have a way to distinguish between the two failure types.

We could add a debug option like --abort-on-fail that would cause the engine to perform abort upon exit with a non-zero exit code (maybe, except ERR_SYSCALL?), and update the condition

if (code == ERR_FAILED_INTERNAL_ASSERTION)

with the following:

if (code != 0
    && jrt_is_abort_on_failure ())
{
  abort ();
}
else
{
  exit (code);
}

In that case, we would expect the exit code corresponding to SIGABRT if the option is passed in the debug version (maybe in release too?), and expect our internal exit codes otherwise.

We can leave abort for assertion failures in the pull request, and update the implementation in an upcoming enhancement.
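
A self-contained sketch of how the option could be wired up, as a rough illustration only. The flag name --abort-on-fail and jrt_is_abort_on_failure come from the proposal above; everything else (the flag storage, the parsing helper, fatal_sketch, and the fork-based checker) is hypothetical and not existing Jerry code. POSIX is assumed.

```c
#include <signal.h>
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

static bool abort_on_fail = false; /* hypothetical storage for the option */

/* Hypothetical option parsing: turn the behaviour on if the flag is present. */
static void parse_abort_on_fail (int argc, char **argv)
{
  for (int i = 1; i < argc; i++)
  {
    if (strcmp (argv[i], "--abort-on-fail") == 0)
    {
      abort_on_fail = true;
    }
  }
}

static bool jrt_is_abort_on_failure (void)
{
  return abort_on_fail;
}

/* Flag-dependent termination, following the condition proposed above. */
static void fatal_sketch (int code)
{
  if (code != 0 && jrt_is_abort_on_failure ())
  {
    abort ();
  }
  else
  {
    exit (code);
  }
}

/* Run fatal_sketch in a child and return the raw wait status, or -1 on error. */
static int status_of_fatal_in_child (int code)
{
  pid_t pid = fork ();
  if (pid < 0)
  {
    return -1;
  }
  if (pid == 0)
  {
    fatal_sketch (code); /* never returns */
  }

  int status = 0;
  if (waitpid (pid, &status, 0) < 0)
  {
    return -1;
  }
  return status;
}
```

With the flag off, the child exits with the internal code (WEXITSTATUS gives 120 for a failed assertion); with the flag on, it is killed by SIGABRT, so testing scripts that rely on the internal codes keep working unless they opt in.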

akosthekiss · 2015-06-02T19:49:05Z

Updated the patch according to the latest comments from @egavrin and @ruben-ayrapetyan . The introduction of an --abort-on-fail option seems to be a sensible next step. Thanks for the review.

egavrin · 2015-06-03T08:30:34Z

@akiss77 Great! make push

akosthekiss · 2015-06-03T09:27:02Z

Got make push rejected twice in a row because of parallel work on the repo. Now waiting a bit for the dust to settle. Will try again afterwards.

akosthekiss added the enhancement An improvement label May 29, 2015

egavrin added this to the Core ECMA features milestone May 29, 2015

egavrin self-assigned this May 29, 2015

akosthekiss mentioned this pull request Jun 2, 2015

Add abort () to jerry-libc #141

Merged

egavrin reviewed Jun 2, 2015

akosthekiss merged commit 6a60775 into jerryscript-project:master Jun 3, 2015

akosthekiss deleted the assertion_trap branch June 3, 2015 10:19

akosthekiss mentioned this pull request Jun 3, 2015

Make exit behaviour of jerry_fatal flag-dependent #146

Merged

zherczeg mentioned this pull request Apr 7, 2016

Nominating Akos Kiss (akiss77) for JerryScript Maintainer status #992

Closed

somang-park unassigned egavrin Nov 25, 2016

This was referenced May 17, 2020

stack-overflow in vm_loop #3750

Closed

stack-overflow in ecma_regexp_match #3753

Closed
