Handling of KeyboardInterrupt #9286

mibaumgartner · 2021-09-02T15:22:25Z

🐛 Bug

Same behavior as in #6807

May be caused due to the wrong indentation of the raise statement?
https://github.com/PyTorchLightning/pytorch-lightning/blob/75350938ca646efc0b4bac432ba2d5d4676662bb/pytorch_lightning/trainer/trainer.py#L524

To Reproduce

Run training and raise a Keyboard Interrupt.

Expected behavior

Training should end and KeyboardInterrupt should stop the whole program.

Environment

PyTorch Lightning Version (e.g., 1.3.0):
PyTorch Version (e.g., 1.8)
Python version:
OS (e.g., Linux):
CUDA/cuDNN version:
GPU models and configuration:
How you installed PyTorch (conda, pip, source):
If compiling from source, the output of torch.__config__.show():
Any other relevant information:

Additional context

The text was updated successfully, but these errors were encountered:

tchaton · 2021-09-02T15:44:33Z

@daniellepintz

daniellepintz · 2021-09-02T17:45:23Z

Thanks for reporting @mibaumgartner I am looking into it!

daniellepintz · 2021-09-02T17:48:26Z

May be caused due to the wrong indentation of the raise statement?

What do you mean exactly? I don't see any wrong indentation of the raise statement

mibaumgartner · 2021-09-02T20:19:43Z

Hi @daniellepintz, thanks for looking into this.
I thought about moving the raise statement one indentation to the left but it might be better (more explicit) to add a second raise statement in the keyboard interrupt block.

daniellepintz · 2021-09-02T21:16:42Z

@mibaumgartner Done in #9260

ananthsub · 2021-09-02T21:25:09Z

@tchaton @daniellepintz @awaelchli was it intentional that keyboard interrupt didn't re-raise the exception after doing the graceful teardown from #856 ? is that desirable?

awaelchli · 2021-09-02T23:57:13Z

yes certainly, ctrl+c -> graceful shutdown -> exit training without leaving behind a stack trace. that's simply for UX

ananthsub · 2021-09-03T01:11:02Z

yes certainly, ctrl+c -> graceful shutdown -> exit training without leaving behind a stack trace. that's simply for UX

Was there concern about ghost background processes in this case? Should this apply to all training types, or only for single device ones?

awaelchli · 2021-09-03T02:02:14Z

@daniellepintz interrupt / exception handling changes are not included in any release are they?
@mibaumgartner we are kindly requesting at least the PL version you were running this with please. KeyboardInterrupts work fine on master branch given my quick testing. more details would be appreciated to address your specific issue.

daniellepintz · 2021-09-03T03:57:56Z

yes that is correct, all of my recent changes are still unreleased

mibaumgartner · 2021-09-03T07:28:45Z

Hi @awaelchli,

It occurred with several (recent) versions of lightning, including the latest 1.4.5 release.

The Issue can be reproduced by raising a Keyboard Interrupt in the Lightning Module (could be a good test :) ):
https://colab.research.google.com/drive/1aSoLeHHJCyKCfxpm-YsdJ3Gq6YLNR2Ci?usp=sharing

Changes in notebook:

    def training_step(self, batch, batch_idx):
        output = self.layer(batch)
        loss = self.loss(batch, output)

        if batch_idx > 200: # see training starting, not actually needed
          raise KeyboardInterrupt

        return {"loss": loss}

    # Train the model ⚡
    trainer.fit(model, train, val)

    print("don't execute this since Keyboard Interrupt was raised in training")

    trainer.test(test_dataloaders=test)

(In the example notebook trainer.test also results in an additional error since the checkpoint from the training is missing)

daniellepintz · 2021-09-06T23:18:08Z

@Borda sorry for the confusion but I don't think #9260 quite resolves this (my bad for saying it did in the PR description)

even though we just deprecated on_keyboard_interrupt what @mibaumgartner is referring to will still remain.

current behavior is when there is a KeyboardInterrupt it gracefully exits the current trainer function , not the whole program. but when there is an exception, it raises the exception which exits the whole program. @awaelchli is this intended behavior?

awaelchli · 2021-09-06T23:37:01Z

Yes it will terminate only the current fit function. This graceful shutdown mechanism was introduced before Lightning v1.0, so it is standard behavior for a long time now.

To stop program execution, we can raise a SystemExit

daniellepintz · 2021-09-07T21:24:16Z

sounds good thanks @awaelchli . @mibaumgartner does this behavior work for you?

mibaumgartner · 2021-09-07T21:25:25Z

Yes, sounds good 👍

mibaumgartner added bug Something isn't working help wanted Open to be worked on labels Sep 2, 2021

daniellepintz mentioned this issue Sep 2, 2021

Deprecate on_keyboard_interrupt callback hook #9260

Merged

12 tasks

awaelchli added the information needed label Sep 3, 2021

Borda closed this as completed in #9260 Sep 6, 2021

daniellepintz mentioned this issue Sep 8, 2021

update GH templates' labels #9295

Merged

12 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handling of KeyboardInterrupt #9286

Handling of KeyboardInterrupt #9286

mibaumgartner commented Sep 2, 2021

tchaton commented Sep 2, 2021

daniellepintz commented Sep 2, 2021

daniellepintz commented Sep 2, 2021 •

edited

Loading

mibaumgartner commented Sep 2, 2021

daniellepintz commented Sep 2, 2021

ananthsub commented Sep 2, 2021

awaelchli commented Sep 2, 2021

ananthsub commented Sep 3, 2021

awaelchli commented Sep 3, 2021 •

edited

Loading

daniellepintz commented Sep 3, 2021

mibaumgartner commented Sep 3, 2021 •

edited

Loading

daniellepintz commented Sep 6, 2021

awaelchli commented Sep 6, 2021 •

edited

Loading

daniellepintz commented Sep 7, 2021

mibaumgartner commented Sep 7, 2021

Handling of KeyboardInterrupt #9286

Handling of KeyboardInterrupt #9286

Comments

mibaumgartner commented Sep 2, 2021

🐛 Bug

To Reproduce

Expected behavior

Environment

Additional context

tchaton commented Sep 2, 2021

daniellepintz commented Sep 2, 2021

daniellepintz commented Sep 2, 2021 • edited Loading

mibaumgartner commented Sep 2, 2021

daniellepintz commented Sep 2, 2021

ananthsub commented Sep 2, 2021

awaelchli commented Sep 2, 2021

ananthsub commented Sep 3, 2021

awaelchli commented Sep 3, 2021 • edited Loading

daniellepintz commented Sep 3, 2021

mibaumgartner commented Sep 3, 2021 • edited Loading

daniellepintz commented Sep 6, 2021

awaelchli commented Sep 6, 2021 • edited Loading

daniellepintz commented Sep 7, 2021

mibaumgartner commented Sep 7, 2021

daniellepintz commented Sep 2, 2021 •

edited

Loading

awaelchli commented Sep 3, 2021 •

edited

Loading

mibaumgartner commented Sep 3, 2021 •

edited

Loading

awaelchli commented Sep 6, 2021 •

edited

Loading