[refactor] Add state_dict to loops #8197

tchaton · 2021-06-29T12:38:34Z

What does this PR do?

This PR adds the templating logic for loops state. This would be used by Fault Tolerant Logic.

Before submitting

Was this discussed/approved via a GitHub issue? (not for typos and docs)
Did you read the contributor guideline, Pull Request section?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you make sure to update the documentation with your changes? (if necessary)
Did you write any new necessary tests? (not for typos and docs)
Did you verify new and existing tests pass locally with your changes?
Did you update the CHANGELOG? (not for typos, docs, test updates, or internal minor changes/refactorings)

PR review

Anyone in the community is free to review the PR once the tests have passed.
Before you start reviewing make sure you have read Review guidelines. In short, see the following bullet-list:

Is this pull request ready for review? (if not, please submit in draft mode)
Check that all items from Before submitting are resolved
Make sure the title is self-explanatory and the description concisely explains the PR
Add labels and milestones (and optionally projects) to the PR so it can be classified

Did you have fun?

Make sure you had fun coding 🙃

codecov · 2021-06-29T12:39:40Z

Codecov Report

Merging #8197 (342853c) into master (3e6f884) will decrease coverage by 5%.
The diff coverage is 96%.

@@           Coverage Diff           @@
##           master   #8197    +/-   ##
=======================================
- Coverage      93%     88%    -5%     
=======================================
  Files         212     212            
  Lines       13679   13695    +16     
=======================================
- Hits        12720   12052   -668     
- Misses        959    1643   +684

pytorch_lightning/loops/fit_loop.py

pytorch_lightning/loops/epoch/training_epoch_loop.py

pytorch_lightning/loops/dataloader/evaluation_loop.py

pytorch_lightning/loops/base.py

pytorch_lightning/loops/dataloader/evaluation_loop.py

pytorch_lightning/loops/fit_loop.py

ananthsub

do you have examples of what types of state the loop needs to save/load?

carmocca · 2021-06-29T17:34:21Z

do you have examples of what types of state the loop needs to save/load?

Namely progress (dataclasses) and results (ResultCollection)

trainer
    loops:
        fit_loop:
            epoch_loop:
                batch_loop:
                val_loop:
                    ...
        validate_loop:
            ...
        test_loop:
            ...
------
and each loop has:
a_loop:
    progress
    results

pytorch_lightning/loops/base.py

pytorch_lightning/loops/batch/training_batch_loop.py

pytorch_lightning/loops/epoch/training_epoch_loop.py

pytorch_lightning/loops/fit_loop.py

pytorch_lightning/trainer/properties.py

pytorch_lightning/loops/fit_loop.py

…tning/pytorch-lightning into add_loop_state_dicts

ananthsub

could you describe how the trainer will resume the states? at what point is the loop's load_state_dict called?

also does what requirements does this impose on composition of loops?

pytorch_lightning/loops/base.py

tests/loops/test_loops.py

pytorch_lightning/trainer/properties.py

…tning/pytorch-lightning into add_loop_state_dicts

tests/loops/ test_loop_state_dict.py

pytorch_lightning/trainer/properties.py

tchaton added 2 commits June 29, 2021 13:32

add state_dict, loop_dict on loops

09cee73

update

efd2e50

tchaton self-assigned this Jun 29, 2021

tchaton added this to the v1.4 milestone Jun 29, 2021

tchaton added the refactor label Jun 29, 2021

update changelog

0430370

tchaton marked this pull request as ready for review June 29, 2021 12:41

tchaton requested review from awaelchli, Borda, carmocca, justusschock, kaushikb11, SeanNaren and williamFalcon as code owners June 29, 2021 12:41

carmocca reviewed Jun 29, 2021

View reviewed changes

awaelchli reviewed Jun 29, 2021

View reviewed changes

pytorch_lightning/loops/dataloader/evaluation_loop.py Outdated Show resolved Hide resolved

pytorch_lightning/loops/fit_loop.py Outdated Show resolved Hide resolved

mergify bot added the has conflicts label Jun 29, 2021

ananthsub reviewed Jun 29, 2021

View reviewed changes

tchaton added 2 commits June 30, 2021 13:03

resolve on comments

21190e2

Merge branch 'master' into add_loop_state_dicts

27a18e9

mergify bot removed the has conflicts label Jun 30, 2021

tchaton requested review from ananthsub, awaelchli and carmocca June 30, 2021 12:04

carmocca reviewed Jun 30, 2021

View reviewed changes

tchaton added 2 commits June 30, 2021 17:40

update on comments

b92c681

Merge branch 'add_loop_state_dicts' of https://github.com/PyTorchLigh…

5e0631d

…tning/pytorch-lightning into add_loop_state_dicts

tchaton requested a review from carmocca June 30, 2021 16:43

Update tests and CHANGELOG

3d748cb

carmocca force-pushed the add_loop_state_dicts branch from 113c569 to 3d748cb Compare June 30, 2021 16:53

Move code and rename

dad9f13

carmocca approved these changes Jun 30, 2021

View reviewed changes

carmocca added 2 commits June 30, 2021 19:41

Rename

cf5a260

Rename

c6d026a

ananthsub reviewed Jun 30, 2021

View reviewed changes

pytorch_lightning/loops/base.py Show resolved Hide resolved

tests/loops/test_loops.py Outdated Show resolved Hide resolved

ananthsub reviewed Jun 30, 2021

View reviewed changes

pytorch_lightning/trainer/properties.py Outdated Show resolved Hide resolved

tchaton added 2 commits June 30, 2021 19:37

change test file name

95a4073

Merge branch 'add_loop_state_dicts' of https://github.com/PyTorchLigh…

7fb2735

…tning/pytorch-lightning into add_loop_state_dicts

awaelchli approved these changes Jul 1, 2021

View reviewed changes

tests/loops/ test_loop_state_dict.py Outdated Show resolved Hide resolved

Borda approved these changes Jul 1, 2021

View reviewed changes

tchaton enabled auto-merge (squash) July 1, 2021 09:23

tchaton disabled auto-merge July 1, 2021 09:23

rename file

d2576fb

tchaton enabled auto-merge (squash) July 1, 2021 09:24

Address comments

90663d6

carmocca reviewed Jul 1, 2021

View reviewed changes

pytorch_lightning/trainer/properties.py Outdated Show resolved Hide resolved

carmocca added 2 commits July 1, 2021 11:49

Update pytorch_lightning/trainer/properties.py

d207caa

update test

b814c19

carmocca disabled auto-merge July 1, 2021 09:53

carmocca enabled auto-merge (squash) July 1, 2021 09:54

Merge branch 'master' into add_loop_state_dicts

342853c

carmocca disabled auto-merge July 1, 2021 14:08

carmocca enabled auto-merge (squash) July 1, 2021 14:08

carmocca merged commit d51b0ae into master Jul 1, 2021

carmocca deleted the add_loop_state_dicts branch July 1, 2021 15:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[refactor] Add state_dict to loops #8197

[refactor] Add state_dict to loops #8197

tchaton commented Jun 29, 2021 •

edited

Loading

codecov bot commented Jun 29, 2021 •

edited

Loading

ananthsub left a comment

carmocca commented Jun 29, 2021 •

edited

Loading

ananthsub left a comment

[refactor] Add state_dict to loops #8197

[refactor] Add state_dict to loops #8197

Conversation

tchaton commented Jun 29, 2021 • edited Loading

What does this PR do?

Before submitting

PR review

Did you have fun?

codecov bot commented Jun 29, 2021 • edited Loading

Codecov Report

ananthsub left a comment

Choose a reason for hiding this comment

carmocca commented Jun 29, 2021 • edited Loading

ananthsub left a comment

Choose a reason for hiding this comment

tchaton commented Jun 29, 2021 •

edited

Loading

codecov bot commented Jun 29, 2021 •

edited

Loading

carmocca commented Jun 29, 2021 •

edited

Loading