Saving checkpoint before evaluation #226

AdrianKs · 2021-09-16T14:08:09Z

Currently we store checkpoints only after evaluation.
If evaluation fails for some reason (e.g. OOM), we lose the complete epoch.
Therefore, we should store the checkpoint before (or even while) running the evaluation code.
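
A minimal sketch of the proposed ordering, assuming a generic training loop. `run`, `train_epoch`, `evaluate`, and `save_checkpoint` are hypothetical placeholders, not this project's API, and the JSON state stands in for real model and optimizer state. The checkpoint is written before evaluation starts, so a failure during evaluation leaves the epoch's training state on disk.

```python
import json
import os
import tempfile


def save_checkpoint(state, path):
    """Write the state via a temp file so a crash cannot leave a half-written checkpoint."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as f:
        json.dump(state, f)
    os.replace(tmp_path, path)  # atomic replace on the same filesystem


def run(num_epochs, train_epoch, evaluate, checkpoint_path):
    """Hypothetical loop: train, checkpoint, then evaluate."""
    state = {"epoch": 0, "metrics": []}
    for epoch in range(1, num_epochs + 1):
        train_epoch(state)
        state["epoch"] = epoch
        # Save first: if evaluate() raises (e.g. OOM), this epoch is not lost.
        save_checkpoint(state, checkpoint_path)
        state["metrics"].append(evaluate(state))
        # Save again so the checkpoint also carries the evaluation result.
        save_checkpoint(state, checkpoint_path)
    return state
```

If evaluation fails in some epoch, the file at `checkpoint_path` still holds the training state of that epoch together with the metrics of all earlier epochs, so training could resume from there instead of repeating the epoch.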

AprLie · 2022-10-02T15:11:39Z

I think this problem can be avoided by evaluating the model before the training phase, since saving checkpoints after evaluation lets us keep the best model and supports early stopping well.

AdrianKs added the enhancement New feature or request label Sep 16, 2021
