[`Trainer` / `GC`] Add `gradient_checkpointing_kwargs` in trainer and training arguments #27068

younesbelkada · 2023-10-25T13:43:46Z

What does this PR do?

Partially fixes: huggingface/trl#912

Following #27020, `gradient_checkpointing_kwargs` also needs to be propagated through `Trainer` and its training arguments.
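To illustrate the kind of propagation described above, here is a minimal, library-free sketch. `SketchTrainingArguments`, `SketchModel` and `enable_checkpointing` are hypothetical stand-ins invented for this example, not the code in this PR. Only the names `gradient_checkpointing_kwargs`, `gradient_checkpointing_enable` and `use_reentrant` come from this thread and the linked PEFT PR. The guard on a `gradient_checkpointing` flag is an assumption.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class SketchTrainingArguments:
    """Hypothetical stand-in for the training arguments touched by this PR."""

    gradient_checkpointing: bool = False
    gradient_checkpointing_kwargs: Optional[Dict[str, Any]] = field(
        default=None,
        metadata={"help": "Keyword arguments forwarded to gradient checkpointing."},
    )


class SketchModel:
    """Hypothetical model that only records the kwargs it was given."""

    def __init__(self) -> None:
        self.checkpointing_kwargs: Optional[Dict[str, Any]] = None

    def gradient_checkpointing_enable(self, gradient_checkpointing_kwargs=None):
        # Normalise None to an empty dict so the kwargs can always be unpacked.
        self.checkpointing_kwargs = dict(gradient_checkpointing_kwargs or {})


def enable_checkpointing(model, args):
    """Forward the kwargs from the arguments object to the model, if enabled."""
    if args.gradient_checkpointing:  # assumption: checkpointing is opt-in
        model.gradient_checkpointing_enable(
            gradient_checkpointing_kwargs=args.gradient_checkpointing_kwargs
        )
    return model


model = enable_checkpointing(
    SketchModel(),
    SketchTrainingArguments(
        gradient_checkpointing=True,
        gradient_checkpointing_kwargs={"use_reentrant": False},
    ),
)
print(model.checkpointing_kwargs)  # {'use_reentrant': False}
```

With the real library, the equivalent user-facing call would presumably be passing `gradient_checkpointing_kwargs={"use_reentrant": False}` alongside `gradient_checkpointing=True` when building the training arguments.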

cc @ArthurZucker

HuggingFaceDocBuilderDev · 2023-10-25T14:05:25Z

The documentation is not available anymore as the PR was closed or merged.

ArthurZucker

Thanks 🔥

ArthurZucker · 2023-10-25T14:11:55Z

src/transformers/training_args.py

+    gradient_checkpointing_kwargs: dict = field(
+        default=None,
+        metadata={
+            "help": "Gradient checkpointing key word arguments. Will be passed to `torch.utils.checkpoint.checkpoint` through `model.gradient_checkpointing_enable`."


Could you add why anyone would want to pass anything, and what is supported?

And a small small test!

ArthurZucker · 2023-10-25T14:12:53Z

Also, why does it only partially fix the issue?

younesbelkada · 2023-10-30T09:02:12Z

It only partially fixes the issue because huggingface/peft#1036 also needs to be merged to fix the bug for PEFT models.

… training arguments (huggingface#27068)

* add `gradient_checkpointing_kwargs` in trainer and training arguments
* add comment
* add test - currently failing
* now tests pass

add gradient_checkpointing_kwargs in trainer and training arguments

9590031

younesbelkada requested a review from ArthurZucker October 25, 2023 13:43

younesbelkada mentioned this pull request Oct 25, 2023

[core] Fix use_reentrant issues huggingface/peft#1036

Merged

ArthurZucker approved these changes Oct 25, 2023

younesbelkada added 3 commits October 27, 2023 14:16

Merge remote-tracking branch 'upstream/main' into trainer-fix-gc

7a0be33

add comment

77aa687

add test - currently failing

9beef59

younesbelkada marked this pull request as draft October 27, 2023 20:53

now tests pass

f91a9ec

younesbelkada marked this pull request as ready for review October 30, 2023 09:01

Merge remote-tracking branch 'upstream/main' into trainer-fix-gc

6df2f8e

younesbelkada merged commit 5fbed2d into huggingface:main Oct 30, 2023

younesbelkada deleted the trainer-fix-gc branch October 30, 2023 11:42
