
[WIP] Multi-adapter saving support for PEFT #26411

Closed

Conversation

younesbelkada (Contributor)

What does this PR do?

To be potentially merged after #26407
This PR adds multi-adapter support to `save_pretrained`, for consistency with the PEFT API, which saves all adapters when `save_pretrained` is called. Note that the default adapter is always saved in the root of `save_directory`.

cc @LysandreJik @BenjaminBossan @pacman100
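
For readers skimming the thread, here is a minimal sketch of the intended usage (the hub repo names are real examples used in the transformers docs; the exact on-disk layout for non-default adapters is my assumption, since this PR was never merged):

from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")
model.load_adapter("ybelkada/opt-350m-lora", adapter_name="default")
model.load_adapter("ybelkada/opt-350m-lora", adapter_name="other")

# With this PR, save_pretrained would save *all* loaded adapters, as PEFT does.
model.save_pretrained("saved-model")

# Assumed layout: the default adapter at the root of save_directory,
# every other adapter in a subfolder named after it:
#
# saved-model/
#   adapter_config.json      <- "default" adapter
#   adapter_model.bin
#   other/
#     adapter_config.json    <- "other" adapter
#     adapter_model.bin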

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@BenjaminBossan (Member) left a comment


Thanks for the update. In general, LGTM. I have only some minor comments, no blockers.

Gets the current active adapters of the model. In case of multi-adapter inference (combining multiple adapters
for inference) returns the list of all active adapters so that users can deal with them accordingly.

For previous PEFT versions (that does not support multi-adapter inference), `module.active_adapter` will return

Suggested change:

- For previous PEFT versions (that does not support multi-adapter inference), `module.active_adapter` will return
+ For previous PEFT versions (that do not support multi-adapter inference), `module.active_adapter` will return

I think this statement is a bit confusing: This method always returns a list of str, right? The statement seems to relate to what older PEFT versions do under the hood, but that's an implementation detail and should not be mentioned in the docstring (but you could add this as a comment in the code).


- def active_adapter(self) -> str:
+ def active_adapters(self) -> List[str]:

Here, `.active_adapters` is a method, while in PEFT it's a property. Since we are newly introducing this method in transformers, do we want to take the opportunity to make it a property, for consistency? The downside is that `active_adapter` is a method here, not a property, so it would be inconsistent with that method. We could change that one too, but doing so would break backwards compatibility (BC).
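
For illustration, a self-contained toy showing the method-vs-property asymmetry under discussion (the `AdapterModel` class is hypothetical, not code from the PR):

from typing import List

class AdapterModel:
    def __init__(self) -> None:
        self._active = ["default"]

    # Existing transformers API: a *method* returning one name (kept for BC).
    def active_adapter(self) -> str:
        return self._active[0]

    # The suggestion: expose the new plural form as a *property*, as PEFT does.
    @property
    def active_adapters(self) -> List[str]:
        return list(self._active)

model = AdapterModel()
print(model.active_adapter())  # called like a method: 'default'
print(model.active_adapters)   # accessed like an attribute: ['default']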

        break

# For previous PEFT versions
if isinstance(active_adapters, str):

Is it possible to have no active adapter at all or is this prevented at some point? Otherwise, active_adapters could be undefined here.
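
For context, here is a sketch of the full method with one possible guard for that case (the explicit `ValueError` is my assumption, not code from this PR; `BaseTunerLayer` is PEFT's actual tuner-layer base class):

from typing import List

from peft.tuners.tuners_utils import BaseTunerLayer

def active_adapters(self) -> List[str]:
    # Method sketch: `self` stands for the transformers PreTrainedModel.
    active_adapters = None
    for module in self.modules():
        if isinstance(module, BaseTunerLayer):
            active_adapters = module.active_adapter
            break

    # Guard for the case raised above: if no tuner layer was found,
    # active_adapters would otherwise still be None here.
    if active_adapters is None:
        raise ValueError("No adapter loaded on this model, so there is no active adapter.")

    # Older PEFT versions expose a single adapter name as a str; normalize to a list.
    if isinstance(active_adapters, str):
        active_adapters = [active_adapters]
    return active_adapters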

Comment on lines +2025 to +2028
if len(peft_multi_adapter_state_dict.keys()) == 1:
    current_adapter = list(peft_multi_adapter_state_dict.keys())[0]
    state_dict = peft_multi_adapter_state_dict[current_adapter].copy()
    peft_multi_adapter_state_dict = None

How about this change, which I think makes the intent a bit more obvious and avoids changing the type of `peft_multi_adapter_state_dict`:

Suggested change:

  if len(peft_multi_adapter_state_dict.keys()) == 1:
      current_adapter = list(peft_multi_adapter_state_dict.keys())[0]
-     state_dict = peft_multi_adapter_state_dict[current_adapter].copy()
-     peft_multi_adapter_state_dict = None
+     state_dict = peft_multi_adapter_state_dict.pop(current_adapter).copy()

Not sure if the .copy() is needed?

Then, the change below:

- _peft_save_multi_adapter = _hf_peft_config_loaded and peft_multi_adapter_state_dict is not None
+ _peft_save_multi_adapter = _hf_peft_config_loaded and peft_multi_adapter_state_dict
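
A quick toy demonstration (made-up data) of why the `.pop()` variant makes the plain truthiness check work:

state = {"default": {"weight": 1}}  # single-adapter case
only_adapter = next(iter(state))
picked = state.pop(only_adapter)    # removes the entry and returns its value

print(picked)       # {'weight': 1}
print(bool(state))  # False: the emptied dict is falsy, so the
                    # `_hf_peft_config_loaded and peft_multi_adapter_state_dict`
                    # expression is falsy once the single adapter has been popped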

for adapter_name in peft_multi_adapter_state_dict:
    for ignore_key in self._keys_to_ignore_on_save:
        if ignore_key in peft_multi_adapter_state_dict[adapter_name].keys():
            del peft_multi_adapter_state_dict[adapter_name][ignore_key]

I asked ChatGPT to make this block more elegant; here is what it came up with:

if not _peft_save_multi_adapter:
    state_dict = {k: v for k, v in state_dict.items() if k not in self._keys_to_ignore_on_save}
else:
    peft_multi_adapter_state_dict = {
        adapter_name: {k: v for k, v in adapter.items() if k not in self._keys_to_ignore_on_save}
        for adapter_name, adapter in peft_multi_adapter_state_dict.items()
    }

WDYT? :)
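
As a quick sanity check with hypothetical keys, the comprehension filters the same entries as the nested loop with `del`:

keys_to_ignore = ["lm_head.weight"]  # stand-in for self._keys_to_ignore_on_save
adapters = {"default": {"lora_A": 1, "lm_head.weight": 2}}

filtered = {
    adapter_name: {k: v for k, v in adapter.items() if k not in keys_to_ignore}
    for adapter_name, adapter in adapters.items()
}
print(filtered)  # {'default': {'lora_A': 1}}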

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@BenjaminBossan (Member)

@younesbelkada What's the status?

@younesbelkada (Contributor, Author)

@BenjaminBossan I think this might be too much of an edge case for the work it requires. I propose we keep this PR open, and if interest arises from the community, I'll work on it.

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.
