feat: lora fine-tuning in FHE + gpt2 use case example #823
Conversation
Force-pushed from 23ebfd9 to f20697d
Timings are from my computer; they haven't been refreshed.
Force-pushed from 6c1caeb to e63afe9
Force-pushed from e63afe9 to 1e92c43
I haven't reviewed everything yet.
The main issues are:
- can we make it work for models other than GPT2 (for example the simple MLP)?
- make LoraTraining use self.training to determine whether it is doing inference or training (see the sketch below)
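A minimal sketch of the self.training idea, assuming a hypothetical LoraTrainingSketch wrapper (the class and argument names are illustrative, not the actual concrete-ml API): the module relies on the standard torch.nn.Module training flag, toggled by model.train() and model.eval(), instead of a separate mode argument.

```python
import torch


class LoraTrainingSketch(torch.nn.Module):
    """Illustrative wrapper that picks the training or inference path via self.training."""

    def __init__(self, inference_model: torch.nn.Module):
        super().__init__()
        self.inference_model = inference_model

    def forward(self, x, labels=None):
        if self.training:
            # Training path (after model.train()): return the loss that drives
            # the LoRA backward pass.
            logits = self.inference_model(x)
            return torch.nn.functional.cross_entropy(logits, labels)
        # Inference path (after model.eval()): plain forward pass.
        return self.inference_model(x)
```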
I would like to remove the transformers dependency:
- move the dependency to the use case example and replace Conv1D with nn.Linear in the use case (a conversion sketch follows this list)
- parametrize the remote module finding function in a way that avoids the dependency
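As an illustration of the first point, here is a minimal sketch of converting a transformers Conv1D into an equivalent nn.Linear on the use case side (the helper name conv1d_to_linear is made up for this example). transformers.Conv1D computes x @ W + b with a weight of shape (in_features, out_features), so the nn.Linear weight is the transpose of the Conv1D weight.

```python
import torch
from transformers import Conv1D as TransformerConv1D


def conv1d_to_linear(conv1d: TransformerConv1D) -> torch.nn.Linear:
    """Build an nn.Linear that computes the same mapping as a transformers Conv1D."""
    in_features, out_features = conv1d.weight.shape
    linear = torch.nn.Linear(in_features, out_features)
    with torch.no_grad():
        # Conv1D computes x @ W + b, nn.Linear computes x @ W.T + b, hence the transpose.
        linear.weight.copy_(conv1d.weight.t())
        linear.bias.copy_(conv1d.bias)
    return linear
```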
src/concrete/ml/torch/lora.py (Outdated)

from typing import List

import torch
from transformers import Conv1D as TransformerConv1D
This adds a dependency on transformers; we had removed it earlier.
Alright, done.
return grad_input, None, None


def get_remote_names(model: torch.nn.Module, include_embedding_layers: bool = False) -> List[str]:
This is too specific for a library function, e.g. the lm_head moniker. I suggest adding two arguments, remote_layer_types and layer_reject_filter. You can then call it with remote_layer_types = [nn.Linear, nn.Embedding, transformers.Conv1D], so CML does not need to know about TransformerConv1D, and layer_reject_filter = ['lm_head']. You can thus remove the include_embedding_layers flag (a sketch follows).
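A rough sketch of the suggested signature, using the parameter names from the comment above (this is the reviewer's proposal, not the merged implementation): the caller supplies the layer types to treat as remote modules and the module names to reject, so the library itself never has to import transformers.

```python
from typing import List, Tuple, Type

import torch


def get_remote_names_sketch(
    model: torch.nn.Module,
    remote_layer_types: Tuple[Type[torch.nn.Module], ...],
    layer_reject_filter: List[str],
) -> List[str]:
    """Return names of modules whose type is in remote_layer_types, skipping rejected names."""
    names = []
    for name, module in model.named_modules():
        if isinstance(module, remote_layer_types) and not any(
            rejected in name for rejected in layer_reject_filter
        ):
            names.append(name)
    return names


# Example call from the use case side, where the transformers import would live:
# from transformers import Conv1D
# remote = get_remote_names_sketch(model, (torch.nn.Linear, torch.nn.Embedding, Conv1D), ["lm_head"])
```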
But in the function we manually add CustomLinear if the user asks for nn.Linear.
I am not sure here. This is very specific to replace_layers_with_custom. I would not let the user select layers themselves. For now we run linear layers in FHE, and we have a workaround for lm_head and the embedding layers until those are fixed.
Force-pushed from 09a96b5 to 0379e15
Coverage failed ❌ (Coverage details)
Excellent work here @RomanBredehoft and @jfrery! Currently running the notebook with 100 epochs.
refs https://github.com/zama-ai/concrete-ml-internal/issues/4522