Fix `cos_sin` device issue in Falcon model #26448

ydshieh · 2023-09-27T15:08:37Z

What does this PR do?

We should update the device accordingly. See the comments along the changes.

So far, running the model on GPU, then change model to CPU and running CPU inputs, the cached tensors (like cos_cached) will be on GPU and fail to run with CPU tensors.

ydshieh · 2023-09-27T15:17:03Z

src/transformers/models/falcon/modeling_falcon.py

+        # the cached tensors need to update their devices (for example, after we change the model's device)
+        if device != self.cos_cached:
+            self.cos_cached = self.cos_cached.to(device)
+            self.sin_cached = self.sin_cached.to(device)


without this, we have problems when switching a model device

Shouldn't this be a check on the device of self.cos_cached rather than on the tensor?

Yes, don't know why I made such mistake (as it works on the code snippet I am using 😅 sorry). You saved me!

HuggingFaceDocBuilderDev · 2023-09-27T15:30:03Z

The documentation is not available anymore as the PR was closed or merged.

ydshieh · 2023-09-28T07:37:18Z

src/transformers/models/falcon/modeling_falcon.py

+        self.cos_cached = self.cos_cached.to(device)
+        self.sin_cached = self.sin_cached.to(device)


I removed the check of device. As we can't compare device (given by a tensor) with a string directly.
If it is on the same device (inferred by torch itself), there won't be any operation performed.

LysandreJik

Great, thanks @ydshieh!

ydshieh added 2 commits September 27, 2023 16:03

fix

ac9ae23

fix

24ef7af

ydshieh commented Sep 27, 2023

View reviewed changes

ydshieh requested a review from LysandreJik September 27, 2023 15:27

fix

1a4f624

ydshieh commented Sep 28, 2023

View reviewed changes

LysandreJik approved these changes Sep 28, 2023

View reviewed changes

LysandreJik merged commit 375b4e0 into main Sep 28, 2023

LysandreJik deleted the fix_flacon_device branch September 28, 2023 08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix `cos_sin` device issue in Falcon model #26448

Fix `cos_sin` device issue in Falcon model #26448

ydshieh commented Sep 27, 2023 •

edited

Loading

ydshieh Sep 27, 2023

LysandreJik Sep 28, 2023

ydshieh Sep 28, 2023

HuggingFaceDocBuilderDev commented Sep 27, 2023 •

edited

Loading

ydshieh Sep 28, 2023

LysandreJik left a comment

		self.cos_cached = self.cos_cached.to(device)
		self.sin_cached = self.sin_cached.to(device)

Fix cos_sin device issue in Falcon model #26448

Fix cos_sin device issue in Falcon model #26448

Conversation

ydshieh commented Sep 27, 2023 • edited Loading

What does this PR do?

ydshieh Sep 27, 2023

Choose a reason for hiding this comment

LysandreJik Sep 28, 2023

Choose a reason for hiding this comment

ydshieh Sep 28, 2023

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Sep 27, 2023 • edited Loading

ydshieh Sep 28, 2023

Choose a reason for hiding this comment

LysandreJik left a comment

Choose a reason for hiding this comment

Fix `cos_sin` device issue in Falcon model #26448

Fix `cos_sin` device issue in Falcon model #26448

ydshieh commented Sep 27, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 27, 2023 •

edited

Loading