Change cached seq_len to int to enable compilation #38
First of all, thanks for the nicely reusable package!

With recent versions of PyTorch, `torch.compile()` fails with the error "RuntimeError: aten::copy() Expected a value of type 'Tensor' for argument 'src' but instead found type 'int'." on the line that does `self.cached_freqs_seq_len.copy_(seq_len)`, where `seq_len` is an int. It also warns about a "Graph break from Tensor.item()" on the line that has `(offset + seq_len) <= self.cached_freqs_seq_len.item()`.

This PR fixes both by changing `cached_freqs_seq_len` and `cached_scales_seq_len` from singleton int tensors to plain Python integers. Forgive me if I overlooked anything, but it seems to me that there is no benefit to keeping these values on the GPU.
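For illustration, here is a minimal sketch of the caching pattern after the change (class and method names are hypothetical, and a plain list stands in for the actual frequency tensor). The cached length becomes a plain `int`, so the comparison needs no `.item()` (no graph break) and the update is a plain assignment rather than `Tensor.copy_()`:

```python
class FreqCache:
    """Hypothetical sketch: cache frequencies keyed by a plain-int length."""

    def __init__(self):
        # Plain Python int, not a singleton tensor: comparing and
        # reassigning it never touches the GPU or breaks the graph.
        self.cached_freqs_seq_len = 0
        self.cached_freqs = None

    def get_freqs(self, seq_len, offset=0):
        # Recompute only when the request exceeds the cached length.
        if self.cached_freqs is None or (offset + seq_len) > self.cached_freqs_seq_len:
            # Stand-in for the real frequency computation.
            self.cached_freqs = list(range(offset + seq_len))
            # Plain int assignment replaces Tensor.copy_(seq_len).
            self.cached_freqs_seq_len = offset + seq_len
        return self.cached_freqs[offset:offset + seq_len]
```

A second call with a shorter sequence reuses the cache without recomputing, since the int comparison alone decides the cache hit.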