AMDGPU: Fix inst-selection of large scratch offsets with sgpr base #110256

petar-avramovic · 2024-09-27T12:49:41Z

Use i32 for offset instead of i16, this way it does not get interpreted
as negative 16 bit offset.

petar-avramovic · 2024-09-27T12:49:53Z

This stack of pull requests is managed by Graphite. Learn more about stacking.

Join @petar-avramovic and the rest of your teammates on Graphite

llvmbot · 2024-09-27T12:51:30Z

@llvm/pr-subscribers-backend-amdgpu

Author: Petar Avramovic (petar-avramovic)

Changes

Use i32 for offset instead of i16, this way it does not get interpreted
as negative 16 bit offset.

Full diff: https://github.com/llvm/llvm-project/pull/110256.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp (+1-1)
(modified) llvm/test/CodeGen/AMDGPU/flat-scratch.ll (+2-2)

diff --git a/llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp b/llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
index d3d5bc924525fc..48971a6840c779 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
+++ b/llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
@@ -1911,7 +1911,7 @@ bool AMDGPUDAGToDAGISel::SelectScratchSAddr(SDNode *Parent, SDValue Addr,
                     0);
   }
 
-  Offset = CurDAG->getTargetConstant(COffsetVal, DL, MVT::i16);
+  Offset = CurDAG->getTargetConstant(COffsetVal, DL, MVT::i32);
 
   return true;
 }
diff --git a/llvm/test/CodeGen/AMDGPU/flat-scratch.ll b/llvm/test/CodeGen/AMDGPU/flat-scratch.ll
index 667a8a38c62ecc..496ac80a3dfbcf 100644
--- a/llvm/test/CodeGen/AMDGPU/flat-scratch.ll
+++ b/llvm/test/CodeGen/AMDGPU/flat-scratch.ll
@@ -4926,7 +4926,7 @@ define amdgpu_gs void @sgpr_base_large_offset(ptr addrspace(1) %out, ptr addrspa
 ;
 ; GFX12-LABEL: sgpr_base_large_offset:
 ; GFX12:       ; %bb.0: ; %entry
-; GFX12-NEXT:    scratch_load_b32 v2, off, s0 offset:-24
+; GFX12-NEXT:    scratch_load_b32 v2, off, s0 offset:65512
 ; GFX12-NEXT:    s_wait_loadcnt 0x0
 ; GFX12-NEXT:    global_store_b32 v[0:1], v2, off
 ; GFX12-NEXT:    s_nop 0
@@ -4985,7 +4985,7 @@ define amdgpu_gs void @sgpr_base_large_offset(ptr addrspace(1) %out, ptr addrspa
 ;
 ; GFX12-PAL-LABEL: sgpr_base_large_offset:
 ; GFX12-PAL:       ; %bb.0: ; %entry
-; GFX12-PAL-NEXT:    scratch_load_b32 v2, off, s0 offset:-24
+; GFX12-PAL-NEXT:    scratch_load_b32 v2, off, s0 offset:65512
 ; GFX12-PAL-NEXT:    s_wait_loadcnt 0x0
 ; GFX12-PAL-NEXT:    global_store_b32 v[0:1], v2, off
 ; GFX12-PAL-NEXT:    s_nop 0

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp

jayfoad

LGTM, thanks! Please also backport to release/19.x.

Use i32 for offset instead of i16, this way it does not get interpreted as negative 16 bit offset.

petar-avramovic · 2024-09-30T08:43:12Z

Merge activity

Sep 30, 4:43 AM EDT: @petar-avramovic started a stack merge that includes this pull request via Graphite.
Sep 30, 4:45 AM EDT: @petar-avramovic merged this pull request with Graphite.

petar-avramovic · 2024-09-30T08:49:36Z

/cherry-pick e9d12a6 83fe851

llvmbot · 2024-09-30T08:54:19Z

/cherry-pick e9d12a6 83fe851

Error: Command failed due to missing milestone.

arsenm · 2024-09-30T08:56:51Z

/cherry-pick e9d12a6 83fe851

…lvm#110256) Use i32 for offset instead of i16, this way it does not get interpreted as negative 16 bit offset. (cherry picked from commit 83fe851)

llvmbot · 2024-09-30T09:02:35Z

/pull-request #110470

…lvm#110256) Use i32 for offset instead of i16, this way it does not get interpreted as negative 16 bit offset. (cherry picked from commit 83fe851)

…lvm#110256) Use i32 for offset instead of i16, this way it does not get interpreted as negative 16 bit offset.

petar-avramovic mentioned this pull request Sep 27, 2024

AMDGPU: Add test for 16 bit unsigned scratch offsets #110255

Merged

petar-avramovic requested review from dstutt and jayfoad September 27, 2024 12:50

petar-avramovic marked this pull request as ready for review September 27, 2024 12:50

llvmbot added the backend:AMDGPU label Sep 27, 2024

jayfoad reviewed Sep 27, 2024

View reviewed changes

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp Show resolved Hide resolved

petar-avramovic force-pushed the users/petar-avramovic/scratch-large-offset-test branch from 41189ad to 43076c2 Compare September 27, 2024 16:03

petar-avramovic force-pushed the users/petar-avramovic/scratch-large-offset-fix branch from dcec930 to 2ea25b2 Compare September 27, 2024 16:04

jayfoad approved these changes Sep 27, 2024

View reviewed changes

arsenm approved these changes Sep 30, 2024

View reviewed changes

Base automatically changed from users/petar-avramovic/scratch-large-offset-test to main September 30, 2024 08:39

AMDGPU: Fix inst-selection of large scratch offsets with sgpr base

fd2e866

Use i32 for offset instead of i16, this way it does not get interpreted as negative 16 bit offset.

petar-avramovic force-pushed the users/petar-avramovic/scratch-large-offset-fix branch from 2ea25b2 to fd2e866 Compare September 30, 2024 08:41

petar-avramovic merged commit 83fe851 into main Sep 30, 2024
5 of 7 checks passed

petar-avramovic deleted the users/petar-avramovic/scratch-large-offset-fix branch September 30, 2024 08:45

arsenm added this to the LLVM 19.X Release milestone Sep 30, 2024

xgupta pushed a commit to xgupta/llvm-project that referenced this pull request Oct 4, 2024

AMDGPU: Fix inst-selection of large scratch offsets with sgpr base (l…

7dbfb43

…lvm#110256) Use i32 for offset instead of i16, this way it does not get interpreted as negative 16 bit offset.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AMDGPU: Fix inst-selection of large scratch offsets with sgpr base #110256

AMDGPU: Fix inst-selection of large scratch offsets with sgpr base #110256

petar-avramovic commented Sep 27, 2024

petar-avramovic commented Sep 27, 2024 •

edited

Loading

llvmbot commented Sep 27, 2024

jayfoad left a comment

petar-avramovic commented Sep 30, 2024 •

edited

Loading

petar-avramovic commented Sep 30, 2024

llvmbot commented Sep 30, 2024

arsenm commented Sep 30, 2024

llvmbot commented Sep 30, 2024

AMDGPU: Fix inst-selection of large scratch offsets with sgpr base #110256

AMDGPU: Fix inst-selection of large scratch offsets with sgpr base #110256

Conversation

petar-avramovic commented Sep 27, 2024

petar-avramovic commented Sep 27, 2024 • edited Loading

llvmbot commented Sep 27, 2024

jayfoad left a comment

Choose a reason for hiding this comment

petar-avramovic commented Sep 30, 2024 • edited Loading

Merge activity

petar-avramovic commented Sep 30, 2024

llvmbot commented Sep 30, 2024

arsenm commented Sep 30, 2024

llvmbot commented Sep 30, 2024

petar-avramovic commented Sep 27, 2024 •

edited

Loading

petar-avramovic commented Sep 30, 2024 •

edited

Loading