[OpenCL][RISCV] Support SPIR_KERNEL calling convention #69282

wangpc-pp · 2023-10-17T03:39:06Z

X86 supports this calling convention, but I couldn't find any special
handling for it, so I think we can just handle it via CC_RISCV.

This should fix #69197.
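To illustrate the idea, here is a minimal standalone sketch of the dispatch pattern the change relies on: a convention that needs no special handling simply joins the `C`/`Fast` cases and falls through to the default path. The `CallConv` and `Lowering` enums and the `classify` function are hypothetical stand-ins for illustration only, not LLVM's actual types or API.

```cpp
#include <cassert>

// Hypothetical stand-in for a few calling-convention IDs; NOT llvm::CallingConv.
enum class CallConv { C, Fast, GHC, SPIR_KERNEL, Other };

// Hypothetical result type: which lowering path a convention takes.
enum class Lowering { Default, GHC, Unsupported };

// Conventions grouped under the same case labels share the default path.
inline Lowering classify(CallConv cc) {
  switch (cc) {
  case CallConv::C:
  case CallConv::Fast:
  case CallConv::SPIR_KERNEL: // the kind of case this PR adds
    return Lowering::Default;
  case CallConv::GHC: // in the real code this path has extra subtarget checks
    return Lowering::GHC;
  default: // the real code reports "Unsupported calling convention" here
    return Lowering::Unsupported;
  }
}

// Usage: SPIR_KERNEL is treated exactly like the C convention.
inline void example() {
  assert(classify(CallConv::SPIR_KERNEL) == classify(CallConv::C));
}
```

Under this assumption no new argument-assignment logic is needed; the only change is to stop rejecting the convention.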

llvmbot · 2023-10-17T03:40:10Z

@llvm/pr-subscribers-backend-risc-v

Author: Wang Pengcheng (wangpc-pp)

Changes

X86 supports this calling convention, but I couldn't find any special
handling for it, so I think we can just handle it via CC_RISCV.

This should fix #69197.

Full diff: https://github.com/llvm/llvm-project/pull/69282.diff

2 Files Affected:

(modified) llvm/lib/Target/RISCV/RISCVISelLowering.cpp (+2)
(added) llvm/test/CodeGen/RISCV/spir-kernel-cc.ll (+86)

diff --git a/llvm/lib/Target/RISCV/RISCVISelLowering.cpp b/llvm/lib/Target/RISCV/RISCVISelLowering.cpp
index ed1f7b6c50a4d12..16bd2564867ba0e 100644
--- a/llvm/lib/Target/RISCV/RISCVISelLowering.cpp
+++ b/llvm/lib/Target/RISCV/RISCVISelLowering.cpp
@@ -29,6 +29,7 @@
 #include "llvm/CodeGen/MachineRegisterInfo.h"
 #include "llvm/CodeGen/TargetLoweringObjectFileImpl.h"
 #include "llvm/CodeGen/ValueTypes.h"
+#include "llvm/IR/CallingConv.h"
 #include "llvm/IR/DiagnosticInfo.h"
 #include "llvm/IR/DiagnosticPrinter.h"
 #include "llvm/IR/IRBuilder.h"
@@ -16997,6 +16998,7 @@ SDValue RISCVTargetLowering::LowerFormalArguments(
     report_fatal_error("Unsupported calling convention");
   case CallingConv::C:
   case CallingConv::Fast:
+  case CallingConv::SPIR_KERNEL:
     break;
   case CallingConv::GHC:
     if (!Subtarget.hasStdExtFOrZfinx() || !Subtarget.hasStdExtDOrZdinx())
diff --git a/llvm/test/CodeGen/RISCV/spir-kernel-cc.ll b/llvm/test/CodeGen/RISCV/spir-kernel-cc.ll
new file mode 100644
index 000000000000000..24f5c54021e3ae0
--- /dev/null
+++ b/llvm/test/CodeGen/RISCV/spir-kernel-cc.ll
@@ -0,0 +1,86 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
+; RUN: llc -mtriple=riscv32 -mattr=+f,+d < %s | FileCheck %s -check-prefix=RV32
+; RUN: llc -mtriple=riscv64 -mattr=+f,+d < %s | FileCheck %s -check-prefix=RV64
+
+; Check that the SPIR_KERNEL calling convention works
+
+declare dso_local i64 @_Z13get_global_idj(i32 noundef signext)
+
+define dso_local spir_kernel void @foo(ptr nocapture noundef readonly align 4 %a, ptr nocapture noundef readonly align 4 %b, ptr nocapture noundef writeonly align 4 %c) {
+; RV32-LABEL: foo:
+; RV32:       # %bb.0: # %entry
+; RV32-NEXT:    addi sp, sp, -16
+; RV32-NEXT:    .cfi_def_cfa_offset 16
+; RV32-NEXT:    sw ra, 12(sp) # 4-byte Folded Spill
+; RV32-NEXT:    sw s0, 8(sp) # 4-byte Folded Spill
+; RV32-NEXT:    sw s1, 4(sp) # 4-byte Folded Spill
+; RV32-NEXT:    sw s2, 0(sp) # 4-byte Folded Spill
+; RV32-NEXT:    .cfi_offset ra, -4
+; RV32-NEXT:    .cfi_offset s0, -8
+; RV32-NEXT:    .cfi_offset s1, -12
+; RV32-NEXT:    .cfi_offset s2, -16
+; RV32-NEXT:    mv s0, a2
+; RV32-NEXT:    mv s1, a1
+; RV32-NEXT:    mv s2, a0
+; RV32-NEXT:    li a0, 0
+; RV32-NEXT:    call _Z13get_global_idj
+; RV32-NEXT:    slli a0, a0, 2
+; RV32-NEXT:    add s2, s2, a0
+; RV32-NEXT:    flw fa5, 0(s2)
+; RV32-NEXT:    add s1, s1, a0
+; RV32-NEXT:    flw fa4, 0(s1)
+; RV32-NEXT:    fadd.s fa5, fa5, fa4
+; RV32-NEXT:    add a0, s0, a0
+; RV32-NEXT:    fsw fa5, 0(a0)
+; RV32-NEXT:    lw ra, 12(sp) # 4-byte Folded Reload
+; RV32-NEXT:    lw s0, 8(sp) # 4-byte Folded Reload
+; RV32-NEXT:    lw s1, 4(sp) # 4-byte Folded Reload
+; RV32-NEXT:    lw s2, 0(sp) # 4-byte Folded Reload
+; RV32-NEXT:    addi sp, sp, 16
+; RV32-NEXT:    ret
+;
+; RV64-LABEL: foo:
+; RV64:       # %bb.0: # %entry
+; RV64-NEXT:    addi sp, sp, -32
+; RV64-NEXT:    .cfi_def_cfa_offset 32
+; RV64-NEXT:    sd ra, 24(sp) # 8-byte Folded Spill
+; RV64-NEXT:    sd s0, 16(sp) # 8-byte Folded Spill
+; RV64-NEXT:    sd s1, 8(sp) # 8-byte Folded Spill
+; RV64-NEXT:    sd s2, 0(sp) # 8-byte Folded Spill
+; RV64-NEXT:    .cfi_offset ra, -8
+; RV64-NEXT:    .cfi_offset s0, -16
+; RV64-NEXT:    .cfi_offset s1, -24
+; RV64-NEXT:    .cfi_offset s2, -32
+; RV64-NEXT:    mv s0, a2
+; RV64-NEXT:    mv s1, a1
+; RV64-NEXT:    mv s2, a0
+; RV64-NEXT:    li a0, 0
+; RV64-NEXT:    call _Z13get_global_idj
+; RV64-NEXT:    sext.w a0, a0
+; RV64-NEXT:    slli a0, a0, 2
+; RV64-NEXT:    add s2, s2, a0
+; RV64-NEXT:    flw fa5, 0(s2)
+; RV64-NEXT:    add s1, s1, a0
+; RV64-NEXT:    flw fa4, 0(s1)
+; RV64-NEXT:    fadd.s fa5, fa5, fa4
+; RV64-NEXT:    add a0, s0, a0
+; RV64-NEXT:    fsw fa5, 0(a0)
+; RV64-NEXT:    ld ra, 24(sp) # 8-byte Folded Reload
+; RV64-NEXT:    ld s0, 16(sp) # 8-byte Folded Reload
+; RV64-NEXT:    ld s1, 8(sp) # 8-byte Folded Reload
+; RV64-NEXT:    ld s2, 0(sp) # 8-byte Folded Reload
+; RV64-NEXT:    addi sp, sp, 32
+; RV64-NEXT:    ret
+entry:
+  %call = tail call i64 @_Z13get_global_idj(i32 noundef signext 0)
+  %sext = shl i64 %call, 32
+  %idxprom = ashr exact i64 %sext, 32
+  %arrayidx = getelementptr inbounds float, ptr %a, i64 %idxprom
+  %0 = load float, ptr %arrayidx, align 4
+  %arrayidx2 = getelementptr inbounds float, ptr %b, i64 %idxprom
+  %1 = load float, ptr %arrayidx2, align 4
+  %add = fadd float %0, %1
+  %arrayidx4 = getelementptr inbounds float, ptr %c, i64 %idxprom
+  store float %add, ptr %arrayidx4, align 4
+  ret void
+}
\ No newline at end of file
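
For readers less familiar with LLVM IR, the body of `@foo` in the test can be read roughly as the following host-side C++ sketch for a single work item. The names `signExtendLow32` and `fooWorkItem` are hypothetical, and the global ID is passed in as a parameter instead of calling `get_global_id`; this is an interpretation of the IR above, not code from the PR.

```cpp
#include <cassert>
#include <cstdint>

// Mirrors `shl i64 %call, 32` followed by `ashr exact i64 %sext, 32`:
// keep the low 32 bits of the ID and sign-extend them back to 64 bits.
// (On RV64 this is the `sext.w` in the expected output.)
inline std::int64_t signExtendLow32(std::uint64_t globalId) {
  return static_cast<std::int32_t>(static_cast<std::uint32_t>(globalId));
}

// Hypothetical equivalent of the kernel body: c[i] = a[i] + b[i].
inline void fooWorkItem(const float *a, const float *b, float *c,
                        std::uint64_t globalId) {
  const std::int64_t idx = signExtendLow32(globalId);
  c[idx] = a[idx] + b[idx];
}

// Usage: compute one element of the vector sum.
inline void exampleWorkItem() {
  const float a[2] = {1.0f, 2.0f};
  const float b[2] = {0.5f, 0.25f};
  float c[2] = {0.0f, 0.0f};
  fooWorkItem(a, b, c, 1);
  assert(c[1] == 2.25f);
}
```

The three pointer arguments arrive in `a0`-`a2` in the expected assembly, matching the ordinary C calling convention, which is what the test is meant to confirm.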

github-actions · 2023-10-17T03:50:04Z

✅ With the latest revision this PR passed the C/C++ code formatter.

dtcxzyw

LGTM. Could you simplify the test case?

llvmbot added the backend:RISC-V label Oct 17, 2023

wangpc-pp requested review from asb, dtcxzyw and topperc October 17, 2023 03:40

sunshaoce reviewed Oct 17, 2023

llvm/test/CodeGen/RISCV/spir-kernel-cc.ll (outdated, resolved)

dtcxzyw mentioned this pull request Oct 18, 2023

OpenCL kernel (.cl) to RiscV assembly : "LLVM ERROR: Unsupported calling convention" #69197

Closed

dtcxzyw approved these changes Oct 18, 2023

wangpc-pp added 4 commits October 18, 2023 17:51

[OpenCL][RISCV] Support SPIR_KERNEL calling convention

389fbe7

X86 supports this calling convention but I don't find any special handling, so I think we can just handle it via CC_RISCV. This should fix llvm#69197.

fixup! [OpenCL][RISCV] Support SPIR_KERNEL calling convention

9655564

Remove unnecessary include

fixup! [OpenCL][RISCV] Support SPIR_KERNEL calling convention

280e142

Add new line

fixup! [OpenCL][RISCV] Support SPIR_KERNEL calling convention

ce143a0

Simplify test

wangpc-pp force-pushed the main-riscv-opencl-cc branch from e1178cc to ce143a0 Compare October 18, 2023 10:02

fixup! [OpenCL][RISCV] Support SPIR_KERNEL calling convention

3b209ac

Remove -mattr

wangpc-pp merged commit 654a3a3 into llvm:main Oct 19, 2023
2 checks passed

wangpc-pp deleted the main-riscv-opencl-cc branch October 19, 2023 03:00

madhur13490 mentioned this pull request Oct 20, 2023

Revert commit ba8565fbcb975e2d067ce3ae5a7dbaae4953edd3 madhur13490/llvm-project#3

Closed

banach-space mentioned this pull request Oct 24, 2023

[mlir][vector] Add scalable vectors to tests for vector.contract #70039

Merged
