[SYCL][HIP] Add AMDGPU reflect pass to choose between safe and unsafe AMDGPU atomics #11467

hdelan · 2023-10-09T09:26:48Z

An AMDGPU reflect pass is needed to choose between safe and unsafe atomics
at the libclc level. In the long run this patch will be deleted, as work
is under way to ensure correct lowering of atomic instructions. See
patches:

llvm/llvm-project#85052
llvm/llvm-project#69229

This work is necessary because malloc shared atomics rely on PCIe atomics,
which can have patchy and unreliable support. We therefore want to be
able to choose at compile time between safe atomics implemented with
CAS (which PCIe should support) and unsafe atomics that rely on the
availability of the newest PCIe atomics, if malloc shared atomics are
desired.

This patch also changes the implementation of atomic_or and atomic_and so
that they can choose between the safe and unsafe versions based on the
AMDGPU reflect value.
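
As a rough host-side illustration of the safe/unsafe split described above (not the libclc code in this patch), the sketch below implements an atomic OR two ways with `std::atomic`: a CAS retry loop and a native read-modify-write. The names `kUseUnsafeAtomics`, `fetch_or_cas`, `fetch_or_native` and `atomic_or_select` are hypothetical, and the compile-time boolean only stands in for the value the reflect pass would fold.

```cpp
#include <atomic>
#include <cstdint>

// Hypothetical stand-in for the value the reflect pass would fold at compile
// time. This name is an assumption of the sketch, not taken from the patch.
constexpr bool kUseUnsafeAtomics = false;

// "Safe" path: emulate fetch_or with a compare-and-swap retry loop.
inline std::uint32_t fetch_or_cas(std::atomic<std::uint32_t> &target,
                                  std::uint32_t mask) {
  std::uint32_t expected = target.load(std::memory_order_relaxed);
  // On failure compare_exchange_weak reloads `expected`, so the loop simply
  // retries with the freshly observed value.
  while (!target.compare_exchange_weak(expected, expected | mask,
                                       std::memory_order_seq_cst,
                                       std::memory_order_relaxed)) {
  }
  return expected; // value held before the OR was applied
}

// "Unsafe" path: a single native read-modify-write operation.
inline std::uint32_t fetch_or_native(std::atomic<std::uint32_t> &target,
                                     std::uint32_t mask) {
  return target.fetch_or(mask, std::memory_order_seq_cst);
}

// Pick one implementation from a compile-time flag.
template <bool UseUnsafe = kUseUnsafeAtomics>
std::uint32_t atomic_or_select(std::atomic<std::uint32_t> &target,
                               std::uint32_t mask) {
  if (UseUnsafe)
    return fetch_or_native(target, mask);
  return fetch_or_cas(target, mask);
}
```

Both paths return the previous value and leave the same result in memory; they differ only in which hardware operation they depend on.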

libclc/amdgcn-amdhsa/libspirv/atomic/atomic_helpers.h

llvm/lib/Target/AMDGPU/AMDGPUReflect.cpp

ldrumm

I'm not entirely happy about introducing another reflect pass, but I appreciate that we'll likely get better perf than unconditionally prefetching.

What happens to __oclc_amdgpu_reflect if this pass isn't run?

llvm/lib/Target/AMDGPU/AMDGPUReflect.cpp

hdelan · 2023-10-11T10:06:16Z

> What happens to __oclc_amdgpu_reflect if this pass isn't run?

The function will remain in the module and a linking error will result at some stage, which is the same behaviour as the LLVM reflect pass.
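
To make that behaviour concrete, here is a toy model, not LLVM code and not the pass in this PR. It folds calls to `__oclc_amdgpu_reflect` into constants, and a helper reports whether any call is left, which is the situation that would later surface as a link error. The `ToyInst` type, the key `EXAMPLE_UNSAFE_ATOMICS`, and the choice to fold unknown keys to 0 are all assumptions made for illustration.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Toy instruction: either a call with one string argument or an integer
// constant. This is a hypothetical model, not LLVM IR.
struct ToyInst {
  bool isCall;
  std::string callee;   // meaningful when isCall is true
  std::string argument; // meaningful when isCall is true
  int constant;         // meaningful when isCall is false
};

const std::string kReflectName = "__oclc_amdgpu_reflect";

// Replace each reflect call with the constant configured for its argument.
// Unknown keys fold to 0 here; that choice is an assumption of this sketch.
// Returns the number of calls that were folded.
int foldReflectCalls(std::vector<ToyInst> &body,
                     const std::map<std::string, int> &values) {
  // Collect the calls first, then rewrite them, so the scan never runs over
  // instructions that are being modified.
  std::vector<std::size_t> worklist;
  for (std::size_t i = 0; i < body.size(); ++i)
    if (body[i].isCall && body[i].callee == kReflectName)
      worklist.push_back(i);

  for (std::size_t index : worklist) {
    auto it = values.find(body[index].argument);
    int value = (it == values.end()) ? 0 : it->second;
    body[index] = ToyInst{false, "", "", value};
  }
  return static_cast<int>(worklist.size());
}

// True if a reflect call is still present, i.e. the pass was not run.
bool hasUnresolvedReflect(const std::vector<ToyInst> &body) {
  for (const ToyInst &inst : body)
    if (inst.isCall && inst.callee == kReflectName)
      return true;
  return false;
}
```

If `foldReflectCalls` is never invoked, `hasUnresolvedReflect` stays true, mirroring the undefined-symbol case described above.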

- Change getNumOperands to arg size
- Move size check above the for loop

hdelan · 2024-04-22T16:39:50Z

@frasercrmck all your comments have been addressed. Thanks for the review!

In terms of function vs module pass: I understand that it might be slightly more optimal to make this a module pass. However, for comprehensibility I think this pass should not diverge too much from NVVMReflect, which is a function pass. Let me know if you think this is OK.

frasercrmck

LGTM

hdelan · 2024-04-23T14:12:06Z

Friendly ping @intel/dpcpp-tools-reviewers

ldrumm

Generally looks good modulo some minor code nits

llvm/lib/Target/AMDGPU/AMDGPUOclcReflect.cpp

- Use a vector of CallInsts instead of Instructions.
- Change assert(false) to report_fatal_error.

hdelan · 2024-04-23T14:55:38Z

Thanks @ldrumm, changes made.

llvm/lib/Target/AMDGPU/AMDGPUOclcReflect.cpp

sycl/test/check_device_code/hip/atomic/amdgpu_unsafe_atomics.cpp

llvm/test/CodeGen/AMDGPU/amdgpu-oclc-reflect.ll

- Use auto
- Use drop_back to remove null byte
- Replace hip_be with hip

Change opt test to use update_test_checks.py

hdelan requested review from a team as code owners October 9, 2023 09:26

hdelan requested a review from jchlanda October 9, 2023 09:26

hdelan changed the title ~~[SYCL][HIP] Hip use unsafe atomics flag~~ [SYCL][HIP] Add reflect pass to choose between safe and unsafe atomic xor Oct 9, 2023

hdelan mentioned this pull request Oct 9, 2023

[HIP] Revert add prefetch for USM hip allocations a6b8fa66b537753415d24076f… oneapi-src/unified-runtime#936

Merged

ldrumm reviewed Oct 9, 2023

ldrumm reviewed Oct 9, 2023

ldrumm requested changes Oct 9, 2023

hdelan requested a review from ldrumm October 11, 2023 10:06

hdelan closed this Oct 16, 2023

hdelan reopened this Mar 27, 2024

hdelan force-pushed the hip-use-unsafe-atomics-flag branch from 379d3a6 to e5657aa Compare March 27, 2024 16:44

hdelan force-pushed the hip-use-unsafe-atomics-flag branch from e5657aa to b4617ef Compare March 27, 2024 16:49

hdelan force-pushed the hip-use-unsafe-atomics-flag branch from b4617ef to 57ae613 Compare March 28, 2024 16:21

hdelan force-pushed the hip-use-unsafe-atomics-flag branch 3 times, most recently from 5a8c9ac to 9051df1 Compare March 28, 2024 16:41

hdelan force-pushed the hip-use-unsafe-atomics-flag branch 2 times, most recently from 1872497 to bad104b Compare March 28, 2024 17:06

hdelan force-pushed the hip-use-unsafe-atomics-flag branch from 0eda761 to 982ea7a Compare April 22, 2024 15:27

Typo

4d17f4f

hdelan force-pushed the hip-use-unsafe-atomics-flag branch from 982ea7a to 4d17f4f Compare April 22, 2024 15:27

Require the use of AMDGPU reflect

7f2771b

hdelan force-pushed the hip-use-unsafe-atomics-flag branch from df15e57 to 7f2771b Compare April 22, 2024 15:50

hdelan added 2 commits April 22, 2024 16:52

Restructure comment

695fcfa

Respond to comments

6d24772

- Change getNumOperands to arg size
- Move size check above the for loop

frasercrmck approved these changes Apr 23, 2024

ldrumm approved these changes Apr 23, 2024

Respond to comments

8eedac8

- Use a vector of CallInsts instead of Instructions.
- Change assert(false) to report_fatal_error.

Typo

c4ab1ed

hdelan force-pushed the hip-use-unsafe-atomics-flag branch from 6c5ef4e to c4ab1ed Compare April 23, 2024 14:57

AlexeySachkov approved these changes Apr 24, 2024

frasercrmck reviewed Apr 24, 2024

hdelan added 3 commits April 24, 2024 10:20

Respond to comments

bf5d2d0

- Use auto
- Use drop_back to remove null byte
- Replace hip_be with hip

Use update_test_checks.py

bbce2f9

Change opt test to use update_test_checks.py

Merge branch 'sycl' into hip-use-unsafe-atomics-flag

1a8e706

ldrumm merged commit 34135a3 into intel:sycl Apr 24, 2024
12 checks passed
