Add specialisations of make_device_view with a Queue argument #45572

Merged: 3 commits, Sep 26, 2024

Conversation

fwyzard
Contributor

@fwyzard fwyzard commented Jul 29, 2024

PR description:

Add specialisations of make_device_view with a Queue argument instead of a Device.

These functions are only for convenience: their behaviour is identical to those taking a Device.

PR validation:

None.

@cmsbuild
Contributor

cmsbuild commented Jul 29, 2024

cms-bot internal usage

@cmsbuild
Contributor

A new Pull Request was created by @fwyzard for master.

It involves the following packages:

  • HeterogeneousCore/AlpakaInterface (heterogeneous)

@cmsbuild, @fwyzard, @makortel can you please review it and eventually sign? Thanks.
@makortel, @missirol, @rovere this is something you requested to watch as well.
@antoniovilela, @mandrenguyen, @rappoccio, @sextonkennedy you are the release manager for this.

cms-bot commands are listed here

@cmsbuild
Contributor

Pull request #45572 was updated. @cmsbuild, @fwyzard, @jfernan2, @makortel, @mandrenguyen, @srimanob, @subirsarkar can you please check and sign again.

@fwyzard
Contributor Author

fwyzard commented Jul 29, 2024

please test

@fwyzard
Contributor Author

fwyzard commented Jul 29, 2024

@makortel this seems to be useful to reduce the calls to alpaka::getDev(queue).
However, it has to be clear to the users that

make_device_view(queue, ...);

is just a shorthand for

make_device_view(alpaka::getDev(queue), ...);

and does not introduce any asynchronous behaviour.

Do you think this is OK, or potentially too confusing?
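
For illustration, the equivalence described above can be sketched with a small standalone mock. Everything in it is a hypothetical stand-in: `Device`, `Queue`, `DeviceView`, the free function `getDev` (standing in for `alpaka::getDev`) and both `make_device_view` signatures are assumptions made for this example, not the actual code in HeterogeneousCore/AlpakaInterface. The only point is that the queue overload extracts the device and forwards, so nothing is enqueued.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-ins for the alpaka device and queue types; the real
// types and the real make_device_view signatures are not reproduced here.
struct Device {
  int id;
};

struct Queue {
  Device device;  // the device this queue submits work to
};

// Mock of alpaka::getDev(queue): returns the device associated with a queue.
inline Device getDev(Queue const& queue) { return queue.device; }

// A non-owning view over memory that already exists on a device.
template <typename T>
struct DeviceView {
  Device device;
  T* data;
  std::size_t size;
};

// Overload taking a Device: wraps an existing pointer, no allocation or copy.
template <typename T>
DeviceView<T> make_device_view(Device const& device, T* data, std::size_t size) {
  return DeviceView<T>{device, data, size};
}

// Convenience overload taking a Queue: it only extracts the device and
// forwards, so nothing is enqueued and no asynchronous work is started.
template <typename T>
DeviceView<T> make_device_view(Queue const& queue, T* data, std::size_t size) {
  return make_device_view(getDev(queue), data, size);
}

// Usage: both calls describe the same device and the same memory.
inline void example() {
  float buffer[4] = {};
  Queue queue{Device{0}};
  auto fromQueue = make_device_view(queue, buffer, 4);
  auto fromDevice = make_device_view(getDev(queue), buffer, 4);
  assert(fromQueue.device.id == fromDevice.device.id);
  assert(fromQueue.data == fromDevice.data);
  assert(fromQueue.size == fromDevice.size);
}
```

In this mock the queue is used only to look up its device, which mirrors the statement that the shorthand does not introduce any asynchronous behaviour.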

@cmsbuild
Contributor

-1

Failed Tests: Build HeaderConsistency ClangBuild
Size: This PR adds an extra 52KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-75d89d/40655/summary.html
COMMIT: 75cd6ac
CMSSW: CMSSW_14_1_X_2024-07-28-2300/el8_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/45572/40655/install.sh to create a dev area with all the needed externals and cmssw changes.

Build

I found a compilation error when building:

Copying tmp/el8_amd64_gcc12/src/HeterogeneousTest/AlpakaWrapper/src/alpaka/HeterogeneousTestAlpakaWrapperROCmAsync/libHeterogeneousTestAlpakaWrapperROCmAsync_rocm.a to productstore area:
Copying tmp/el8_amd64_gcc12/src/RecoParticleFlow/PFRecHitProducer/plugins/RecoParticleFlowPFRecHitProducersPluginsPortableROCmAsync/libRecoParticleFlowPFRecHitProducersPluginsPortableROCmAsync_rocm.a to productstore area:
Copying tmp/el8_amd64_gcc12/src/DataFormats/TrackingRecHitSoA/test/Hits_testROCmAsync/libHits_testROCmAsync_rocm.a to productstore area:
cp: cannot stat 'tmp/el8_amd64_gcc12/src/DataFormats/TrackingRecHitSoA/test/Hits_testROCmAsync/libHits_testROCmAsync_rocm.a': No such file or directory
>> Deleted: tmp/el8_amd64_gcc12/src/DataFormats/TrackingRecHitSoA/test/Hits_testROCmAsync/libHits_testROCmAsync_rocm.a
gmake: *** [config/SCRAM/GMake/Makefile.rules:1864: tmp/el8_amd64_gcc12/src/DataFormats/TrackingRecHitSoA/test/Hits_testROCmAsync/libHits_testROCmAsync_rocm.a] Error 1
Copying tmp/el8_amd64_gcc12/src/DataFormats/SiPixelClusterSoA/test/Clusters_testROCmAsync/libClusters_testROCmAsync_rocm.a to productstore area:
Copying tmp/el8_amd64_gcc12/src/DataFormats/SiPixelDigiSoA/test/DigiErrors_testROCmAsync/libDigiErrors_testROCmAsync_rocm.a to productstore area:
Copying tmp/el8_amd64_gcc12/src/DataFormats/TrackSoA/test/TrackSoAHeterogeneousAlpaka_testROCmAsync/libTrackSoAHeterogeneousAlpaka_testROCmAsync_rocm.a to productstore area:
Copying tmp/el8_amd64_gcc12/src/DataFormats/VertexSoA/test/ZVertexSoA_testROCmAsync/libZVertexSoA_testROCmAsync_rocm.a to productstore area:
Copying tmp/el8_amd64_gcc12/src/RecoLocalCalo/HcalRecProducers/plugins/RecoLocalCaloHcalRecProducersPluginsPortableROCmAsync/libRecoLocalCaloHcalRecProducersPluginsPortableROCmAsync_rocm.a to productstore area:


Clang Build

I found a compilation error while trying to compile with clang. Command used:

USER_CUDA_FLAGS='--expt-relaxed-constexpr' USER_CXXFLAGS='-Wno-register -fsyntax-only' scram build -k -j 32 COMPILER='llvm compile'

>> Entering Package RecoLocalCalo/EcalRecProducers
>> Entering Package RecoLocalCalo/HGCalRecProducers
>> Entering Package RecoLocalTracker/SiPixelClusterizer
>> Entering Package RecoTracker/PixelSeeding
>> Compile sequence completed for CMSSW CMSSW_14_1_X_2024-07-28-2300
gmake: *** [There are compilation/build errors. Please see the detail log above.] Error 1
+ eval scram build outputlog '&&' '(python3' /data/cmsbld/jenkins/workspace/ib-run-pr-tests/cms-bot/buildLogAnalyzer.py --logDir /data/cmsbld/jenkins/workspace/ib-run-pr-tests/CMSSW_14_1_X_2024-07-28-2300/tmp/el8_amd64_gcc12/cache/log/src '||' 'true)'
++ scram build outputlog
>> Entering Package DataFormats/TrackingRecHitSoA
Entering library rule at DataFormats/TrackingRecHitSoA
>> Compiling  src/DataFormats/TrackingRecHitSoA/src/classes.cc


@makortel
Contributor

> Do you think this is OK, or potentially too confusing ?

Hmm. On a quick thought, the potential for misunderstanding is certainly there. I need to digest this a bit more before concluding whether it would be too confusing. In any case, documenting the behavior could help.

One possible direction to think about would be to add, e.g., an async postfix to all functions that may incur asynchronous behavior on some backend(s).

@makortel makortel mentioned this pull request Aug 2, 2024
3 tasks
@cmsbuild
Contributor

Pull request #45572 was updated. @cmsbuild, @fwyzard, @jfernan2, @makortel, @mandrenguyen, @srimanob, @subirsarkar can you please check and sign again.

@fwyzard
Contributor Author

fwyzard commented Sep 12, 2024

Updated the documentation and rebased on top of a more recent release.

@fwyzard
Contributor Author

fwyzard commented Sep 12, 2024

please test

@fwyzard
Contributor Author

fwyzard commented Sep 12, 2024

+heterogeneous

@cmsbuild
Contributor

+1

Size: This PR adds an extra 32KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-75d89d/41478/summary.html
COMMIT: 0e26cb2
CMSSW: CMSSW_14_2_X_2024-09-11-2300/el8_amd64_gcc12
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/45572/41478/install.sh to create a dev area with all the needed externals and cmssw changes.

@jfernan2
Contributor

+1

@fwyzard
Contributor Author

fwyzard commented Sep 26, 2024

@cms-sw/upgrade-l2 could you sign this, please?

Alternatively, @cms-sw/orp-l2 could you bypass the upgrade signature and merge this?
It has now been pending for two weeks.

@srimanob
Contributor

+Upgrade

@cmsbuild
Contributor

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @rappoccio, @sextonkennedy, @antoniovilela, @mandrenguyen (and backports should be raised in the release meeting by the corresponding L2)

@mandrenguyen
Contributor

+1

@cmsbuild cmsbuild merged commit eaefa4c into cms-sw:master Sep 26, 2024
14 checks passed
@fwyzard fwyzard deleted the alpaka_make_device_view branch September 26, 2024 22:42