Event by event random choice of one helicity #570

valassi · 2022-12-14T22:46:22Z

Hi @oliviermattelaer @roiser, I have finally ~completed the patch for the random choice of helicity (#403).

This MR does NOT include the random choice of color, which I have not yet done (and may be a bit more complex).

The MR is essentially complete, including code regeneration for all processes. I am now rerunning the full tests to check performance and functionality.

./tmad/teeMadX.sh -ggtt +10x -makeclean
./tmad/teeMadX.sh -ggtt +10x -makeclean -fltonly
./tmad/teeMadX.sh -ggtt +10x -makeclean -mixonly

Note that the bridge throughputs are a bit lower as there are many more copies of rnd/sel hel/col

…st 4 commits

Revert "[lhe] rerun tput tee ggtt with multichannel enabled, all ok, as slow as "new" mulrichannel enabled madgraph5#568"
This reverts commit 445b321.

Revert "[lhe] rerun tput ggtt mad with "#undef MGONGPU_SUPPORTS_MULTICHANNEL" and new code madgraph5#568 - ok but cuda is 30% slower"
This reverts commit 3458706.

Revert "[lhe] in ggtt.mad prototype a simplification of multichannel ifdefs (madgraph5#568) - will revert it and do it separately"
This reverts commit 63e04b9.

Revert "[lhe] rebuild ggtt with #undef MGONGPU_SUPPORTS_MULTICHANNEL and rerun tput, looks good (a bit faster?)"
This reverts commit b9f534b.

./CODEGEN/generateAndCompare.sh gg_tt --mad --nopatch
git diff --no-ext-diff -R gg_tt.mad/Source/dsample.f gg_tt.mad/Source/genps.inc gg_tt.mad/Source/vector.inc gg_tt.mad/SubProcesses/makefile > CODEGEN/MG5aMC_patches/PROD/patch.common
git diff --no-ext-diff -R gg_tt.mad/SubProcesses/P1_gg_ttx/auto_dsig1.f gg_tt.mad/SubProcesses/P1_gg_ttx/driver.f gg_tt.mad/SubProcesses/P1_gg_ttx/matrix1.f > CODEGEN/MG5aMC_patches/PROD/patch.P1
git checkout gg_tt.mad

valassi · 2022-12-14T22:58:34Z

A few comments:

One of the most tedious parts was adding the new function arguments to all function calls. I did this not only for helicity (#403, "Randomly pick one helicity for each event written to file") but also for color (#402, "Randomly pick one color for each event written to file (jamp2 handling)"). For both helicity and color, random numbers go in and the selected helicity and color come out.
Correspondingly, I also had to add buffers for helicity and color.
As discussed in #415 ("Move the loop over npagV from calculate_wavefunction to sigmakin?"), the C++ implementation required inverting the order of the loops over event pages (SIMD vectors) and over helicities.
The recently added support for 'mixed precision' mode was an additional complication in the C++ implementation. Some simplifications may be appropriate in the future.
I have now re-enabled in madX.sh the event-by-event comparison of the selected helicity in the LHE files for Fortran and cudacpp. The "dummyHelicity" script is no longer needed.
One additional complication (#569, "Different helicity numbering in fortran and cudacpp?") was that, out of the box, the order of helicities was different in Fortran and cudacpp, so the helicity index was initially wrong. This is now fixed.
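
To make the data flow above concrete, here is a minimal sketch of an event-by-event helicity choice. It assumes the selection compares the random number to the normalised running sum of the per-helicity |M|^2 contributions. The names `selectHelicity`, `selectHelicities` and `meHelPerEvent` are hypothetical, and the 1-based return value is an assumption (Fortran-style numbering). The real sigmaKin code in CPPProcess.cc is vectorised and differs in detail.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (NOT the CPPProcess.cc code): pick one helicity for one event.
// meHel[ihel] is the non-negative |M|^2 contribution of helicity ihel,
// rndhel is a uniform random number, expected in [0,1).
// Returns a 1-based helicity index (assumed numbering), or 0 if all contributions vanish.
int selectHelicity( const std::vector<double>& meHel, double rndhel )
{
  double total = 0;
  for( double me : meHel ) total += me;
  if( total <= 0 ) return 0; // nothing to choose from
  double running = 0;
  for( std::size_t ihel = 0; ihel < meHel.size(); ihel++ )
  {
    running += meHel[ihel];
    // Helicity ihel is chosen with probability meHel[ihel] / total
    if( rndhel < running / total ) return static_cast<int>( ihel ) + 1;
  }
  return static_cast<int>( meHel.size() ); // only reached if rndhel >= 1
}

// Events in the outer loop, helicities in the inner loop (inside selectHelicity):
// one random number in (rndhel) and one selected helicity out (selhel) per event.
void selectHelicities( const std::vector<std::vector<double>>& meHelPerEvent,
                       const std::vector<double>& rndhel,
                       std::vector<int>& selhel )
{
  assert( rndhel.size() == meHelPerEvent.size() );
  selhel.resize( meHelPerEvent.size() );
  for( std::size_t ievt = 0; ievt < meHelPerEvent.size(); ievt++ )
    selhel[ievt] = selectHelicity( meHelPerEvent[ievt], rndhel[ievt] );
}
```

With the events in the outer loop, all helicity contributions of an event are available before its selection is made, which is one way to see why the loop order matters for this feature.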

While writing this I see that some tests failed in the CI, so there is more work to be done...

valassi · 2022-12-14T23:07:33Z

Peculiar: only some tests fail. I can reproduce this interactively.

eemumu fails
ggtt (where I developed the code) succeeds
ggttg also succeeds
ggttgg fails

The failures are that events with given momenta give MEs which are not those in the reference files?...

valassi · 2022-12-14T23:08:39Z

Ah voilà, as I thought: I had forgotten some hardcoded parameters in the codegen...

epochX/cudacpp/gg_ttgg.mad/SubProcesses/P1_gg_ttxgg/fcheck_sa.f /tmp/avalassi/jgigIx_fcheck_sa.f e43625dcf872b3faa292e8b5df57ecf103fdd8f0 100644 epochX/cudacpp/gg_ttgg.mad/SubProcesses/P1_gg_ttxgg/fcheck_sa.f c0bbf580efa6892ab11c9a340b82f15338a2ce7c 100644
7c7
<       PARAMETER(NEVTMAX=2048*256, NEXTERNAL=6, NP4=4)
---
>       PARAMETER(NEVTMAX=2048*256, NEXTERNAL=4, NP4=4)
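
For illustration only, the idea of taking such a parameter from the generated process rather than from a hardcoded literal could be sketched as follows. The function name `fcheckParameterLine` is hypothetical and this is not the CODEGEN plugin code. It only shows that the value must be derived per process: gg → t t̄ has four external particles, while gg → t t̄ g g has six.

```cpp
#include <string>

// Hypothetical sketch (NOT the CODEGEN plugin): build the PARAMETER line of
// fcheck_sa.f from the number of external particles of the generated process,
// instead of copying a literal from the process where the code was developed.
std::string fcheckParameterLine( int nexternal )
{
  return "      PARAMETER(NEVTMAX=2048*256, NEXTERNAL=" + std::to_string( nexternal ) + ", NP4=4)";
}
```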

valassi · 2022-12-14T23:21:34Z

OK, I fixed the bug and regenerated all processes. ggttgg now looks better interactively.

valassi · 2022-12-15T17:10:36Z

This MR is now complete. The CI tests are succeeding, so I will self-merge.

valassi added 30 commits December 12, 2022 19:45

[lhe] in gg_tt.mad, invert order of MES and CHANID in FBRIDGESEQUENCE

7c811b7

[lhe] in gg_tt.mad, invert order of MES and CHANID in Bridge cpu/gpu_…

42ffaf1

…sequence

[lhe] in gg_tt.mad add RNDHEL, RNDCOL, SELHEL, SELCOL to the FBRIDGES…

89fc36b

…EQUENCE function

[lhe] in gg_tt.mad rename ALL 'BufferRandomNumbers' as 'BufferRndNumM…

b4999cb

…omenta' (prepare to add RndNum for color and helicity)

[lhe] in gg_tt.mad rename sizePerEventRandomNumbers as sizePerEventRn…

ca721d4

…dNumMomenta

[lhe] in gg_tt.mad add typedefs for BufferRndNumHelicity and BufferRn…

d3a44c8

…dNumColor

[lhe] in gg_tt.mad, add memory buffers with one fptype per event (com…

9ea7278

…mented out for now) There are seven other types that are equivalent to this one and could be removed...

[lhe] in gg_tt.mad, (harmless) BUG FIX in Bridge.h, PinnedHostBufferM…

b0d763b

…atrixElements was used instead of PinnedHostBufferGs HOWEVER this was harmless because the two types are the same! This is an example why BufferOneFp might simplify the code...

[lhe] in gg_tt.mad add rndhel and rndcol to cpu/gpu_sequence

f4f6c7f

[lhe] in gg_tt.mad rename all rnarray as rndmom (except in generic Ra…

9b93421

…ndomNumberKernels)

[lhe] in gg_tt.mad clean up files with clang-format

bfce75d

[lhe] in gg_tt.mad add selhel and selcol to cpu/gpu_sequence

fc2d0ed

[lhe] in gg_tt.mad Bridge.h copy sel/rnd hel/col across fortran, cpp …

4ac97f0

…and device Also improve the existing copies of gs and mes

[lhe] in gg_tt.mad Bridge.h rename GsC as Gs and MEsC as MEs (only mo…

1601b7e

…menta are different in F and C)

[lhe] in gg_tt.mad add rnd/sel/hel/col to sigmaKin function signature…

1135c0b

… (and reorder other arguments)

[lhe] in gg_tt.mad MatrixElementKernels.cc FINALLY simplify kernel ca…

d750108

…lls with/without shared memory...

[lhe] in gg_tt.mad reorder also the arguments of calculate_wavefuncti…

2441af8

…ons as done in sigmaKin

[lhe] in gg_tt.mad fix a segfault in "./gcheck.exe -p 256 32 1 --bridge"

415097b

[lhe] in gg_tt.mad CPPProcess.cc simplify the ifdef CUDA/C++ for call…

41a1be2

…ing calculate_wavefunction in sigmakin

[lhe] invert helicity and SIMD page loop in sigmaKin (madgraph5#415),…

d66ce86

… to simplify helicity choice (madgraph5#403) Disable OMP loops for the moment

[lhe] rerun tmad ggtt dfm and check all looks good

ea0189b

./tmad/teeMadX.sh -ggtt +10x -makeclean -mix

[lhe] rerun ggtt tput (with #define MGONGPU_SUPPORTS_MULTICHANNEL 1),…

fc3d8f7

… looks good

[lhe] rebuild ggtt with #undef MGONGPU_SUPPORTS_MULTICHANNEL and reru…

b9f534b

…n tput, looks good (a bit faster?)

[lhe] in ggtt.mad prototype a simplification of multichannel ifdefs (m…

63e04b9

…adgraph5#568) - will revert it and do it separately

[lhe] rerun tput ggtt mad with "#undef MGONGPU_SUPPORTS_MULTICHANNEL"…

3458706

… and new code madgraph5#568 - ok but cuda is 30% slower I guess the assert is not helping? Anyway, will revert and do it later, maybe.

[lhe] rerun tput tee ggtt with multichannel enabled, all ok, as slow …

445b321

…as "new" mulrichannel enabled madgraph5#568 Will revert the last changes anyway and move madgraph5#568 to a separate MR if any...

[lhe] rerun ggtt tmad again for reference

7537dad

./tmad/teeMadX.sh -ggtt +10x -makeclean -mix

[lhe] in gg_tt.mad CPPProcess.cc, vectorize sigmaKin initialisation/f…

6ed7e59

…inalisation

valassi added 11 commits December 14, 2022 23:49

[lhe] in gg_tt.mad clang-format CPPProcess.cc

a19f195

[lhe] madgraph5#403 backport random helicity choice from gg_tt.mad to…

b937c3f

… CODEGEN (CPPProcess.cc fragments)

[lhe] madgraph5#403 backport random helicity choice from gg_tt.mad to…

638b287

… CODEGEN (CPPProcess.h fragments)

[lhe] madgraph5#569 and madgraph5#403 backport order of helicity from…

c30aeaf

… gg_tt.mad to CODEGEN The Fortran codegen was using allow_reverse=True and the cpp codegen allow_reverse=False. Now moved to allow_reverse=True also in cudacpp.

[lhe] regenerate gg_tt.mad, check that all is stable

10a693d

[lhe] in gg_tt.mad check_sa.cc add a comments that rndhel and rndcol …

373d738

…remain 0 (no lhe files! madgraph5#403 madgraph5#402)

[lhe] backport the last change in check_sa.cc from gg_tt.mad to CODEGEN

51039ab

[lhe] regenerate gg_tt.mad again, all ok

d42edf1

[lhe] regenerate all other 4 processes mad with random choice of heli…

2ee85ce

…city madgraph5#403

[lhe] regenerate all 6 processes sa with random choice of helicity ma…

614bd2d

…dgraph5#403

valassi force-pushed the lhe branch from a600448 to 614bd2d Compare December 14, 2022 22:50

This was linked to issues Dec 14, 2022

Randomly pick one helicity for each event written to file #403

Closed

Different helicity numbering in fortran and cudacpp? #569

Closed

[lhe] remove the dummyHelicity script that is no longer needed

4e13aa5

valassi added 3 commits December 15, 2022 00:14

[lhe] BUG FIX in CODEGEN - I had forgotten some hardcoded parameters …

4fd55d6

…that must instead come from codegen

[lhe] regenerate 5 processes mad after bug fix in codegen

3dc98f6

[lhe] regenerate 6 processes sa after the bug fix in codegen

1af94d9

valassi added 3 commits December 15, 2022 17:58

[lhe] rerun 15 tmad allTees (initially only 12, later added the 3 ggt…

68bdeb2

…tggg too)

[lhe] fix tput/allTees to ensure that -short does not run any ggttggg…

7cb9c6b

… (slow) Also reorder the arguments to keep makeclean and sa last (easier to copy/paste)

[lhe] ** COMPLETE LHE PART 3 (RANDOM HELICITY) ** rerun tput 54 allTe…

ee6cc16

…es short (skip 6 ggttggg to gain time) ./tput/allTees.sh -short This completes the random helicity choice madgraph5#403

valassi merged commit 3780502 into madgraph5:master Dec 15, 2022

valassi mentioned this pull request Dec 16, 2022

Retry OMP multithreading in cudacpp (and prototype custom multithreading, and compare to MP) - suboptimal results in ggttgg (Dec 2022) #575

Open
