Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix HGCAL Layer Cluster times in heterogeneous workflows at HLT #45838

Merged
merged 1 commit into from
Nov 4, 2024

Conversation

AuroraPerego
Copy link
Contributor

PR description:

The times and timeErrors vectors used to store the rechits time were initialized with a size of 16, but then push_back was used to fill them leaving the first 16 entries as 0.
This PR removes the allocation of 16 elements in the constructor, using reserve() instead.
The change affects only heterogeneous workflows at the HLT.

PR validation:

Tested on wf 31834.492, the HGCAL Layer Clusters times are expected to change:
image
old -> pre fix
new -> post fix

FYI @rovere

@cmsbuild
Copy link
Contributor

cmsbuild commented Aug 30, 2024

cms-bot internal usage

@cmsbuild
Copy link
Contributor

@cmsbuild
Copy link
Contributor

A new Pull Request was created by @AuroraPerego for master.

It involves the following packages:

  • RecoLocalCalo/HGCalRecProducers (upgrade, reconstruction)

@cmsbuild, @jfernan2, @mandrenguyen, @srimanob, @subirsarkar can you please review it and eventually sign? Thanks.
@apsallid, @bsunanda, @cseez, @edjtscott, @felicepantaleo, @hatakeyamak, @lecriste, @lgray, @missirol, @pfs, @rovere, @sameasy, @sethzenz, @vandreev11, @youyingli this is something you requested to watch as well.
@antoniovilela, @mandrenguyen, @rappoccio, @sextonkennedy you are the release manager for this.

cms-bot commands are listed here

@rovere
Copy link
Contributor

rovere commented Aug 30, 2024

thanks @AuroraPerego for the investigation and the fix.

@rovere
Copy link
Contributor

rovere commented Aug 30, 2024

@cmsbuild please test

@cmsbuild
Copy link
Contributor

+1

Size: This PR adds an extra 20KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-4551d6/41199/summary.html
COMMIT: 95e7d74
CMSSW: CMSSW_14_2_X_2024-08-29-2300/el8_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/45838/41199/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • You potentially added 2 lines to the logs
  • Reco comparison results: 7 differences found in the comparisons
  • DQMHistoTests: Total files compared: 44
  • DQMHistoTests: Total histograms compared: 3328315
  • DQMHistoTests: Total failures: 6
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3328289
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 43 files compared)
  • Checked 193 log files, 163 edm output root files, 44 DQM output files
  • TriggerResults: no differences found

for (unsigned int i = 0; i < clusters->size(); ++i) {
times[i].reserve(16);
timeErrors[i].reserve(16);
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would it make sense to embed this loop into the previous one at lines 69 to 77? In this way, the heuristic is not even necessary and the size can be reserved correctly (i.e. the maximum possible, eventually some rechits will be discarded from the timing computation).

@rovere
Copy link
Contributor

rovere commented Sep 2, 2024

@cmsbuild please test

@rovere
Copy link
Contributor

rovere commented Sep 2, 2024

The bot seems to be a little stuck...

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 2, 2024

@felicepantaleo
Copy link
Contributor

test parameters:

  • enable = gpu
  • workflows_gpu = 31834.492
  • workflow_opts = -w upgrade
  • workflow_opts_gpu = -w upgrade

@felicepantaleo
Copy link
Contributor

@cmsbuild please test

@cmsbuild
Copy link
Contributor

-1

Failed Tests: RelVals-GPU RelVals-INPUT
Size: This PR adds an extra 12KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-4551d6/42479/summary.html
COMMIT: c733b85
CMSSW: CMSSW_14_2_X_2024-10-30-1100/el8_amd64_gcc12
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/45838/42479/install.sh to create a dev area with all the needed externals and cmssw changes.

  • DAS Queries: The DAS query tests failed, see the summary page for details.

RelVals-GPU

ValueError: Undefined workflows: 31834.492

RelVals-INPUT

  • 2024.0000012024.000001_RunJetMET02024D_10k/step1_dasquery.log
  • 2024.0010012024.001001_RunZeroBias2024D_10k/step1_dasquery.log
  • 2024.1000012024.100001_RunJetMET02024C_10k/step1_dasquery.log
Expand to see more relval errors ...
  • 2024.101001
  • 2024.000001
  • 2024.001001
  • 2024.100001
  • 2024.101001

Comparison Summary

Summary:

  • You potentially added 4 lines to the logs
  • Reco comparison results: 6 differences found in the comparisons
  • DQMHistoTests: Total files compared: 46
  • DQMHistoTests: Total histograms compared: 3569372
  • DQMHistoTests: Total failures: 425
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3568927
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 45 files compared)
  • Checked 201 log files, 171 edm output root files, 46 DQM output files
  • TriggerResults: no differences found

@felicepantaleo
Copy link
Contributor

test parameters:

  • enable = gpu
  • workflows_gpu = 31834.496
  • workflow_opts = -w upgrade
  • workflow_opts_gpu = -w upgrade

@felicepantaleo
Copy link
Contributor

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Nov 1, 2024

+1

Size: This PR adds an extra 12KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-4551d6/42527/summary.html
COMMIT: c733b85
CMSSW: CMSSW_14_2_X_2024-10-31-2300/el8_amd64_gcc12
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/45838/42527/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

GPU Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 25 differences found in the comparisons
  • DQMHistoTests: Total files compared: 8
  • DQMHistoTests: Total histograms compared: 154009
  • DQMHistoTests: Total failures: 710
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 153299
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 7 files compared)
  • Checked 29 log files, 35 edm output root files, 8 DQM output files
  • TriggerResults: no differences found

@jfernan2
Copy link
Contributor

jfernan2 commented Nov 3, 2024

+1

@Moanwar
Copy link
Contributor

Moanwar commented Nov 4, 2024

+Upgrade

@cmsbuild
Copy link
Contributor

cmsbuild commented Nov 4, 2024

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @antoniovilela, @mandrenguyen, @rappoccio, @sextonkennedy (and backports should be raised in the release meeting by the corresponding L2)

@mandrenguyen
Copy link
Contributor

+1

@cmsbuild cmsbuild merged commit 669f2de into cms-sw:master Nov 4, 2024
15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants