-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Don't recluster jets in HI miniAOD for Run 3 #38711
Conversation
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-38711/31009
|
A new Pull Request was created by @mandrenguyen (Matthew Nguyen) for master. It involves the following packages:
@jpata, @cmsbuild, @clacaputo can you please review it and eventually sign? Thanks. cms-bot commands are listed here |
Hi @mandrenguyen did you do any performance comparisons in |
@cmsbuild please test |
Hi @clacaputo We should see a large speedup of PAT, but that will not be super evident when it's run concurrently with reco (as in 159). The best place to see this would be in a stand-alone re-miniAOD workflow like 158.01. That wf is for 2018 though, where the reclustering is necessary, unless the 2018 PbPb data is re-reco'd. For Run 3, we don't yet have a re-miniAOD MC RelVal wf. How about I add a 159.01 to this PR for that purpose? For reasons that are a bit complicated to explain here, we plan to produce miniAOD at the HI T2 (not together with prompt reco), such that the PAT timing is an important consideration. Note also that without this PR, Run 3 HI re-miniAOD (i.e., not concurrently with reco) doesn't even run due to a collection label issue. I was actually trying to solve that issue when I realized that we should really remove the reclustering. |
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-0edc59/26242/summary.html Comparison SummarySummary:
|
I think it's a good idea |
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-38711/31059
|
Hi @mandrenguyen |
Hi @srimanob 159 already has miniAOD as part of the reconstruction. The issue is that it miniAOD was crashing when not run as part of the reco (i.e., "re-miniAOD"). While fixing that I noticed that we no longer need to recluster the jets, which was the most time consuming part of the 2018 re-miniAOD (wf 158.01). We do plan to run the miniAOD separately from reco in 2022, and the speed up from not reclustering the jets will be important. Does that clear it up? |
OK, this is more clear now. Thanks @mandrenguyen Just my 2 cents: (2) One thing that HI workflow does not reflect is the real production. I assume in the real situation, you will start from AOD, not GEN-SIM-RECO. I don't expect any issues, i.e. broken of MiniAOD when start from AOD. But it is quite simple to make it happen also in relvals. I let's you and PdmV decide on this. I am happy to sign this PR anyways (just after the trigger of 159.01 if needed). |
Yes, someone please trigger 159.1 :-) Regarding AOD as input, there is currently not a suitable dataset produced in relval. When the real 2022 PbPb data arrives, we will add a workflow like 140.5611, which tests AOD --> mAOD for 2018 data. In the meantime, I'm fairly confident that if it works from RECO, it will work from AOD, but we will test this manually prior to the data taking. |
test parameters:
|
@cmsbuild please test |
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-0edc59/26322/summary.html Comparison Summary@slava77 comparisons for the following workflows were not done due to missing matrix map:
Summary:
|
+Upgrade |
+reconstruction
|
+pdmv |
This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @qliphy, @rappoccio (and backports should be raised in the release meeting by the corresponding L2) |
+1
|
PR description:
Jets were being reclustered in the PAT step for heavy ions, as the jet algorithm had been updated between the time the last data were reconstructed (2018), and the first miniAOD campaign (2020).
For the upcoming data (era=pp_on_PbPb_run3), the miniAOD will be done concurrently with the reconstruction such that this is no longer necessary.
This PR removes the reclustering for that era, as well as legacy b-tagging, which is no longer needed. DeepCSV is activated in its place. Finally, an electron fix for 2018 reMiniAOD is also removed that is no longer necessary.
PR validation:
Tested 159, 140.5611 and 158.01