perf: cache yield_stdev using spec and interpcodes of model #322

lhenkelm · 2022-02-04T17:37:14Z

This implements the changes to hashing discussed in #315.

It was necessary to modify one test that was directly looking up cached results in the yield_stdev cache.

* caching for yield uncertainty calculation now based on model specification and interpolation codes
* caching now works when re-creating pyhf.pdf.Model objects

The cache key is built from the model specification and interpolation codes instead of the model object itself. In effect, the cache treats models as values rather than as object identities. In current pyhf (0.6.4 and below), Model inherits __hash__ from object, so caching on model instances treats equal copies as different models.
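The difference between identity-based and value-based cache keys can be sketched with the standard library alone. `ToyModel` and `toy_model_key` below are hypothetical stand-ins invented for illustration; they are not the pyhf or cabinetry API, and the spec and interpolation code values are made up.

```python
import json
from typing import Any, Dict, Tuple


class ToyModel:
    """Hypothetical stand-in for a model class that defines no __eq__ or __hash__."""

    def __init__(self, spec: Dict[str, Any], interpcodes: Dict[str, str]) -> None:
        self.spec = spec
        self.interpcodes = interpcodes


def toy_model_key(model: ToyModel) -> Tuple[str, Tuple[Tuple[str, str], ...]]:
    """Build a hashable key from the model's contents (illustrative helper)."""
    # sort_keys makes the serialized spec independent of dict insertion order
    spec_str = json.dumps(model.spec, sort_keys=True)
    interp = tuple(sorted(model.interpcodes.items()))
    return spec_str, interp


spec = {"channels": [{"name": "SR", "samples": [{"name": "sig", "data": [1.0, 2.0]}]}]}
codes = {"normsys": "code4", "histosys": "code4p"}

model_a = ToyModel(spec, codes)
model_b = ToyModel(dict(spec), dict(codes))  # re-created model with the same content

# Keyed on the instance: the hash inherited from object is identity-based.
identity_cache = {model_a: "expensive result"}
# Keyed on the content: equal models map to the same entry.
value_cache = {toy_model_key(model_a): "expensive result"}

hit_by_identity = model_b in identity_cache  # cache miss for the copy
hit_by_value = toy_model_key(model_b) in value_cache  # cache hit for the copy
```

With the value-based key, re-creating a model no longer forces the expensive result to be recomputed.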

alexander-held

Thanks a lot for implementing this! This is super useful to prevent accidental re-computation of computationally expensive results.

src/cabinetry/model_utils.py

codecov · 2022-02-04T18:41:13Z

Codecov Report

Merging #322 (766573b) into master (40120c2) will not change coverage.
The diff coverage is 100.00%.

@@            Coverage Diff            @@
##            master      #322   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           23        23           
  Lines         1881      1889    +8     
  Branches       305       306    +1     
=========================================
+ Hits          1881      1889    +8

| Impacted Files | Coverage Δ |
|---|---|
| src/cabinetry/model_utils.py | `100.00% <100.00%> (ø)` |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

alexander-held · 2022-02-07T14:08:43Z

Thank you very much for the PR, this all looks good to me. I'd like to see whether we can get a resolution of scikit-hep/pyhf#1762 (comment), but as far as I can tell this has no impact on the equality comparison done here (where we care about equal expected_data output). It may be worth adding another sentence to the docstring to prevent possible future confusion: the key ignores measurement config information, but as far as I can tell that matters for neither logpdf nor expected_data, only for configuration information like config.suggested_bounds().
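The caveat about ignored configuration can be illustrated with a toy example. Everything here is hypothetical: `ToyModel`, its `bounds` attribute (standing in for measurement config), and the helper functions are assumptions made for the sketch, not pyhf or cabinetry code. A key that skips part of the model is safe only for functions whose output does not depend on the skipped part.

```python
import json
from typing import Any, Callable, Dict, Hashable, Tuple


class ToyModel:
    """Hypothetical model; `bounds` mimics settings that the key does not capture."""

    def __init__(
        self,
        spec: Dict[str, Any],
        interpcodes: Dict[str, str],
        bounds: Tuple[float, float],
    ) -> None:
        self.spec = spec
        self.interpcodes = interpcodes
        self.bounds = bounds


def toy_model_key(model: ToyModel) -> Hashable:
    """Key from spec and interpcodes only; `bounds` is deliberately ignored."""
    return (
        json.dumps(model.spec, sort_keys=True),
        tuple(sorted(model.interpcodes.items())),
    )


def cached_call(
    cache: Dict[Hashable, float], fn: Callable[[ToyModel], float], model: ToyModel
) -> float:
    """Look up fn(model) in the cache by key, computing it on a miss."""
    key = toy_model_key(model)
    if key not in cache:
        cache[key] = fn(model)
    return cache[key]


def total_yield(model: ToyModel) -> float:
    return sum(model.spec["yields"])  # depends only on the spec


def upper_bound(model: ToyModel) -> float:
    return model.bounds[1]  # depends on the ignored settings


spec = {"yields": [1.0, 2.0, 3.0]}
codes = {"normsys": "code4"}
model_a = ToyModel(spec, codes, bounds=(0.0, 10.0))
model_b = ToyModel(spec, codes, bounds=(0.0, 5.0))

yield_cache: Dict[Hashable, float] = {}
bound_cache: Dict[Hashable, float] = {}

yield_a = cached_call(yield_cache, total_yield, model_a)
yield_b = cached_call(yield_cache, total_yield, model_b)  # valid cache hit
bound_a = cached_call(bound_cache, upper_bound, model_a)
bound_b = cached_call(bound_cache, upper_bound, model_b)  # stale: model_a's value
```

In this sketch the yield lookup is safe to share between the two models, while the bound lookup silently returns the wrong model's value, which is the kind of confusion a docstring warning guards against.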

lhenkelm · 2022-02-07T17:12:33Z

Thanks for reviewing! I added the warning to the docstring.
In the specific case of yield_stdev I agree this makes no difference, since no fits will be done and a full fit result is passed in alongside the model. In general, the best thing would be to have pyhf objects that are deliberate about how they hash (insofar as they model values), since capturing everything would involve diving deep into the tree of pyhf objects, many of them internal to pyhf.
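As a rough illustration of what "deliberate about how they hash" could mean, a frozen dataclass derives both equality and hash from its fields, so equal instances share cache entries. `ToyValueModel` and `expensive` are hypothetical names for this sketch and do not reflect how pyhf objects are or will be implemented.

```python
from dataclasses import dataclass
from functools import lru_cache
from typing import List, Tuple


@dataclass(frozen=True)
class ToyValueModel:
    """Hypothetical value-typed model: __eq__ and __hash__ derive from the fields."""

    spec_json: str
    interpcodes: Tuple[Tuple[str, str], ...]


calls: List[ToyValueModel] = []  # records every real (uncached) computation


@lru_cache(maxsize=None)
def expensive(model: ToyValueModel) -> int:
    """Placeholder for a costly calculation that depends only on the model value."""
    calls.append(model)
    return len(model.spec_json)


m1 = ToyValueModel('{"channels": []}', (("normsys", "code4"),))
m2 = ToyValueModel('{"channels": []}', (("normsys", "code4"),))  # equal copy

result_1 = expensive(m1)  # computed
result_2 = expensive(m2)  # served from the cache because m1 == m2
```

The fields must themselves be hashable (here a string and a tuple of tuples), which hints at why doing this for a deep tree of nested objects takes deliberate design.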

alexander-held

Looks great, thanks again!

alexander-held reviewed Feb 4, 2022

construct hashable key for model from modifier_settings for clarity

6b21cb4

rename model key helper to better indicate usage

20c7627

alexander-held mentioned this pull request Feb 4, 2022

Model hashing scikit-hep/pyhf#1762

Open

1 task

extend docstring to explain args and returned value

5535caf

warn about extra model options in docstring

766573b

alexander-held approved these changes Feb 8, 2022

alexander-held merged commit f34c73a into scikit-hep:master Feb 8, 2022

lhenkelm deleted the perf/yield-stdev-cache-model-value-like branch February 8, 2022 11:07

alexander-held mentioned this pull request Feb 11, 2022

test: dedicated test for model keys #327

Merged
