
Restructure workflow to facilitate workflow partitioning for efficient streaming execution #163

Merged
SimonHeybrock merged 12 commits into main from streaming-workflow on Sep 25, 2024

Conversation

@SimonHeybrock (Member) commented Aug 29, 2024

There are two structural changes here:

  1. Split the wavelength reduction (giving the new ReducedQ) from the normalization. This is essential for streamed processing since we need to accumulate data (numerator and denominator) before normalization, but as late as possible in the workflow. The wavelength reduction is relatively expensive since it performs binned-data operations, e.g., to handle wavelength bands. Furthermore, accumulating event data must be avoided since it would quickly exhaust memory (see the sketch below this list).
  2. Split the detector term from the monitor term in the computation of the denominator. As the detector term (solid angle and direct beam function) is static, it can be pre-computed. This is important since it involves a relatively costly reduction over many pixels into Q-bins.
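
A minimal sketch of the accumulate-then-normalize pattern that change 1 enables is shown below. It is plain NumPy rather than the actual ess.sans API, and `reduce_chunk_to_q`, `streamed_iofq`, and the chunk iteration are hypothetical stand-ins:

import numpy as np

def reduce_chunk_to_q(events: np.ndarray, nbins: int = 100) -> np.ndarray:
    # Hypothetical stand-in for the expensive wavelength reduction producing ReducedQ:
    # histogram the chunk's events into Q-bins so only dense data needs to be kept.
    return np.histogram(events, bins=nbins, range=(0.0, 1.0))[0].astype(float)

def streamed_iofq(chunks, denominator_chunks, nbins: int = 100) -> np.ndarray:
    numerator = np.zeros(nbins)
    denominator = np.zeros(nbins)
    for events, denom in zip(chunks, denominator_chunks):
        # Accumulate dense numerator and denominator terms; the event data of a chunk
        # can be dropped immediately afterwards, so memory use stays bounded.
        numerator += reduce_chunk_to_q(events, nbins)
        denominator += denom
    # Normalize only once, as late as possible, on the accumulated terms.
    return numerator / denominator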

Base automatically changed from ess-reduce-nexus-workflow to main August 29, 2024 08:26
@SimonHeybrock marked this pull request as ready for review September 4, 2024 03:18
src/ess/sans/conversions.py (outdated review comments, resolved)
src/ess/sans/normalization.py (outdated review comment, resolved)
:py:func:`iofq_norm_wavelength_term_sample` or
:py:func:`iofq_norm_wavelength_term_background`.
Keeping the monitor term separate from the detector term allows us to compute
the latter only once when repeatedly processing chunks of events in streamed data.
Member

I guess this is assuming that the pixel positions, and thus the solid angle, remain the same throughout the run.
This is fine I think, as we are no longer computing the beam center as part of the workflow.

That said, could the beam center be refined as more and more signal is collected?

Member Author

My current idea is to use the stream-processing feature for component position updates, i.e., one level higher than the Sciline workflow.
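
For illustration, a rough sketch of pre-computing the static detector term once and reusing it per chunk, assuming pixel positions (and thus the solid angle) stay fixed during the run; `detector_term`, `denominator_for_chunk`, and the array shapes are hypothetical, not the actual ess.sans providers:

import numpy as np

def detector_term(solid_angle: np.ndarray, direct_beam: np.ndarray) -> np.ndarray:
    # Static part of the denominator: per-pixel solid angle combined with the
    # direct-beam function, computed a single time before streaming starts.
    return np.outer(solid_angle, direct_beam)  # shape: (pixel, wavelength)

static_term = detector_term(np.ones(4), np.ones(3))

def denominator_for_chunk(monitor_term: np.ndarray) -> np.ndarray:
    # Per chunk, only the cheap monitor term needs to be multiplied in;
    # it broadcasts along the pixel axis of the pre-computed static term.
    return static_term * monitor_term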


The denominator is then simply:
:math:`M_{\\lambda} T_{\\lambda} D_{\\lambda} \\Omega_{R}`,
which is equivalent to ``wavelength_term * solid_angle``.
Member

I think wavelength_term here was referring to variable names in the code. Should we update or rewrite so that we don't have to change the docstring if we change variable names?

Member Author

 graph_no_grav = pipeline.compute(ElasticCoordTransformGraph)
 pipeline[CorrectForGravity] = True
 data_with_grav = (
-    pipeline.compute(CleanWavelengthMasked[SampleRun, Numerator])
+    pipeline.compute(CleanWavelength[SampleRun, Numerator])
     .flatten(to='pixel')
     .hist(wavelength=sc.linspace('wavelength', 1.0, 12.0, 101, unit='angstrom'))
 )
Member

Is it worth adding a test which sort of simulates the streaming? Basically make two chunks of data that are accumulated? I'm not sure what you would test though? Just that it doesn't fail?

Or does this belong more in the data streaming (i.e. in a different repository)?

Member Author

In any case, it does not belong in this PR. And since streaming was possible even before the restructure (it just recomputed more than strictly necessary), such a test would require mocking some workflow components and adding call counters. Tests of that kind exist in the ess.reduce.streaming module.
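
Roughly, such a test feeds a small number of chunks through an accumulator while counting calls to the expensive step; the sketch below is generic pytest-style code with hypothetical helpers and does not use the actual ess.reduce.streaming API:

import numpy as np

def test_two_chunks_accumulate_and_reduce_runs_once_per_chunk():
    calls = {'reduce': 0}

    def counting_reduce(chunk: np.ndarray) -> np.ndarray:
        # Wrap the (hypothetical) expensive reduction with a call counter.
        calls['reduce'] += 1
        return np.histogram(chunk, bins=10, range=(0.0, 1.0))[0].astype(float)

    accumulated = np.zeros(10)
    for chunk in (np.random.rand(100), np.random.rand(100)):
        accumulated += counting_reduce(chunk)

    assert calls['reduce'] == 2
    assert accumulated.sum() == 200.0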

@SimonHeybrock merged commit 8826f6b into main Sep 25, 2024
4 checks passed
@SimonHeybrock deleted the streaming-workflow branch September 25, 2024 08:44