Create numba ufunc for sum of samples within charge extraction window #1038

watsonjj · 2019-04-03T15:17:40Z

This PR replaces the extract_charge_from_peakpos_array function with sum_samples_around_peakpos, a numba function that more efficiently extracts the charge from the waveform within the window defined by the peak position, window width, and window shift.

It is defined using the @guvectorize decorator, which creates a numpy universal function, enabling the passing of both scalars and arrays for the peak position, window width, and window shift arguments.

With @guvectorize you define (in this case) the operation applied to the waveform of a single channel and pixel. Numba then compiles that operation just in time and applies it across all channels and pixels.

I have profiled sum_samples_around_peakpos against the previous extract_charge_from_peakpos_array function. The new function reduces the execution time by a factor of 60 (from a couple of ms to tens of µs).

It also provides a factor of 2 reduction in execution time compared with the waveforms[:, :, start:end].sum(2) operation performed in GlobalPeakWindowSum.
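For reference, the slicing operation quoted above can be sketched in plain NumPy. The (channel, pixel, sample) layout, the array sizes, and the values of start and end are assumptions for illustration only.

```python
import numpy as np

# Assumed layout: (n_channels, n_pixels, n_samples); sizes are illustrative.
rng = np.random.default_rng(0)
waveforms = rng.normal(size=(2, 100, 40))

# Hypothetical window bounds shared by every channel and pixel.
start, end = 10, 17

# Sum over the sample axis (axis 2) inside the window.
charge = waveforms[:, :, start:end].sum(2)
```

The result has one charge value per channel and pixel, with the same window applied to all of them.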

The execution times for the extractors before this change were:

GlobalPeakWindowSum
6.15 ms ± 57.2 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
LocalPeakWindowSum
12 ms ± 209 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
NeighborPeakWindowSum
14.7 ms ± 293 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

The execution times for the extractors following this change are:

GlobalPeakWindowSum
3.26 ms ± 30.6 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
LocalPeakWindowSum
3.28 ms ± 28.3 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
NeighborPeakWindowSum
5.67 ms ± 33 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

The bottleneck is now the pulse_time calculation, which I could address in a different PR.

- Replace extract_charge_from_peakpos_array

codecov · 2019-04-03T15:36:38Z

Codecov Report

❗ No coverage uploaded for pull request base (master@24707ab).
The diff coverage is 88.23%.

@@           Coverage Diff            @@
##             master   #1038   +/-   ##
========================================
  Coverage          ?   83.2%           
========================================
  Files             ?     186           
  Lines             ?   10583           
  Branches          ?       0           
========================================
  Hits              ?    8806           
  Misses            ?    1777           
  Partials          ?       0

| Impacted Files | Coverage Δ |
|---|---|
| ctapipe/image/tests/test_extractor.py | `100% <100%> (ø)` |
| ctapipe/image/extractor.py | `67.67% <53.84%> (ø)` |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 24707ab...cc3b326.

thomasarmstrong · 2019-04-23T11:27:53Z

I have looked over the code and run some tests to compare these changes with the extraction methods in the current master branch. Everything seems good to me; in fact, this pull request also fixes a current bug in GlobalPeakWindowSum that causes it to fail if there is only one channel. I would suggest that the PR be approved.

(One note, there is a typo in the sum_samples_around_peak description "The ret argument is required by numpy to creae the numpy array which is")

maxnoe

Other than a typo looks good

* master: Correct function name Create extract_pulse_time_around_peakpos Fix passing config the CameraCalibrator Correct test Add test to show error in creating CameraCalibrator with config Correct min to max Fix test Update bokeh plotters to handle nan # Conflicts: # ctapipe/image/extractor.py

watsonjj · 2019-04-24T09:55:38Z

@kosack @maxnoe Could you restore your reviews here?

* master: Create numba ufunc for sum of samples within charge extraction window (cta-observatory#1038) Implement nan-handling like matplotlib high-level api (cta-observatory#1050) Fix See Also docs for sphinx 2 (cta-observatory#1051) Correct function name Create extract_pulse_time_around_peakpos Correct min to max Fix test Update bokeh plotters to handle nan

Create numba function sum_samples_around_peakpos

81ca930

- Replace extract_charge_from_peakpos_array

watsonjj mentioned this pull request Apr 3, 2019

Major refactoring of calibration chain #1026

Closed

24 tasks

Add test for out of range windows

e4296f6

watsonjj mentioned this pull request Apr 4, 2019

Replace WaveformCleaner and ChargeExtractor with WaveformExtractor #1033

Merged

Add extra information to docstring

5658e24

watsonjj added the optimization label Apr 4, 2019

Merge branch 'master' into sum_numba

b5f1b6f

* master: Set out-of-bounds pulse time to -1 Set to nan if out of range Rename UserWindowSum to FixedWindowSum

watsonjj added the ready for review label Apr 8, 2019

kosack requested changes Apr 9, 2019

Rename peakpos to peak_index

2bd85a6

kosack previously approved these changes Apr 15, 2019

thomasarmstrong previously approved these changes Apr 23, 2019

maxnoe requested changes Apr 23, 2019

Fixed typo

a8c207f

watsonjj dismissed stale reviews from thomasarmstrong and kosack via a8c207f April 23, 2019 13:13

maxnoe previously approved these changes Apr 23, 2019

thomasarmstrong previously approved these changes Apr 23, 2019

watsonjj added 2 commits April 23, 2019 16:42

Merge branch 'sum_numba' of https://github.com/watsonjj/ctapipe into …

40cb749

…sum_numba * 'sum_numba' of https://github.com/watsonjj/ctapipe: Fixed typo

watsonjj dismissed stale reviews from thomasarmstrong and maxnoe via 40cb749 April 23, 2019 14:43

Merge branch 'master' into sum_numba

cc3b326

* master: Fix See Also docs for sphinx 2 (cta-observatory#1051)

watsonjj requested review from maxnoe, kosack and thomasarmstrong April 23, 2019 15:09

thomasarmstrong approved these changes Apr 24, 2019

kosack approved these changes Apr 24, 2019

kosack merged commit 87273b3 into cta-observatory:master Apr 24, 2019

watsonjj deleted the sum_numba branch April 29, 2019 13:54