Generalize reproject_and_coadd for N-dimensional data, and add option to specify blank pixel value and progress bar #351
Conversation
Codecov Report
Attention: Patch coverage is …

Additional details and impacted files:

@@            Coverage Diff             @@
##             main     #351      +/-   ##
==========================================
- Coverage   93.63%   90.60%    -3.03%
==========================================
  Files          25       25
  Lines         895      947       +52
==========================================
+ Hits          838      858       +20
- Misses         57       89       +32

View full report in Codecov by Sentry.
This is useful, thanks! Just some comments below; please be sure to include some tests.
I'm a little stymied on this: …
@astrofrog I have some questions now. The loop for … I think I'm going to move the write-to-big-array step into the parent loop, because otherwise everything still has to be held in memory.
Ahhh, there's good reason for that - background matching! Yikes.
So, as is, this does coadding on a dataset-by-dataset basis, which for cubes means a cube-by-cube basis. It may be more efficient to do it on a plane-by-plane basis, but that seems like a lot of additional refactoring work.
There is a huge efficiency boost to be gained if we can reproject directly into the output array, but that results in overwriting the individual values instead of adding to them.
@astrofrog I don't see any way to do this, but do you have ideas? Basically, instead of …, we would reproject each input directly into the output array. This sacrifices flexibility even further to improve speed and robustness and to reduce memory use. In the case where you don't want to match backgrounds, this approach just accumulates the unweighted data into one array and the footprint into another array and divides them at the end. The robustness bit is that we can flush to disk granularly. I'm not sure this is worth pursuing further, though, because of the depth of refactoring that would be needed. Maybe this is something just solved well enough by dask.
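For concreteness, a minimal sketch of this accumulate-then-divide idea (a hand-written illustration, not the actual reproject internals; `reproject_interp` stands in for the configurable reprojection step):

```python
import numpy as np
from reproject import reproject_interp

def coadd_streaming(input_data, wcs_out, shape_out):
    # Running sums: these could be np.memmap arrays, which is what
    # would allow flushing to disk granularly and building mosaics
    # larger than memory.
    data_sum = np.zeros(shape_out)
    footprint_sum = np.zeros(shape_out)

    for array, wcs_in in input_data:
        # Reproject one input onto the output grid...
        reprojected, footprint = reproject_interp(
            (array, wcs_in), wcs_out, shape_out=shape_out
        )
        # ...and fold it into the running sums immediately, so the
        # individual reprojected arrays never coexist in memory.
        valid = footprint > 0
        data_sum[valid] += reprojected[valid] * footprint[valid]
        footprint_sum += footprint

    # Mean combine: pixels that were never covered end up NaN (0/0).
    with np.errstate(invalid="ignore", divide="ignore"):
        return data_sum / footprint_sum, footprint_sum
```

The catch, as noted above, is that this only works with match_background off: background matching needs all the pairwise overlaps before anything can be combined.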
I'll have a think about all this! Should be able to work on it a bit this week.
@astrofrog I've been using this PR in practice for a while now, and I just had a look into rebasing, but the codebase has diverged a ton, making this a rather challenging rebase. I'd like to go ahead with it, but this time make sure the changes can get merged. What's your feeling - is there anything blocking this if it's rebased?
(force-pushed from a0f3111 to bde9d46)
OK, the rebase wasn't as bad as I thought, but there were some confusing items that I'm not sure I've resolved yet.
I can try and prioritise this!
With the new modes (first, last, min, max) we again face duplicating code, because we need one version that tries to background-match before the final combination step and another that does not. The current approach simply breaks if match_background is off and one tries first/last/min/max. Is there a more elegant approach?
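For the non-matching case, the new modes could at least in principle be folded in as running updates against the output array, avoiding the code path that holds everything in memory. A rough sketch (a hypothetical helper, not code from this PR; it assumes the output starts out all-NaN):

```python
import numpy as np

def combine_running(output, new, footprint, mode):
    """Fold one reprojected array into the running output in place."""
    valid = footprint > 0
    if mode == "first":
        # Fill only cells that no earlier input has covered yet.
        empty = np.isnan(output) & valid
        output[empty] = new[empty]
    elif mode == "last":
        # Later inputs overwrite earlier ones wherever they have data.
        output[valid] = new[valid]
    elif mode == "min":
        # fmin/fmax ignore NaN, so uncovered cells never win.
        output[valid] = np.fmin(output[valid], new[valid])
    elif mode == "max":
        output[valid] = np.fmax(output[valid], new[valid])
    else:
        raise ValueError(f"unsupported mode: {mode}")
```

With match_background on, no streaming variant works, since the background solution depends on every pairwise overlap, so some duplication between the two paths may be unavoidable.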
Will investigate!
To try and keep things a bit simpler, I've extracted the changes related to specifying output arrays/footprints into a separate PR: #387
In terms of supporting 3D arrays (and also the median combine mode), I think things are getting complicated enough that it's worth thinking about a different approach to the co-adding. I'm working on an experimental re-write of reproject_and_coadd.
See #388 for an alternative approach.
(force-pushed from 6f1950b to d1319d1)
@astrofrog I'm still using this branch in production. It's apparently working (verification in progress). I clearly ran out of time to test #388. Any further thoughts on which direction to take? Both PRs are big, and I don't immediately remember what the reasons are for one or the other.
Sorry for dropping the ball on this; I'll try and see if I can wrap things up in the next week or so.
(force-pushed from d1319d1 to ba28206)
My last commit adds a hack to solve this issue: radio-astro-tools/spectral-cube#900 (I should've posted that here, perhaps, but it isn't obvious to me where it came from).
@astrofrog I'm at a workshop with @e-koch, and I'd like to show off that we can do the mosaicking in radio-astro-tools/spectral-cube#868. Any chance we can make a little progress on this shortly?
@keflavich yes, sorry for the delay; I will be back at work on Monday and will try to prioritise wrapping this up.
…remove 'median' as an advertised option since it isn't implemented.
I pushed a couple of commits cleaning this up - I've generalized it to N dimensions, which simplifies things, and changed the default for blank_pixel_value to 0 for backward compatibility. Thanks @keflavich, and sorry for taking so long to wrap this up!
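For reference, a usage sketch of the option described above (the keyword name and default are taken from this comment; treat the rest of the call as illustrative, with hypothetical input files):

```python
import numpy as np
from astropy.io import fits
from reproject import reproject_interp
from reproject.mosaicking import find_optimal_celestial_wcs, reproject_and_coadd

# Hypothetical input tiles.
hdus = [fits.open(fn)[0] for fn in ("tile1.fits", "tile2.fits")]
wcs_out, shape_out = find_optimal_celestial_wcs(hdus)

array, footprint = reproject_and_coadd(
    hdus,
    wcs_out,
    shape_out=shape_out,
    reproject_function=reproject_interp,
    combine_function="mean",
    # Value used for output pixels with no input coverage; the
    # default of 0 preserves the previous behaviour.
    blank_pixel_value=np.nan,
)
```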
This enables memmap'd output, which will be needed for very large (greater-than-memory) output cubes.

EDIT: it also extends reproject_and_coadd into three dimensions.
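A rough sketch of the memmap'd-output workflow, assuming the output_array/output_footprint keywords split out into #387 (the file names, shapes, and inputs here are hypothetical):

```python
import numpy as np
from astropy.io import fits
from astropy.wcs import WCS
from reproject import reproject_interp
from reproject.mosaicking import reproject_and_coadd

# Hypothetical input cubes and a precomputed target 3D WCS.
cubes = [fits.open(fn)[0] for fn in ("cube1.fits", "cube2.fits")]
wcs_out = WCS(fits.getheader("target_header.fits"))
shape_out = (500, 2048, 2048)  # (nchan, ny, nx), larger than RAM

# Disk-backed output arrays, so the mosaic never has to fit in memory.
output = np.memmap("mosaic.dat", dtype=float, mode="w+", shape=shape_out)
footprint = np.memmap("footprint.dat", dtype=float, mode="w+", shape=shape_out)

reproject_and_coadd(
    cubes,
    wcs_out,
    shape_out=shape_out,
    reproject_function=reproject_interp,
    output_array=output,        # filled in place
    output_footprint=footprint,
)
```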