[draft] Accept Cubed arrays instead of dask #249

TomNicholas · 2023-06-24T17:39:50Z

Very rough changes to see what happens when flox is given a cubed.Array. Few changes are needed to get cubed array inputs all the way to the reduction step, but I have not yet been able to run it, so additional incompatibilities may turn up.

Would close Using Flox with cubed #224

TomNicholas · 2023-06-24T17:51:54Z

flox/core.py

+    from xarray.core.parallelcompat import get_chunked_array_type
+
+    chunkmanager = get_chunked_array_type(array)


Obviously this approach would introduce a dependency on xarray, which presumably is not desirable.

I'm fine just having dask_kwargs and cubed_kwargs instead of all this complexity.

I probably should have just done that in xarray itself 😅
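A minimal sketch of the simpler `dask_kwargs` / `cubed_kwargs` idea, with no xarray import. The helper names `chunked_backend` and `select_kwargs` are hypothetical (not part of flox or xarray), and detecting the backend from the type's top-level module name is an assumption made only for illustration.

```python
from typing import Any, Optional


def chunked_backend(array: Any) -> Optional[str]:
    """Return "dask" or "cubed" if the array's type lives in that package, else None.

    Hypothetical helper: it inspects the module path of the type instead of
    importing xarray, dask, or cubed.
    """
    root = type(array).__module__.split(".")[0]
    return root if root in ("dask", "cubed") else None


def select_kwargs(
    array: Any,
    dask_kwargs: Optional[dict] = None,
    cubed_kwargs: Optional[dict] = None,
) -> dict:
    """Pick the keyword arguments that match the array's backend."""
    backend = chunked_backend(array)
    if backend == "dask":
        return dict(dask_kwargs or {})
    if backend == "cubed":
        return dict(cubed_kwargs or {})
    # Not a recognised chunked array: no backend-specific options apply.
    return {}
```

The trade-off is that every new backend needs its own keyword and its own branch, which is what the chunkmanager lookup avoids at the cost of the xarray dependency.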

TomNicholas · 2023-06-24T17:52:44Z

flox/core.py

+        # meta=array._meta,
        align_arrays=False,
-        name=f"{name}-chunk-{token}",
+        # name=f"{name}-chunk-{token}",


_meta and name are dask-specific. Are they used for anything important here or just for labelling tasks in the graph?

If you don't provide `meta`, dask will try to infer it and then break?
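One way to keep `meta` and `name` for dask without commenting them out for everything else is to build the keyword arguments conditionally. This is only a sketch: `blockwise_kwargs` is a hypothetical helper, the module-name check is an assumption, and whether other backends accept `align_arrays` would need to be confirmed against their own `blockwise` signature.

```python
from typing import Any


def blockwise_kwargs(array: Any, name: str, token: str) -> dict:
    """Build keyword arguments for a blockwise call (hypothetical helper).

    `meta` and `name` are only added when the input looks like a dask array,
    since both are dask-specific.
    """
    kwargs: dict = {"align_arrays": False}
    if type(array).__module__.split(".")[0] == "dask":
        # dask uses `meta` to describe the output type and `name` to label graph keys.
        kwargs["meta"] = array._meta
        kwargs["name"] = f"{name}-chunk-{token}"
    return kwargs
```

With this shape, the dask code path is unchanged and other array types simply never see the dask-only arguments.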

dcherian · 2023-06-24T20:10:47Z

flox/core.py

        else:
            combine = partial(_grouped_combine, engine=engine, sort=sort)
-            combine_name = "grouped-combine"


There's a test for these names that will need to be fixed.

flox/aggregations.py

dcherian · 2023-06-24T20:12:34Z

flox/core.py

@@ -1889,7 +1896,7 @@ def groupby_reduce(
        axis_ = np.core.numeric.normalize_axis_tuple(axis, array.ndim)  # type: ignore
    nax = len(axis_)

-    has_dask = is_duck_dask_array(array) or is_duck_dask_array(by_)
+    has_dask = is_chunked_array(array) or is_duck_dask_array(by_)


Suggested change

-    has_dask = is_chunked_array(array) or is_duck_dask_array(by_)
+    is_chunked = is_chunked_array(array) or is_chunked_array(by_)
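For illustration, a duck-typed check along these lines could look like the sketch below. It assumes that "chunked" means an array-like object exposing `shape`, `dtype`, and `chunks`; the `any_chunked` wrapper is hypothetical, and this is not claimed to match the `is_chunked_array` actually imported in this PR.

```python
from typing import Any


def is_chunked_array(x: Any) -> bool:
    """Duck-typed check: array-like attributes plus a `chunks` attribute (assumed definition)."""
    return all(hasattr(x, attr) for attr in ("shape", "dtype", "chunks"))


def any_chunked(*arrays: Any) -> bool:
    """True if any input is chunked, so `array` and `by_` are treated the same way."""
    return any(is_chunked_array(a) for a in arrays)
```

Usage would then be `is_chunked = any_chunked(array, by_)`, which avoids mixing a generic check for `array` with a dask-only check for `by_`.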

TomNicholas and others added 3 commits June 24, 2023 13:35

array api compatiblity

6eca6f1

use xarray chunkmanager

58d2021

[pre-commit.ci] auto fixes from pre-commit.com hooks

8fdc367

for more information, see https://pre-commit.ci

TomNicholas mentioned this pull request Jun 24, 2023

Using Flox with cubed #224

Open

TomNicholas added 2 commits June 24, 2023 13:46

remove commented out line

4777e77

Merge branch 'cubed' of https://github.com/TomNicholas/flox into cubed

5582e5e

TomNicholas marked this pull request as draft June 24, 2023 17:53

TomNicholas and others added 3 commits June 26, 2023 16:37

remove uneccessary asarray

fabaf35

remove concatenate kwargs, use array API version of reshape, and add axis to chunk identity fn

858c98a

[pre-commit.ci] auto fixes from pre-commit.com hooks

786af6a

for more information, see https://pre-commit.ci

dcherian closed this Jun 29, 2024
