You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm very excited about this package and I'm just familiarising myself to see where I can use it for my use cases. I followed the example in the documentation to apply a groupby to the datatree. However, I did use dask because my dataset is too large to fit it into memory. I realised that my group_by function is not being applied to lazy loaded dask arrays.
DataTree('None', parent=None)
└── DataTree('second')
Dimensions: (time: 9)
Dimensions without coordinates: time
Data variables:
x (time) int64 dask.array<chunksize=(9,), meta=np.ndarray>
Versions
xarray: 2022.6.0
datatree: 0.0.9
The text was updated successfully, but these errors were encountered:
What is your issue?
Copied from xarray-contrib/datatree#152
Issue
Hi,
I'm very excited about this package and I'm just familiarising myself to see where I can use it for my use cases. I followed the example in the documentation to apply a
groupby
to the datatree. However, I did use dask because my dataset is too large to fit it into memory. I realised that mygroup_by
function is not being applied to lazy loaded dask arrays.Minimal example
Please compare the results for the eager (
a
) and lazy (b
) loaded datasets below:Any ideas what is going wrong?
This can likely be generalised for any
map_blocks
function:Versions
xarray: 2022.6.0
datatree: 0.0.9
The text was updated successfully, but these errors were encountered: