You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<xarray.Dataset> Size: 0B
Dimensions: ()
Data variables:
*empty*
Attributes:
something_same: a
I would like a way to preserve the values of something_different on each dataset. Perhaps we could add an option to just make a list of the differing items?
<xarray.Dataset> Size: 0B
Dimensions: ()
Data variables:
*empty*
Attributes:
something_same: a
something_different: [a, b]
This is motivated by a real world use case. For CMIP6 each file has a unique tracking_id that can be used to find issues with a specific file (which would then affect all the resulting concatenated dataset). Currently my pangeo-forge-recipes based workflow is completely dropping this important information.
Happy to help with a PR but I am not quite sure what the best way to expose such a behavior to the user is?
Would this be a keyword argument to StoreToZarr?
The text was updated successfully, but these errors were encountered:
FYI this is a hard problem in general, and we normally recommend promoting unique_tracking_id to be an actual coordinate variable so that it has specific rules for propagation.
Interesting. It would be great to have this implemented on the xarray level, but AFAICT that would still not solve the issue here, since we are not using xarray to generate much of the schema?
When we merge dataset schemas here we currently drop everything in the attributes that is not identical between them.
Example:
gives
I would like a way to preserve the values of
something_different
on each dataset. Perhaps we could add an option to just make a list of the differing items?This is motivated by a real world use case. For CMIP6 each file has a unique tracking_id that can be used to find issues with a specific file (which would then affect all the resulting concatenated dataset). Currently my pangeo-forge-recipes based workflow is completely dropping this important information.
Happy to help with a PR but I am not quite sure what the best way to expose such a behavior to the user is?
Would this be a keyword argument to
StoreToZarr
?The text was updated successfully, but these errors were encountered: