-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
When and where to get missing information for active storage #31
Comments
It appears that we can do it in one of three places:
Given the objection above, it looks like 2 is the right answer? |
(I think doing it at instantiation would make |
unless the Active Storage device makes the metadata available for reading locally (in some way), I reckon the best way is to have it passed to the client (just the metadata) and loaded outside the active call, since that's needed both for active and passive cases. Note that we will need the metadata not only for missing/fill values, but also for such magnificent things like cell measures, various other attributes, fixing units, fixing units of coordinates etc - a whole lot of metadata that we should think of a mechanism to be passed/loaded/used that is general enough to accommodate all those |
I would for go for "2. when instantiating", and agrees that V - what's the use case for passing other metadata (like cell measures) to the active storage? Perhaps I have misunderstood! |
@davidhassell a use case: we need to compute a mean of a variable that is masked with a cell measure (eg |
Hi V - I think that use case is out of scope, as we can't use active storage to do the work unless it's the first operation in the stack, and something like |
Ok, so we have a consensus on 2., but I am not sure how to handle the "allow missing data", as there are a lot of options just for missing data alone, let alone filters and compression, so would we assume that if any keyword attributes were present then all the keywords had be seen set appropriately? For the moment I'm going to ignore this, we can put that in a future version, since. by default that'd preserve backwards compatibility. |
Actually, I'm wrong, since we get the compression and filter info from the zarr metadata, we're just left with the missing stuff, which is well posed ... so I'll put that in now. |
yeah my thought too about backwards compatibility - hence me wondering about a design scheme to have all this mostly preserved when we start doing more complex stuff 🍺 |
At one point it was said
But we need access to netcdf attributes inside the active storage (to get all the information about missing values, compression etc).
Where and when do we think we should open the dataset and get that info?
The text was updated successfully, but these errors were encountered: