-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
v2: Standardizing .zmetadata #113
Comments
The current
I suspect this format was chosen in part because it's slightly more natural in Zarr-Python to look up flat-keys rather than nested keys. But I'm sure performance would be fine with nested keys, too. Inherently both seem fine to me. I trust that performance for parsing either structure in netCDF-C would probably be OK, too? At least for "reasonable" size consolidated metadata? But even if performance would be similar, if it's significantly harder to work with non-nested metadata on some platforms that is definitely worth considering. |
The reason that a nested encoding is slightly preferable for netcdf is that we track groups as independent objects, so we have to parse flat-keys got get the group info out of them. |
JSON objects are unordered, so no, there should be no expectations about sorting. |
@DennisHeimbigner: are you thinking about the structure for V2, V3 or both? |
I was just looking at the existing V2 .zmetadata. Has the issue been raised for V3 yet? |
I see from this comment: |
Can you clarify what you mean here? I think this is the same |
The header info
is added. So it needs standarization also. |
Right, that was also in my example above: #113 (comment) |
Just pinging this discussion based on today's zarr call: we need to sort out how to handle consolidated metadata for V3. Presumably it will be cheaper to list metadata in V3 because of the separation of metadata and data in the tree. But we still need an answer for "unlistable stores". Perhaps this is covered in the spec, but I did not see it. |
Consolidated metadata for v3 is being discussed in #136. Marking this issue for the v2 discussion. |
I want to begin a discussion about standardizing
the .zmetadata format for consolidated metadata.
Suppose we have this Zarr container.
This structure needs to be encoded as JSON in the .zmetadata object.
I can see two obvious encodings:
My observations:
nested encode
but that would depend on the implementation. It would be faster for netcdf-c,
but might not be for zarr-python.
The text was updated successfully, but these errors were encountered: