
spec v3: progressive encoding #80

Open
davidbrochart opened this issue Jun 15, 2020 · 6 comments
Labels
protocol-extension Protocol extension related issue

Comments

@davidbrochart
Contributor

I'm wondering if progressive encoding could be supported in Zarr. It is a technique often used on the web, where a low-resolution image is downloaded and displayed first, then refined as the download continues (see e.g. https://cloudinary.com/blog/progressive_jpegs_and_green_martians).
Zarr currently supports only full-resolution contiguous chunks, so if you want a global view of the data, even one you are going to coarsen afterwards, you first have to fetch all the data. Progressive encoding would save a lot of bandwidth in this case, which is particularly useful for e.g. visualization.
But I'm not sure whether it would fit easily into the current architecture, or whether there is interest in it.

@davidbrochart
Contributor Author

Or maybe this could be handled by a special compressor, provided that we can request a given resolution from the Zarr store. Each chunk would then contain all the resolutions, which could look like:

0/0/0_res0
0/0/0_res1
0/0/0_res2

For full resolution, you would have to read (and combine) all three files; for low resolution, only 0/0/0_res0.
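A minimal sketch of how such a layered chunk could be decoded, assuming a hypothetical layout where `0/0/0_res0` holds a 2x-downsampled base and each later `_resN` file holds the residuals needed to restore the next finer resolution (the names, the nearest-neighbour upsampling, and the residual scheme are all illustrative, not part of any spec):

```python
import numpy as np

def upsample(a, factor=2):
    # Nearest-neighbour upsampling: repeat each value along both axes.
    return a.repeat(factor, axis=0).repeat(factor, axis=1)

def decode_chunk(layers, max_res):
    # layers[0] is the coarse base (the hypothetical "_res0" file);
    # layers[1:] hold residuals at successively doubled resolution.
    out = layers[0]
    for residual in layers[1:max_res + 1]:
        out = upsample(out) + residual
    return out

# Encode a toy 4x4 chunk into a base layer plus one residual layer.
data = np.arange(16, dtype=float).reshape(4, 4)
base = data.reshape(2, 2, 2, 2).mean(axis=(1, 3))  # 2x2 block averages
residual = data - upsample(base)                   # fine-grained details
layers = [base, residual]
```

Requesting resolution 0 returns the 2x2 base without touching the other files; requesting resolution 1 reads and combines everything back into the original chunk.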

@davidbrochart
Contributor Author

Might be a duplicate of #23.

@Carreau
Contributor

Carreau commented Jun 15, 2020

> Might be a duplicate of #23.

I'm not sure it's the same, in the sense that in #23 you completely store multiple resolutions, while what you are requesting here is basically having the "lower frequencies" early in the chunks, if I understand correctly.

I would put that under the "partial read" or "partial decompress" category of use cases.

@davidbrochart
Contributor Author

Yes, the point is not to duplicate data, even a shrunk version of it, but to decompose the data into progressive layers of detail, a kind of Fourier transform, to pick up your analogy with frequencies.
It might make a good compressor anyway because of the correlation between successive layers: e.g. the first layer is a coarsened average, the next is the finer-grained differences on top of it, etc.
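To illustrate why the layers compress well, here is a sketch of a recursive coarsened-average decomposition (a Haar-pyramid-style scheme chosen for illustration, not a Zarr codec): for smooth data, the residual layers are close to zero and therefore highly compressible.

```python
import numpy as np

def encode_layers(data, levels):
    # Recursively split a 2D array into a coarse base plus per-level
    # residuals: each pass replaces the data with its 2x2 block means
    # and records the differences needed to undo that coarsening.
    layers = []
    for _ in range(levels):
        h, w = data.shape
        base = data.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))
        up = base.repeat(2, axis=0).repeat(2, axis=1)
        layers.append(data - up)   # fine-grained differences
        data = base
    layers.append(data)            # coarsest average
    return layers[::-1]            # base first, finest residuals last

# A smooth 64x64 ramp: successive residual layers are nearly zero.
smooth = np.linspace(0, 1, 64)[:, None] + np.linspace(0, 1, 64)[None, :]
layers = encode_layers(smooth, 3)
```

Because each residual only stores what the coarser layer missed, the original array is reconstructed exactly by upsampling and adding the layers back in order.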

@joshmoore
Member

@davidbrochart : my hope would have been to find a way with v3 to have jpeg2000-like compression across the multiscales. However, that was moved out as a "convention" rather than as part of the spec. I'd certainly be interested in having the two interact reasonably together.

@davidbrochart
Contributor Author

Thinking about it, it might already be possible to implement progressive encoding by adding a new dimension representing the resolution. When reading a Zarr array, decoding would proceed according to the requested resolution value (i.e. reading all the chunks corresponding to the lower resolutions up to the requested one and combining them), and the resolution dimension would be removed from the result.
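A minimal sketch of that read path, assuming a hypothetical scheme where the array carries a leading "resolution" axis, layer 0 is a coarse approximation stored at full shape, and later layers are additive refinements (the function name and the sum-based combining rule are illustrative assumptions):

```python
import numpy as np

def read_progressive(arr, res):
    # arr has a leading "resolution" axis: combine layers 0..res by
    # summation, which also drops the resolution dimension from the
    # result, as described above.
    return arr[: res + 1].sum(axis=0)

# Toy 2-layer encoding: a global-average preview plus one refinement.
data = np.arange(16.0).reshape(4, 4)
coarse = np.full_like(data, data.mean())     # layer 0: global average
stacked = np.stack([coarse, data - coarse])  # shape (2, 4, 4)
```

Here `read_progressive(stacked, 0)` yields the cheap preview from the first layer alone, while `read_progressive(stacked, 1)` combines both layers to recover the full-resolution data.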
