Feature comparison vs kerchunk #80
Labels
documentation
Improvements or additions to documentation
Kerchunk
Relating to the kerchunk library / specification itself
I made this comparison table, which should go in the docs somewhere.
dict
" with keys for each chunk in arrayManifestArray
instance (wrapping aChunkManifest
instance)dict
", created on a per-chunk basis using theinline_threshold
kwargnumpy.ndarray
instances, created on a per-variable basis using theloadable_variables
kwargdict
" with keys for each array in filexarray.Dataset
with variables wrappingManifestArray
instances (ornumpy.ndarray
instances)f.write(ujson.dumps(h5chunks.translate()).encode())
, then read using anfsspec.filesystem
mappeds.virtualize.to_kerchunk('combined.json', format='JSON')
, then read using anfsspec.filesystem
mapperdf.refs_to_dataframe(out_dict, "combined.parq")
, then read using anfsspec.implementations.reference.ReferenceFileSystem
mapperds.virtualize.to_kerchunk('combined.parq', format=parquet')
, then read using anfsspec.implementations.reference.ReferenceFileSystem
mappermanifest.json
filesds.virtualize.to_zarr()
, then read via any Zarr v3 reader which implements the manifest storage transformer ZEPkerchunk.combine.MultiZarrToZarr
xarray.merge
kerchunk.combine.MultiZarrToZarr
using theconcat_dims
kwargxarray.concat
kerchunk.combine.MultiZarrToZarr
using thecoo_map
kwargxarray.combine_by_coords
with in-memory xarray indexes created by loading coordinate variables firstxarray.combine_nested
dask.delayed
open_virtual_dataset
insidedask.delayed
but eventually instead usingxarray.open_mfdataset(..., parallel=True)
kerchunk.combine.auto_dask
ManifestArray
objects withindask.array.Array
objects insidexarray.Dataset
to use dask'sconcatenate
kerchunk.hdf.SingleHdf5ToZarr
open_virtual_dataset
, viakerchunk.hdf.SingleHdf5ToZarr
or potentiallyhidefix
kerchunk.netCDF3.NetCDF3ToZarr
open_virtual_dataset
, viakerchunk.netCDF3.NetCDF3ToZarr
kerchunk.tiff.tiff_to_zarr
open_virtual_dataset
, viakerchunk.tiff.tiff_to_zarr
or potentiallycog3pio
kerchunk.zarr.ZarrToZarr
open_virtual_dataset
, via ``kerchunk.zarr.ZarrToZarr` ?kerchunk.grib2.scan_grib
open_virtual_datatree
, viakerchunk.grib2.scan_grib
?kerchunk.fits.process_file
open_virtual_dataset
, viakerchunk.fits.process_file
?The text was updated successfully, but these errors were encountered: