-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problems with ATLAS datasets #317
Comments
@sol1105 thanks for looking at the atlas issue :) That means we can support subsetting atlas when we add an "atlas-fix"? |
@cehbrecht Yes, I think so. To be sure I would add further ATLAS test datasets (ATLAS CORDEX, CMIP5) and tests for subset, regrid (and if applicable also average) operators when we implement this fix. I suggest a general fix in clisops: Should we raise this as an issue for netcdf (or possibly xarray) as well? In a v2 of ATLAS maybe the fillvalue and deflate problems should be addressed. Can you inform them about these issues? Edit: Also cdo cannot open these files without problems, since the |
I found some more information on that problem in the |
Hi, you are faced with two viable alternatives:
The ATLAS v1 dataset was crafted using netcdflib version 4.4.1.1 and hdf5lib version 1.10.1, a deliberate decision aimed at optimizing format readability with other library versions to the fullest extent possible. |
@cofinoa Thanks for your reply. The
So our planned |
@sol1105 the PR Unidata/netcdf-c#2716 just make "unreadable" VL datatype datasets/variables which filters are NO-OPTIONAL. Therefore, the ATLAS v1 dataset will be readable for the next release, the filters applied to String variables are optional, and still will be readable for the next netcdf-c. The netCDF library has strong principle to make readable any data been generated by previous library versions, for curation purposes. What was problematic was Unidata/netcdf-c#2231 in netcdf-c version >=4.9. This PR "broke" code that write VL datatypes with filters and they were silently ignored and not applied, but the PR raises an error is being raised, make this code buggy. The PR Unidata/netcdf-c#2716 in next release, will raise an error only if filter is NO OPTIONAL when VL dataype data will be written, and ignored and just warning user that filter is not applied when filter is OPTIONAL. Said that, I will test ATLAS v1 with next netcdf-c release. For the xarray-fix, the 3rd option would be use next netcdf-c release to write the subsetted data. |
Update:
We need to wait for the 4.9.3 release, but my conclusion it's to avoid netcdf-c version >=4.9 AND <4.9.3, because those versions break existing code that worked with previous versions (<4.9). |
I think this can be closed with #319 and roocs/roocs-utils#111 / roocs/roocs-utils#113 . In general however the following issues should be addressed for future versions of the ATLAS datasets, since they may also affect the compatibility with other tools
|
Description
The ATLAS datasets are aggregated CMIP5/6 or CORDEX datasets that have been remapped to a regular grid and contain data from multiple sources in a single data file (arranged along a new dimension
member
): Link1 Link2. It is planned thatclisops
supports the processing of these datasets in the future. First tests show the following problems:clisops filenamer
has to be updated.filenamer simple
cannot write processed output to disk. This is due to a netCDF error, caused by the deflate settings of string/character variables in the ATLAS datasets.What I Did
This fails with:
It works when overwriting the encoding settings of the character/string variables introduced in the ATLAS datasets:
The text was updated successfully, but these errors were encountered: