Include AncillaryVariables and restructure Cube metadata. #3422

lbdreyer · 2019-09-26T00:50:59Z

This includes the functionality to represent CF Ancillary variables on a cube.

This PR also includes a restructuring of dimensional metadata objects (coords, cell measures and ancillary dataset) in a hope to avoid code duplication.

Below is a rough diagram of the new structure:

lib/iris/coords.py

lib/iris/tests/unit/coords/test_CellMeasure.py

lib/iris/coords.py

lib/iris/tests/unit/coords/test_AncillaryVariable.py

pp-mo · 2019-10-14T16:38:25Z

Hi @lbdreyer
Just visiting, I know this isn't ready for review yet.
But as I'm reviewing #3378, I'm seeing that changing the cube print function means you will also need to tweak the content handling in iris.experimental.representation : This provides html output for the Jupyter _repr_html_() handler -- but it does it by reprocessing the cube.__str__() output.

For convenience, I suggest you could get myself or @stephenworsley to do that, as we have both been in that area recently, and it's a bit involved.

BTW maybe that shouldn't be in iris.experimental anymore ?

lbdreyer · 2019-10-15T16:05:15Z

Thanks for raising this @pp-mo. I also noticed that CellMeasure don't seem to be handled either?

I am happy for this work to be included in a separate PR, and have raised an issue to address it: #3467

lib/iris/coords.py

pp-mo · 2019-10-17T14:31:16Z

lib/iris/coords.py

+        values = values.copy()
+
+        # If the metadata is a coordinate and it has bounds, repeat the above
+        # with the bounds.


The comment in the former Coord code here also added "This will not realise lazy data."
I think that is still a useful hint + could usefully be added back.

lib/iris/coords.py

stephenworsley · 2019-10-18T16:06:23Z

lib/iris/cube.py

+                ancillary_variable_summary, cube_header = vector_summary(
+                    [], cube_header, max_line_offset,
+                    ancillary_variables=vector_ancillary_variables)
+                summary += '\n     Ancillary Datasets:\n'


For the sake of consistency, the 'D' in Datasets should be lower case. This also applies to the 'M' in Measures in the above summary += '\n Cell Measures:\n'.

I'd argue it should go the other way; we should update the other section headings (Dimension coordinates -> Dimension Coordinates) since they are headings and so should be capitalised

FWIW I lean toward @lbdreyer here : "All Title Case" makes sense to me.

Should this say Ancillary Variables?

pp-mo · 2019-10-22T17:59:35Z

this method ... pass a 'bounds' keyword to the copy function. This reads oddly, as the local 'copy' definition, immediately below, does not support a bounds keyword.
... But all the other _DimensionMetadata methods, ... use an "override, call parent + extend" strategy instead.
... Hmmmm. Still not quite sure what I think of this yet.

After some extensive discussion !

I observed that the worst of the 'mixed' approach is that, where a Coord method "extends" the _DimensionalMetadata method to do "the same thing to the bounds", we have code which is effectively copied from one class to the other : This not only breaks DRY but puts the 'copy' a long way away where it is not obvious !
E.G. the worst case is probably the __binary_operator__ method.

I think we should either

"separate" : kick all the bounds-handling code out of the _DimensionalMetadata class, and put it all in Coord
* ( see the idea in Ancil var strict lbdreyer/iris#6 )
* or ..
"combine" : include all the bounds-handling code in _DimensionalMetadata, but only provide means to set the bounds in the Coord class.
* ( see the idea in Merge bounds handling into _DimensionalMetadata class. lbdreyer/iris#7 )

Comparison of "separate" to "combine":
Actually I think "combine" is probably slicker, because then several methods don't need to be overriden in Coord at all.
It's certainly easier to make work, starting from where we are.
Also the "separate" approach has the problem that the _DimensionalMetadata is rather "overkill" for just storing a bounds array (and might give slower operation ?)
But "combine" is a bit weaselly, and it takes some explaining...

* Merge bounds handling into _DimensionalMetadata class. * Fix cube arithmetic bug. * Code style fix.

lib/iris/cube.py

pp-mo · 2019-10-23T12:26:04Z

Hi @lbdreyer, and anyone else watching.
I finally got round to doing some more practical checks, and checking out the testing provision.
It's now rather late to make new objections, as we really want to merge this so we can move on to subsequent tasks for the following 2-week development cycle.

So, I'm happy that there are at least some tests to exercise all the new methods.
However, I did some across some problems, and I have identified some missing testing that could usefully be included here :

simple tests for cube.summary() and repr/str, showing ancillary variable with correct dimension mapping
merge example showing mismatched ancillary variables preventing merge
merge example identical ancillary variables shared in result cube
also concatenate equivalents of those 2 merge tests

The problem of cube summary getting the dimension mappings wrong I demonstrated like this:

        >>> cube = Cube(np.zeros((2,2,3)))
        >>> cube.add_dim_coord(DimCoord(np.arange(2), long_name='t'), 0)
        >>> cube.add_dim_coord(DimCoord(np.arange(2), long_name='x'), 1)
        >>> cube.add_dim_coord(DimCoord(np.arange(3), long_name='y'), 2)
        >>> cube.add_ancillary_variable(AncillaryVariable(np.zeros((2,3))), (1,2))
        >>> print(cube)
        unknown / (unknown)                 (t: 2; x: 2; y: 3)
             Dimension coordinates:
                  t                           x     -     -
                  x                           -     x     -
                  y                           -     -     x
             Ancillary Datasets:
                  unknown                     x     x     x

Something like this should be in the above test.
However, this problem is already being addressed anyway : see "Fix cube summary of ancil vars" commit above. Nevertheless, a test exercising this will be useful..

pp-mo · 2019-10-23T12:34:44Z

However, I did some across some problems

It seems a considerable number of cube operations don't handle ancillary variables as they should do, of which the most worrying are ...

indexing (__getitem__) discards AVs
cube copy discards AVs
equality testing (__eq__) ignores them

So right now, if a cube contains AVS, cube.copy() is clearly different (and prints differently) but it tests equal !

But none of this will affect cubes that *don't * contain AVs, so in the interests of progress I think we'd better ignore this for now + defer all those problems to a new issue to tidy up the cube operations later.

lib/iris/tests/unit/cube/test_Cube.py

lbdreyer · 2019-10-23T14:47:19Z

@pp-mo I have added a test for ancillary variables in cube.summary (Note! we have very minimal testing for cube.summary in place! we should address, possibly if we also get a change to improve the html repr)

The tests for concatenate/merge made me realise that it wasn't being handled correctly (and consequently CellMeasures are also handled incorrectly when merging/concatenating). Unfortunately the work to get this done seems substantial and not something we will achieve in the next few hours. I have decided to back out the changes I previously made to _concatenate.py and _merge.py and instead fit that work into #3483

pp-mo · 2019-10-23T17:22:55Z

I think what's done here is ok, though a lot remains to be dealt with.
I won't be happy if we don't fix the obvious problems in the next few days! #3483

stickler-ci reviewed Sep 26, 2019

View reviewed changes

lib/iris/coords.py Outdated Show resolved Hide resolved

lbdreyer force-pushed the ancil_var branch from f9acfe4 to ff8d0ae Compare September 26, 2019 00:53

bjlittle self-assigned this Sep 26, 2019

bjlittle added Feature: NetCDF + CF-conventions Type: Enhancement labels Sep 26, 2019

bjlittle added this to the v2.3.0 milestone Sep 26, 2019

lbdreyer commented Sep 26, 2019

View reviewed changes

lib/iris/tests/unit/coords/test_CellMeasure.py Outdated Show resolved Hide resolved

lib/iris/tests/unit/coords/test_CellMeasure.py Show resolved Hide resolved

lib/iris/coords.py Outdated Show resolved Hide resolved

lib/iris/coords.py Outdated Show resolved Hide resolved

lbdreyer force-pushed the ancil_var branch from ff8d0ae to 9a52089 Compare September 26, 2019 10:06

bjlittle added the Status: Work in Progress label Sep 26, 2019

lbdreyer mentioned this pull request Sep 27, 2019

[PI] Quality Flag Handling #3358

Closed

5 tasks

lbdreyer modified the milestones: v2.3.0, v3.0.0 Sep 27, 2019

bjlittle added the Release: Major label Oct 3, 2019

bjlittle assigned lbdreyer and unassigned bjlittle and lbdreyer Oct 8, 2019

lbdreyer added the T-Shirt: Medium label Oct 10, 2019

lbdreyer self-assigned this Oct 10, 2019

lbdreyer force-pushed the ancil_var branch from 9a52089 to ef294b9 Compare October 11, 2019 13:58

stickler-ci reviewed Oct 11, 2019

View reviewed changes

lib/iris/coords.py Outdated Show resolved Hide resolved

lib/iris/coords.py Show resolved Hide resolved

stickler-ci reviewed Oct 14, 2019

View reviewed changes

lib/iris/tests/unit/coords/test_AncillaryVariable.py Outdated Show resolved Hide resolved

lbdreyer mentioned this pull request Oct 15, 2019

Include CellMeasures and AncillaryVariables in repr_html #3467

Closed

lbdreyer force-pushed the ancil_var branch from c7bfbde to 1478a64 Compare October 15, 2019 22:55

pp-mo reviewed Oct 17, 2019

View reviewed changes

lib/iris/coords.py Outdated Show resolved Hide resolved

pp-mo reviewed Oct 17, 2019

View reviewed changes

lib/iris/coords.py Outdated Show resolved Hide resolved

lbdreyer removed the Status: Work in Progress label Oct 18, 2019

stephenworsley reviewed Oct 18, 2019

View reviewed changes

This was referenced Oct 22, 2019

Ancil var strict lbdreyer/iris#6

Closed

Merge bounds handling into _DimensionalMetadata class. lbdreyer/iris#7

Merged

pp-mo and others added 2 commits October 23, 2019 12:46

Merge bounds handling into _DimensionalMetadata class. (#7)

d3f39b8

* Merge bounds handling into _DimensionalMetadata class. * Fix cube arithmetic bug. * Code style fix.

Review actions

f1f6e56

lbdreyer commented Oct 23, 2019

View reviewed changes

lib/iris/cube.py Outdated Show resolved Hide resolved

Fix cube summary of ancil vars

0e5eac0

pp-mo mentioned this pull request Oct 23, 2019

[PI] Cube operations support for Ancillary Variables #3483

Closed

6 tasks

lbdreyer added 2 commits October 23, 2019 15:21

Add cube.summary test for ancillary variables

05caaa5

Back out changes to concatenate and merge

892ddd0

stickler-ci reviewed Oct 23, 2019

View reviewed changes

lib/iris/tests/unit/cube/test_Cube.py Outdated Show resolved Hide resolved

lib/iris/tests/unit/cube/test_Cube.py Outdated Show resolved Hide resolved

pep 8 and license headers

e5cb181

pp-mo merged commit 25617ca into SciTools:master Oct 23, 2019

stephenworsley mentioned this pull request Nov 13, 2019

Fix cube equality for ancillary variables #3529

Closed

3 tasks

pp-mo mentioned this pull request Nov 19, 2019

Coord as cube #3485

Closed

trexfeathers mentioned this pull request Nov 20, 2019

Ancillary Variables - Fix concatenate, merge #3552

Closed

8 tasks

pp-mo mentioned this pull request Nov 29, 2019

PI-3473: Netcdf loading ancillary variables #3556

Merged

pp-mo mentioned this pull request Jun 3, 2020

Making coords into cubes and vice-versa pp-mo/iris#61

Open

bjlittle mentioned this pull request May 27, 2021

Fb fix cube coord arithmetic #4159

Merged

lbdreyer deleted the ancil_var branch June 23, 2021 15:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Include AncillaryVariables and restructure Cube metadata. #3422

Include AncillaryVariables and restructure Cube metadata. #3422

lbdreyer commented Sep 26, 2019

pp-mo commented Oct 14, 2019 •

edited

Loading

lbdreyer commented Oct 15, 2019

pp-mo Oct 17, 2019

stephenworsley Oct 18, 2019

lbdreyer Oct 21, 2019

pp-mo Oct 21, 2019

stephenworsley Oct 22, 2019 •

edited

Loading

pp-mo commented Oct 22, 2019 •

edited

Loading

pp-mo commented Oct 23, 2019 •

edited

Loading

pp-mo commented Oct 23, 2019 •

edited

Loading

lbdreyer commented Oct 23, 2019

pp-mo commented Oct 23, 2019

Include AncillaryVariables and restructure Cube metadata. #3422

Include AncillaryVariables and restructure Cube metadata. #3422

Conversation

lbdreyer commented Sep 26, 2019

pp-mo commented Oct 14, 2019 • edited Loading

lbdreyer commented Oct 15, 2019

pp-mo Oct 17, 2019

Choose a reason for hiding this comment

stephenworsley Oct 18, 2019

Choose a reason for hiding this comment

lbdreyer Oct 21, 2019

Choose a reason for hiding this comment

pp-mo Oct 21, 2019

Choose a reason for hiding this comment

stephenworsley Oct 22, 2019 • edited Loading

Choose a reason for hiding this comment

pp-mo commented Oct 22, 2019 • edited Loading

pp-mo commented Oct 23, 2019 • edited Loading

pp-mo commented Oct 23, 2019 • edited Loading

lbdreyer commented Oct 23, 2019

pp-mo commented Oct 23, 2019

pp-mo commented Oct 14, 2019 •

edited

Loading

stephenworsley Oct 22, 2019 •

edited

Loading

pp-mo commented Oct 22, 2019 •

edited

Loading

pp-mo commented Oct 23, 2019 •

edited

Loading

pp-mo commented Oct 23, 2019 •

edited

Loading