Is your feature request related to a problem?
Currently, tests/integration/test_diags.py runs the all_sets.cfg diagnostics, takes diffs of the resulting images, and compares them against a baseline (whatever is on Chrysalis). We set a diff threshold of 2% non-zero pixels. The issue with diffing two images is that any noise can break the test (e.g., a change in matplotlib formatting, a shifted legend, floating-point formatting, different font sizes). The baseline results sometimes need to be updated when matplotlib updates introduce side effects. The integration tests are challenging to debug and take a long time to run (#643), which bogs down development.
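For reference, below is a minimal sketch of the kind of pixel-diff check described above (the actual logic in tests/integration/test_diags.py may differ; the file names and the 2% threshold here are illustrative only):

```python
# Sketch of a pixel-diff check with a percentage threshold (illustrative;
# not the actual implementation in tests/integration/test_diags.py).
import numpy as np
from PIL import Image, ImageChops


def fraction_of_differing_pixels(actual_path: str, expected_path: str) -> float:
    actual = Image.open(actual_path).convert("RGB")
    expected = Image.open(expected_path).convert("RGB")
    # Per-pixel absolute difference; identical pixels become (0, 0, 0).
    diff = np.asarray(ImageChops.difference(actual, expected))
    nonzero = np.any(diff != 0, axis=-1).sum()
    return nonzero / (diff.shape[0] * diff.shape[1])


# A shifted legend or a font change can easily push this past a 2% threshold.
assert fraction_of_differing_pixels("actual.png", "expected.png") <= 0.02
```

Because every rendered pixel counts toward the threshold, purely cosmetic changes fail the test even when the plotted values are identical.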
For example, below are the actual image, the expected image, and the diff of the two. Notice that the diff is essentially just noise from the legend shifting over slightly and a change in the "Test" name.
Describe the solution you'd like
We should compare the underlying metrics in the .json files instead. Users should manually validate that the plots look as expected based on the metrics being plotted, since that is more reliable than pixel comparisons.
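A minimal sketch of what a metrics-based comparison could look like (the file names, the flat key/value structure of the .json metrics, and the tolerance are assumptions for illustration, not the actual e3sm_diags output format):

```python
# Sketch of comparing metrics .json files instead of images (illustrative;
# file layout and key structure are assumptions, not the real output format).
import json

import numpy as np


def assert_metrics_close(actual_path: str, expected_path: str, rtol: float = 1e-5) -> None:
    with open(actual_path) as f:
        actual = json.load(f)
    with open(expected_path) as f:
        expected = json.load(f)

    assert actual.keys() == expected.keys(), "Metrics keys differ"
    for key, expected_value in expected.items():
        if isinstance(expected_value, (int, float)):
            # A numerical tolerance avoids failures from floating-point formatting.
            np.testing.assert_allclose(actual[key], expected_value, rtol=rtol, err_msg=key)
        else:
            assert actual[key] == expected_value, f"Mismatch for {key}"


# Example usage (hypothetical file names):
# assert_metrics_close("actual_metrics.json", "baseline_metrics.json")
```

Comparing numbers with a relative tolerance sidesteps the rendering noise entirely, while still catching regressions in the computed diagnostics.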
Describe alternatives you've considered
No response
Additional context
No response