Development dry run #307

Closed
wants to merge 46 commits into from
Conversation

valeriupredoi
Contributor

@valeriupredoi commented Oct 11, 2019

OK chaps, this is the start of ESMValGroup/ESMValTool#1365. Currently it does pretty much all it needs to do:

  • run the tool with the --dry-run flag and it will go through the recipe and run a default preprocessor, without stopping where data is missing;
  • it will tell you where data is missing via MISSING DATA messages in the logger;
  • it runs the CMOR checks and fixes without writing out the fixed files, then exits without doing anything else (no diagnostics) - writing to disk is switched off completely, apart from the main_log, main_log_debug and resources files (what's in run);
  • it works seamlessly with derived variables and with fx variables too;

It is pretty much the standard esmvaltool workflow, but with everything beyond the basics removed, and obviously no diagnostics.

CAVEAT: if there are issues with the CMOR checks (either data or metadata) it will stop there, just as a production esmvaltool run would; I need @jvegasbsc's advice on how to just log those errors and let the run carry on. I suggest using the same dry_check flag that gets passed through _recipe.py and enables the dry run, but that means changing the error handling in cmor/ a bit - what say you? 🍺

@valeriupredoi
Contributor Author

BTW guys - this, a rather complicated workflow rerouting, came in at +42/-21 lines -> a mark of what a robust codebase we have - cheers to you all 🍺 See you in Germany next week, pub time now 🍺

@valeriupredoi
Contributor Author

OK, this one now works without saving output and properly without running (or trying to run) any diagnostic; ready for review, chaps 🍺

@valeriupredoi
Contributor Author

still waiting on #374 - giddyup guys! 🍺

@stefsmeets
Contributor

stefsmeets commented May 21, 2021

Hi @valeriupredoi , I just had a look at this PR, and I'm wondering if it is still relevant with #917 merged. I merged master, and although I'm not super confident I got everything right (there were a lot of code changes generating merge conflicts), I did get it to work... although I'm not sure what kind of output to expect 😅.

@valeriupredoi
Contributor Author

Cheers muchly, @stefsmeets 🍺 You shouldn't get any output, since the purpose of this functionality is not to do any analysis but rather to alert the user to missing data, issues with the data (CMOR hiccups), etc. I am not sure about the future of this PR, though: @bouweandela was not keen to get it in the last time I spoke to him, but @axel-lauer was. I am on the fence myself, since we can now throttle the CMOR checks and we get better error messages via #917 - what do people think?

@stefsmeets
Contributor

That's why I was asking if it is still relevant. Personally, I like the idea, because it would take away some of my own frustrations with missing data. Having looked at the code, however, the changes are quite convoluted and they do not make the existing code easier to understand. I'm questioning whether this is the right implementation.

@jvegreg
Contributor

jvegreg commented May 21, 2021

I think #917 is good enough for when data is missing. I also think we still need a way to check the data without running the full recipe.

About the implementation, it would be possible to just add a dry_run parameter to the task._run function, so that we only need to modify the PreprocessingTask._run implementation to run the critical preprocessor steps (basically load and checking, maybe area / level selection) when that flag is set. For DiagnosticTask it would simply be a pass.

I have some doubts about how this would interact with the multi-model statistics, though, mostly because I don't have a clear idea of how that is implemented.
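
A minimal sketch of that idea, assuming simplified task classes and a dry_run flag (the class, attribute, and step names below are illustrative assumptions, not the actual ESMValCore task API):

```python
# Illustrative sketch only: class, attribute, and step names are
# assumptions, not the actual ESMValCore task API.
class PreprocessingTask:
    def __init__(self, settings, dry_run=False):
        self.settings = settings  # ordered mapping of preprocessor steps
        self.dry_run = dry_run

    def _run(self, input_files):
        if self.dry_run:
            # Keep only the "critical" steps: loading, fixes and CMOR
            # checks; skip regridding, statistics, saving, etc.
            critical = ('load', 'fix_metadata', 'cmor_check_metadata',
                        'fix_data', 'cmor_check_data')
            steps = {k: v for k, v in self.settings.items() if k in critical}
        else:
            steps = self.settings
        return self._apply_steps(steps, input_files)

    def _apply_steps(self, steps, input_files):
        ...  # apply each preprocessor step in order (omitted here)


class DiagnosticTask:
    def __init__(self, script, dry_run=False):
        self.script = script
        self.dry_run = dry_run

    def _run(self, input_files):
        if self.dry_run:
            return []  # dry run: do not launch the diagnostic script
        ...  # normal behaviour: run self.script on the input files
```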

@bouweandela
Member

The implementation here indeed looks a bit too complicated. All that would be needed now to just run the CMOR checks is some way to disable saving the preprocessor output to file, right? We already have a nice report for missing data and a way to disable running diagnostics.
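
For illustration, a rough sketch of what skipping the save step could look like, assuming hypothetical function and flag names (this is not existing ESMValCore code or configuration):

```python
# Illustrative only: the function and the save_output flag are assumed
# names for this sketch, not existing ESMValCore code or configuration.
import iris


def check_only_preprocess(filename_in, filename_out, save_output=True):
    """Load the data, run (placeholder) checks, and optionally save."""
    cubes = iris.load(filename_in)
    # ... CMOR fixes and checks would run on `cubes` here ...
    if save_output:
        # Writing the preprocessed data is the part a "checks only"
        # mode could skip to save time and disk space.
        iris.save(cubes, filename_out)
    return cubes
```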

@valeriupredoi
Contributor Author

I reckon Bouwe is right, Javi - all we need is a pass through the data to find the missing files and detect CMOR issues (depending on the chosen check level), not a full run of the preprocessor - that takes time and memory, and if we don't save any data, what's the point? Gonna have a look at it now 🍺

@valeriupredoi
Contributor Author

Out of curiosity, do we have a separate report just for missing data? I don't think so, and the relevant messages are buried in all that screen output (or the log), which, even when not in debug mode, is still a lot to sift through.

@valeriupredoi
Contributor Author

Sadly this has been sidelined for too long and now it's as obsolete as a phone booth. Closing it 😢

Labels: enhancement (New feature or request)
8 participants