Update albedo initialization for identical order of operations #303

apcraig · 2020-03-06T01:31:16Z

PR checklist

With the iso1 changes, one test was failing exact restart on one machine and compiler. After some debugging, it turns out to be an order of operations issue in initialization of albedos that has been there forever but never resulted in problems with exact restart. For whatever reason, this one test on one compiler on one machine picked it up finally. It is fixed and the answers are also bit-for-bit with prior results.

I also fixed initialization of nt_bgc_S when solve_zsal is off. It was not uniquely defined but this did not seem to matter to runs. Still, it should be set.

I added some diagnostics for restart read and write to more easily confirm identical fields written and read.

I modified how the ice_diag files are saved to the log directory by attaching the run timestamp to the filenames. This was long overdue. Prior, runs in the same directory (including restart tests) just overwrote the files on each run. We may want to eventually also change the way we save baselines or rename files in the run directory as this feature is only implemented in the copy of the ice_diag files to the case log directory.

I removed an OMP directive in the albedo initialization as it will not materially help performance since the code is run once. It's also a trivially cheap piece of code.

…minor updates

apcraig · 2020-03-06T01:32:58Z

It would be good to get this fix into the repo by Friday evening for weekend testing. We may also want to update CICE prior to the weekend if we can as well. Will see how reviews go.

dabail10

Looks like there are some leftover print statements. Otherwise, it looks fine.

dabail10 · 2020-03-06T01:45:58Z

configuration/driver/icedrv_restart.F90

@@ -339,7 +339,8 @@ subroutine read_restart_field(nu,work,ndim)

      minw = minval(work)
      maxw = maxval(work)
-      write(nu_diag,*) minw, maxw
+      sumw = sum(work)
+      write(nu_diag,*) subname, minw, maxw, sumw



Do you still want the print statements here?

They were there before, and I've found them to be handy.

These were there before and I just extended the feature a bit. One of the first things I wanted to check was whether all the fields were read as written and whether they were identical. It's helpful to have the same diagnostics written when writing a restart as when reading it, and the sum is a new diagnostic that provides something like a checksum value of the field.

eclare108213

Awesome! Thank you @apcraig
There's no such thing as "bug-free code"... it's surprising this one hasn't popped up before.

eclare108213 · 2020-03-06T02:03:04Z

configuration/driver/icedrv_restart.F90

@@ -339,7 +339,8 @@ subroutine read_restart_field(nu,work,ndim)

      minw = minval(work)
      maxw = maxval(work)
-      write(nu_diag,*) minw, maxw
+      sumw = sum(work)
+      write(nu_diag,*) subname, minw, maxw, sumw



They were there before, and I've found them to be handy.

apcraig · 2020-03-06T02:17:06Z

Just one other comment. The problem, in some ways, is that the same code is implemented twice. I fixed it so they would compute fields with the same order of operations. But the right thing to do would be to pull out this code and create a subroutine with it where both parts of the model are calling it. This is all in the icepack driver, so it wouldn't affect icepack columnphysics. I didn't do that as it's a bit more intrusive, I thought I'd just keep the fix as simple as possible. But it's something that could or maybe should be done.

* serial fix for cheyenne * fix hobart compile

fix albedo initialization operation order for bit-for-bit plus other …

b4b81fe

…minor updates

apcraig requested review from eclare108213 and dabail10 March 6, 2020 01:31

dabail10 reviewed Mar 6, 2020

View reviewed changes

eclare108213 approved these changes Mar 6, 2020

View reviewed changes

apcraig merged commit 6a6bb3f into CICE-Consortium:master Mar 6, 2020

lettie-roach pushed a commit to lettie-roach/Icepack that referenced this pull request Oct 18, 2022

Fix hobart and cheyenne compile issues (CICE-Consortium#303)

e2822e3

* serial fix for cheyenne * fix hobart compile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update albedo initialization for identical order of operations #303

Update albedo initialization for identical order of operations #303

apcraig commented Mar 6, 2020

apcraig commented Mar 6, 2020

dabail10 left a comment

dabail10 Mar 6, 2020

eclare108213 Mar 6, 2020

apcraig Mar 6, 2020

eclare108213 left a comment

eclare108213 Mar 6, 2020

apcraig commented Mar 6, 2020

Update albedo initialization for identical order of operations #303

Update albedo initialization for identical order of operations #303

Conversation

apcraig commented Mar 6, 2020

PR checklist

apcraig commented Mar 6, 2020

dabail10 left a comment

Choose a reason for hiding this comment

dabail10 Mar 6, 2020

Choose a reason for hiding this comment

eclare108213 Mar 6, 2020

Choose a reason for hiding this comment

apcraig Mar 6, 2020

Choose a reason for hiding this comment

eclare108213 left a comment

Choose a reason for hiding this comment

eclare108213 Mar 6, 2020

Choose a reason for hiding this comment

apcraig commented Mar 6, 2020