Error in writing land model output for e3smv2 run #6215
Similar issue here: #6195, with a potential fix.
Sounds like you are running into an issue that we just posted a fix for in #6206.
Note that the fix won't help when restarting from a previous run that didn't have the fix, because the 'fixed' model will look for a variable in the lnd restart file that was not saved in the previous run. So, the way to go is:
OK, thanks for pointing out the fix, as well as the details about needing to start the run anew.
I am running E3SMv2 on the maint-2.0 branch (date 20231227) and am hitting an error when writing the land model output, which kills the job on Perlmutter. I have successfully run 1970-2014 with compset F2010 and a time-varying SST using model_start_type="initial". However, when I use the end of that run as the initial conditions for a new 45-year run starting in 1970, the problem appears. I run with model start type "hybrid" and specify all the required information for the reference case. It runs fine for 4 years (STOP_N="4"), but after it resubmits it crashes almost immediately while trying to write the land model .h0. file, with the error at the end of this post. I tried setting BFBFLAG to false in case this was a numerical instability that could be avoided, but that did not help.
If there is a different forum I should post this to please let me know. Thanks!
256: PIO: FATAL ERROR: Aborting... An error occured, Writing variables (number of variables = 180) to file (./20231227.v2.LR.F2010-FAMIP-hybrid-01-19-45year-M.elm.h0.1973-01.nc, ncid=149) using PIO_IOTYPE_PNETCDF iotype failed. Non blocking write for variable (TWS_MONTH_BEGIN, varid=205) failed (Number of subarray requests/regions=1, Size of data local to this process = 5400). NetCDF: Numeric conversion not representable (err=-60). Aborting since the error handler was set to PIO_INTERNAL_ERROR... (/global/u2/r/rebassoo/E3SM/fme/code/20231227-maint20/externals/scorpio/src/clib/pio_darray_int.c: 395)