ILAMB for Cori #253

Merged: forsyth2 merged 1 commit into main from ilamb-cori on Jun 13, 2022

forsyth2 (Collaborator) commented Jun 7, 2022

Support ILAMB on Cori. The Cori component of #229.

forsyth2 added the "semver: new feature" label (New feature, will increment minor version) on Jun 7, 2022
forsyth2 self-assigned this on Jun 7, 2022
forsyth2 (Collaborator, Author) commented Jun 8, 2022

@chengzhuzhang I get the following "channel initialization failed" error when using the cfg below. It is the same issue as in #247 (comment). The error occurs at https://github.com/E3SM-Project/zppy/blob/main/zppy/templates/ts.bash#L128.

Error:

+ e3sm_to_cmip --output-path /global/cscratch1/sd/forsyth/zppy_ilamb_cori_output/v2.LR.historical_0201/post/atm/180x360_aave/cmip_ts/monthly/tmp_ts_atm_monthly_180x360_aave_1851-1852-0002 --var-list 'pr, tas, rsds, rlds, rsus' --realm atm --input-path /global/cscratch1/sd/forsyth/zppy_ilamb_cori_output/v2.LR.historical_0201/post/atm/180x360_aave/ts/monthly/2yr --user-metadata /global/cscratch1/sd/forsyth/zppy_ilamb_cori_output/v2.LR.historical_0201/post/scripts/tmp.59944542.5Zje/default_metadata.json --num-proc 12 --tables-path /global/cfs/cdirs/e3sm/diagnostics/cmip6-cmor-tables/Tables
Tue Jun  7 20:38:08 2022: [unset]:_pmi_alps_init:alps_get_placement_info returned with error -1
Tue Jun  7 20:38:08 2022: [unset]:_pmi_init:_pmi_alps_init returned -1
[Tue Jun  7 20:38:08 2022] [c7-5c1s1n1] Fatal error in PMPI_Init_thread: Other MPI error, error stack:
MPIR_Init_thread(537):
MPID_Init(246).......: channel initialization failed
MPID_Init(647).......:  PMI2 init failed: 1
/var/spool/slurmd/job59944542/slurm_script: line 234: 44767 Aborted                 e3sm_to_cmip --output-path ${dest_cmip}/${tmp_dir} --var-list 'pr, tas, rsds, rlds, rsus' --realm atm --input-path ${input_dir} --user-metadata /global/cscratch1/sd/forsyth/zppy_ilamb_cori_output/v2.LR.historical_0201/post/scripts/${workdir}/default_metadata.json --num-proc 12 --tables-path ${cmortables_dir}
+ '[' 134 '!=' 0 ']'
+ cd /global/cscratch1/sd/forsyth/zppy_ilamb_cori_output/v2.LR.historical_0201/post/scripts
+ echo 'ERROR (3)'
+ exit 3

cfg:

[default]
case = v2.LR.historical_0201
environment_commands = "source /global/common/software/e3sm/anaconda_envs/load_latest_e3sm_unified_cori-haswell.sh"
input = /global/cscratch1/sd/forsyth/ilamb_cori_input/v2.LR.historical_0201
input_subdir = archive/atm/hist
mapping_file = /global/homes/z/zender/data/maps/map_ne30pg2_to_cmip6_180x360_aave.20200201.nc
output = /global/cscratch1/sd/forsyth/zppy_ilamb_cori_output/v2.LR.historical_0201
partition = haswell
walltime = "02:00:00"
www = /global/cfs/cdirs/e3sm/www/forsyth/zppy_ilamb_cori_output

[ts]
active = True
frequency = "monthly"
years = "1851:1855:2",
ts_fmt = "cmip"

  [[ atm_monthly_180x360_aave ]]
  input_files = "eam.h0"

  [[ land_monthly ]]
  input_files = "elm.h0"
  input_subdir = "archive/lnd/hist"
  vars = "FSH,LAISHA,LAISUN,RH2M"

[ilamb_run]
active = True
grid = '180x360_aave'
short_name = 'v2.LR.historical_0201'
ts_num_years = 2
years = "1851:1853:2",

forsyth2 (Collaborator, Author) commented

> channel initialization failed error

Using srun fixes this error.
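
A minimal sketch of what "using srun" could look like, not the change actually committed in this PR. The `alps_get_placement_info` failure in the log is typical of an MPI-linked program started directly on a Cray compute node instead of through the Slurm launcher. The helper name `launcher_prefix`, the `SLURM_JOB_ID` check, and the `-N 1` flag are assumptions for illustration; the command is printed rather than executed, and its arguments are shortened.

```bash
# Hypothetical helper (not part of zppy): pick a launcher prefix.
# Assumption: inside a Slurm batch job (SLURM_JOB_ID is set) the command
# should be started through srun; elsewhere it is run directly.
launcher_prefix() {
    if [ -n "${SLURM_JOB_ID:-}" ]; then
        echo "srun -N 1"   # flag choice is an assumption
    else
        echo ""
    fi
}

prefix=$(launcher_prefix)

# Dry run: print the command instead of executing it.
# ${prefix} is left unquoted on purpose so it splits into separate words.
echo ${prefix} e3sm_to_cmip --realm atm --num-proc 12
```

Inside a batch job this would print the `e3sm_to_cmip` command with `srun -N 1` in front of it; outside one, the bare command.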

forsyth2 (Collaborator, Author) commented

Not going to commit the example cfg, but copying here for reference:

[default]
case = v2.LR.historical_0201
environment_commands = "source /global/common/software/e3sm/anaconda_envs/load_latest_e3sm_unified_cori-haswell.sh"
input = /global/cscratch1/sd/forsyth/ilamb_cori_input/v2.LR.historical_0201
input_subdir = archive/atm/hist
mapping_file = /global/homes/z/zender/data/maps/map_ne30pg2_to_cmip6_180x360_aave.20200201.nc
output = /global/cscratch1/sd/forsyth/zppy_ilamb_cori_output/v2.LR.historical_0201
partition = haswell
qos = debug
walltime = "00:30:00"
www = /global/cfs/cdirs/e3sm/www/forsyth/zppy_ilamb_cori_output

[ts]
active = True
frequency = "monthly"
years = "1851:1855:2",
ts_fmt = "cmip"

  [[ atm_monthly_180x360_aave ]]
  input_files = "eam.h0"

  [[ land_monthly ]]
  input_files = "elm.h0"
  input_subdir = "archive/lnd/hist"
  vars = "FSH,LAISHA,LAISUN,RH2M"

[ilamb_run]
active = True
grid = '180x360_aave'
short_name = 'v2.LR.historical_0201'
ts_num_years = 2
years = "1851:1853:2",
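
For reference, the `years` values above use a `start:end:step` form. The sketch below shows one reading of it, inferred only from the `1851-1852-0002` directory name in the error log: each window spans `step` years and `end` is treated as exclusive. The function `expand_years` is hypothetical and is not zppy's own parser.

```bash
# Hypothetical helper (not zppy code): expand "start:end:step" into
# inclusive year windows, assuming the end bound is exclusive.
expand_years() {
    spec=$1
    start=${spec%%:*}   # text before the first colon
    rest=${spec#*:}     # text after the first colon
    end=${rest%%:*}
    step=${rest#*:}
    y=$start
    # Emit a window only if it fits entirely before the end bound.
    while [ $((y + step)) -le "$end" ]; do
        echo "${y}-$((y + step - 1))"
        y=$((y + step))
    done
}

expand_years "1851:1855:2"
```

Under that assumption, the `[ts]` setting yields the windows 1851-1852 and 1853-1854, and the `[ilamb_run]` setting `1851:1853:2` yields only 1851-1852, which matches `ts_num_years = 2`.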

forsyth2 marked this pull request as ready for review on June 13, 2022, 19:03
forsyth2 merged commit 5eb7f22 into main on Jun 13, 2022
forsyth2 deleted the ilamb-cori branch on June 13, 2022, 19:14
forsyth2 mentioned this pull request on Dec 16, 2022 (14 tasks)