tests: run 'cice.run' with '-nomodules' #724

phil-blain · 2022-05-24T20:37:30Z

PR checklist

Some compiler environments (namely newer version of Intel's oneAPI
toolkit) prevent users form sourcing their initialization script twice,
at least by default.

The CICE test infrastructure currently does this, because
env.$machine_$env is sourced by cice.test and then again by cice.run
(which is ran from cice.test). Sourcing the env file in 'cice.run', when
ran from 'cice.test', is thus unnecessary.

Most env files already support the '-nomodules' flag, which is used by
'cice.setup' when only machine variables are needed, and not a full
compiling environment. Leverage this flag by calling 'cice.run' with
'-nomodules' in 'cice.test'. As a byprodcut, this should make tests run
slightly faster since the environment setup is done only once.

Some compiler environments (namely newer version of Intel's oneAPI toolkit) prevent users form sourcing their initialization script twice, at least by default. The CICE test infrastructure currently does this, because env.$machine_$env is sourced by cice.test and then again by cice.run (which is ran from cice.test). Sourcing the env file in 'cice.run', when ran from 'cice.test', is thus unnecessary. Most env files already support the '-nomodules' flag, which is used by 'cice.setup' when only machine variables are needed, and not a full compiling environment. Leverage this flag by calling 'cice.run' with '-nomodules' in 'cice.test'. As a byprodcut, this should make tests run slightly faster since the environment setup is done only once. Closes: CICE-Consortium#695

apcraig · 2022-05-24T21:27:57Z

How does passing -nomodules to cice.run do anything without a modification to the cice.run script? I don't think cice.run uses or checks for arguments.

phil-blain · 2022-05-24T22:29:52Z

The arguments are passed-through to source env.$machine_$env, so $1 in env.$machine_$env correctly refers to -nomodules from the ./cice.run -nomodules invocation.

apcraig · 2022-05-24T22:40:06Z

That's an interesting feature. So $1, unless reset, just retains it's "last" value. When env.machine_env is sourced, if nothing it passed into it, it doesn't reset the $1, $2, $3 arguments? That seems a little dicey. What if you pass an argument into cice.run that's used in cice.run. Then pass nothing into env.machine_env and it picks up on the $1 in cice.run, which might be completely useless or incorrect? I would have expected the local $n to default to unset by default unless an argument is explicitly set in the execution. Do we really want to leverage/rely on this feature?

phil-blain · 2022-05-25T16:15:47Z

So I checked the Bash documentation (https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#Bourne-Shell-Builtins) which says:

. (a period)
. filename [arguments]
Read and execute commands from the filename argument in the current shell context. [...] If any arguments are supplied, they become the positional parameters when filename is executed. Otherwise the positional parameters are unchanged.

So for Bash it's a documented behaviour.

But we use csh. Here is what man csh (http://manpages.ubuntu.com/manpages/bionic/man1/bsd-csh.1.html) has to say about the source builtin:

       source name
       source -h name
               The shell reads commands from name.  source commands may be nested; if they
               are nested too deeply the shell may run out of file descriptors.  An error in
               a source at any level terminates all nested source commands.  Normally input
               during source commands is not placed on the history list; the -h option causes
               the commands to be placed on the history list without being executed.

So it is not specified what happens to existing positional arguments, but I understand it to mean that they are implicitely passed. Note that the original csh does not allow any arguments to be passed explicitely to source.

tcsh (http://manpages.ubuntu.com/manpages/bionic/man1/tcsh.1.html) does not add that much info, but it does allow positional arguments to be explicitely passed:

   source [-h] name [args ...]
           The shell reads and executes commands from name.  The commands are not  placed  on
           the  history  list.   If  any args are given, they are placed in argv.  (+) source
           commands may be nested; if they are nested too deeply the shell  may  run  out  of
           file  descriptors.  An error in a source at any level terminates all nested source
           commands.  With -h, commands are placed on  the  history  list  instead  of  being
           executed, much like `history -L'.

So I think we can safely use that feature.

apcraig · 2022-05-26T16:59:30Z

Good to know this feature is likely to be robust. I guess another question is do we really want to leverage this feature. It seems to lack transparency and may cause confusion later on. I would almost prefer if cice.run had an explicit check if -nomodules was passed and if so, that it would be passed to "source env.machine_env" explicitly. Thoughts?

phil-blain · 2022-05-26T18:29:33Z

It depends if we want to drop support for the original csh and just claim outright that our scripts need tcsh to work, because the "original" csh does not accept additional arguments to its source builtin (as the man page excerpts above mention).

I would not be opposed to that as along as all shebangs are changed to #!/usr/bin/tcsh or maybe better #!/usr/bin/env tcsh, but then we might need #!/usr/bin/env -S tcsh -fif we want to keep -f, and then we need env from GNU coreutils 8.30 or newer (EDIT because that's where the -S flag` which allows passing additional arguments to the program name was added).

apcraig · 2022-05-26T18:54:51Z

OK, so one question is whether we are even allowed to pass arguments with csh. What we're doing now should not work. Our scripts are #!/bin/csh but we pass arguments into them. It's working only because it's either using tcsh under the covers or because the implementation of csh under the covers allows it. Or maybe the argument is ignored and we can't tell.

How much risk is there to requiring tcsh in terms of portability? Since the current "csh" seems to be working OK, should we fix this?

A separate question still seems to be whether we want to explicitly or implicitly pass the arguments down the calling tree. I do think I prefer explicit. That would mean adding something like the following to cice.run

set nomodules = ""
if ($#argv == 1) then
  if ($1 == "-nomodules") then
    set nomodules = $1
  endif
endif

.....

source ./env.\${ICE_MACHCOMP} ${nomodules} || exit 2

or similar. Thoughts on both issues?

phil-blain · 2022-05-26T20:07:16Z

Passing arguments to scripts does work in csh, it's passing additional arguments to the source builtin that only works in tcsh i.e.

# this works in csh and tcsh
source some_file
# this only works in tcsh
source some_file -some_argument

apcraig · 2022-06-03T15:55:11Z

So all our scripts are basically setup as csh, but may be running tcsh under the covers. The -nomodules option is mainly useful to speed up running large test suites. I don't think most users do that.

Let me circle back again. Is this to help performance? What does "Some compiler environments (namely newer version of Intel's oneAPI toolkit) prevent users form sourcing their initialization script twice, at least by default" mean? Is there a failure or problem resetting the env with Intel oneAPI?

phil-blain · 2022-06-06T13:46:52Z

What does "Some compiler environments (namely newer version of Intel's oneAPI toolkit) prevent users form sourcing their initialization script twice, at least by default" mean? Is there a failure or problem resetting the env with Intel oneAPI?

It means that the Intel initialization script detects it is being sourced a second time, and aborts, yes. You can force it to do it with a special flag though.

But since I opened this PR, our CS departement has added additional methods to load the compiler, so these changes are not needed for me anymore. So we can drop it if you prefer.

apcraig · 2022-06-13T07:44:38Z

Thanks @phil-blain. At this point, if the feature is not needed, I guess I prefer we just close this PR. I think there is a bigger issue about scripting language and overall implementation. @phil-blain, do you think we should switch to bash or some other language/tool? Is it time to review the current scripts and do some refactoring? Maybe we need a new issue where we can accumulate additional script ideas and then find a time to update? This could include a variety of things including #674, for instance.

phil-blain · 2022-06-13T17:08:20Z

I'm ambivalent about switching tools / languages. It might be quite an involved process...

phil-blain · 2022-06-13T17:09:14Z

I'll close this PR.

phil-blain requested a review from apcraig May 24, 2022 20:37

phil-blain mentioned this pull request May 24, 2022

cice.test should call cice.run with -nomodules #695

Closed

apcraig added the Scripts label May 24, 2022

phil-blain closed this Jun 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests: run 'cice.run' with '-nomodules' #724

tests: run 'cice.run' with '-nomodules' #724

phil-blain commented May 24, 2022

apcraig commented May 24, 2022

phil-blain commented May 24, 2022

apcraig commented May 24, 2022

phil-blain commented May 25, 2022

apcraig commented May 26, 2022

phil-blain commented May 26, 2022 •

edited

Loading

apcraig commented May 26, 2022

phil-blain commented May 26, 2022 •

edited

Loading

apcraig commented Jun 3, 2022

phil-blain commented Jun 6, 2022

apcraig commented Jun 13, 2022

phil-blain commented Jun 13, 2022

phil-blain commented Jun 13, 2022

tests: run 'cice.run' with '-nomodules' #724

tests: run 'cice.run' with '-nomodules' #724

Conversation

phil-blain commented May 24, 2022

PR checklist

apcraig commented May 24, 2022

phil-blain commented May 24, 2022

apcraig commented May 24, 2022

phil-blain commented May 25, 2022

apcraig commented May 26, 2022

phil-blain commented May 26, 2022 • edited Loading

apcraig commented May 26, 2022

phil-blain commented May 26, 2022 • edited Loading

apcraig commented Jun 3, 2022

phil-blain commented Jun 6, 2022

apcraig commented Jun 13, 2022

phil-blain commented Jun 13, 2022

phil-blain commented Jun 13, 2022

phil-blain commented May 26, 2022 •

edited

Loading

phil-blain commented May 26, 2022 •

edited

Loading