transfer files using a separate `paramiko` channel #171

keewis · 2022-08-17T15:28:21Z

closes issues with shell escaping in the batch script #168

The lower-level paramiko library allows using a separate channel to execute something similar to:

python -c 'script = "..."; print(script)' | ssh user@host 'cat - >remote_file'

where script is never evaluated by the shell.

I'm not sure if there's something like this in fabric; I couldn't find anything that would avoid using SFTP (which the admins on my HPC disabled). There were references to SCP in the documentation of fabric, but I don't think we should depend on that: as far as I remember, that protocol is deprecated.

I tried to include a test, but I can't seem to understand the test setup so I can't verify if it passes (I guess I will need to wait on CI for that).

andersy005 · 2022-08-17T15:35:27Z

thank you for this addition, @keewis! The CI has been failing for a while. I plan to look into this today. If you don't mind waiting, we can merge this once the CI is fixed.

Are you able to test this feature on your local HPC machine?

keewis · 2022-08-17T17:42:37Z

no worries, I'm not in a hurry to get this merged so you can take your time (though afterwards it would be great to get a release).

Re testing: of course I did a manual "integration test" on my local HPC (run jupyter-forward --launch-command '...') to verify that it works (it does, but that might be specific to my setup)

… shell

for more information, see https://pre-commit.ci

keewis · 2022-08-18T21:45:02Z

it would probably help very much with the debugging if we could figure out how to have pytest properly redirect stdin, stdout, and stderr.

My guess is that the reason for broken redirect is that the console is instantiated on import while pytest redirects when entering the test function's context. If that's correct, this might be fixed by instantiating console in the runner's __post_init__ (or by monkeypatching jupyter_forward.core.console).

I can try sending in a PR tomorrow, if that sounds good to you.

andersy005 · 2022-08-18T21:50:38Z

I can try sending in a PR tomorrow, if that sounds good to you.

Yes, please. That would be very useful...

keewis · 2022-08-19T09:54:57Z

okay, so after some investigation, it seems that this is caused by the -s option to pytest (configured in setup.cfg) and doesn't have anything to do with the console. This is necessary because we're trying to read from stdin using getpass.getpass.

I'm guessing we would have to monkeypatch _authentication_handler / getpass.getpass in all of the core tests to avoid that.

Another option would be to take the authentication handlers as parameters (one for auth_interactive_dumb and one for auth_password), which would allow us to specify dummy authentication handlers.

I'd probably prefer the latter because it results in a cleaner architecture (monkeypatching is usually a code smell).

Edit: as it turns out, invoke also tries to do operations on stdin (even if the command itself never uses it), which is where the OSErrors is coming from if we run pytest without -s. So not sure how to fix this if we don't want to switch to bare paramiko?

Edit2: setting in_stream=False in every call to session.run, e.g. by overriding run.in_stream in fabric's configuration object, might work as well if we don't need to have remote processes read from stdin.

tests/test_core.py

since we don't actually use the shell

keewis · 2022-08-19T16:01:25Z

well, turns out the issue was not the write but verifying the contents of the file. That means that we can also ignore all the different shells (we don't actually use them) and just take the default shell.

andersy005

This looks great, @keewis! Thanks again for this addition 🎉

andersy005 · 2022-08-19T16:18:49Z

i'm going to merge this shortly unless you have additional changes ...

mnlevy1981 · 2022-08-19T16:35:54Z

well, turns out the issue was not the write but verifying the contents of the file. That means that we can also ignore all the different shells (we don't actually use them) and just take the default shell.

I've been trying to follow this conversation, but I'm a little lost. I saw that you introduced a new test (test_put_file()), and this test was failing on every shell except bash. It sounds like you're okay with this failing on the other shells, and I just want a little clarification about why... is this an issue with the CI environment rather than the code? Is bash the only shell that actually calls runner.put_file()? I guess I'm just asking for a little reassurance that we won't have tcsh users running into the same error that CI was reporting before we removed tcsh from the list of shells that are being tested.

keewis · 2022-08-19T17:57:24Z

I guess I'm just asking for a little reassurance that we won't have tcsh users running into the same error

no worries, I'm happy to answer any questions you might have (I might not be able to answer some of them, though, I'm no expert on shell startup, and in particular I don't know csh / tcsh at all)

Slight correction: the test was passing on every shell except tcsh.

As far as I can tell, the reason it failed was the line of the test where we get the contents of the newly created file to check that it actually contains what we wanted to put there:

jupyter-forward/tests/test_core.py

Line 143 in 5e4230a

out = runner.run_command(f'cat {path}')

Changing the test to call runner.session.run does make it work:

jupyter-forward/tests/test_core.py

Line 143 in 28da4f4

out = runner.session.run(f'cat {path}')

which to me indicates that there's something wrong in run_command, or, more likely, that a common shell startup file (like ~/.profile) contains code incompatible with tcsh, which decided to complain about that to stdout (and all commands executed by _set_log_directory would print the same warning, so...).

Now that the test does not use run_command anymore (which I think is the only time the shell would ever be used), I thought that we wouldn't need to test exactly the same thing for each shell. However, if you don't think that's the case, or just feel safer to run the test on all shells that's fine with me, too.

In any case, TL;DR: the issue was in the test code and not put_file, and put_file works (and is used) regardless of the chosen shell.

mnlevy1981 · 2022-08-19T18:52:49Z

Thanks for clarifying!

Now that the test does not use run_command anymore (which I think is the only time the shell would ever be used), I thought that we wouldn't need to test exactly the same thing for each shell. However, if you don't think that's the case, or just feel safer to run the test on all shells that's fine with me, too.

Given your response, I don't think we need to run this test on every shell so I vote for leaving it bash-only.

keewis · 2022-08-19T20:01:38Z

great! @andersy005, I think this should be ready for merging then. Thanks a lot for the reviews and help!

Edit: and just after posting this I found a few things to improve...

jupyter_forward/core.py

tests/test_core.py

keewis and others added 5 commits August 17, 2022 19:51

add a function to write to a file without exposing the content to the…

afc7613

… shell

use the paramiko client instead of the fabric connection

4773a10

use the new put_file command to create the file

5472e95

display the batch script in the terminal

d571816

[pre-commit.ci] auto fixes from pre-commit.com hooks

6816de2

for more information, see https://pre-commit.ci

keewis force-pushed the file-transfer branch from c6be3e1 to 6816de2 Compare August 17, 2022 17:52

keewis and others added 3 commits August 18, 2022 12:29

fix the test

c2dd3c3

Merge branch 'main' into file-transfer

b69c211

Merge branch 'main' into file-transfer

c264f5f

andersy005 added the enhancement New feature or request label Aug 18, 2022

keewis mentioned this pull request Aug 19, 2022

capture test output #173

Merged

andersy005 and others added 2 commits August 19, 2022 08:34

Merge branch 'main' into file-transfer

5ea2942

clean up the file after the test

584becf

andersy005 reviewed Aug 19, 2022

View reviewed changes

tests/test_core.py Show resolved Hide resolved

keewis added 8 commits August 19, 2022 17:11

add more custom remote handlers to avoid warnings from getpass

4318ffd

try using temporary files instead

dc322d1

don't initialize the log directory

d64afd7

remove the unnecessary auth handler overrides

ea44b33

try explicitly specifying the shell to run

5e4230a

back to creating / overwriting the file

5e59c11

use the raw session to verify the contents

28da4f4

only test on a single shell

fe04b0e

since we don't actually use the shell

keewis force-pushed the file-transfer branch from 89d03f0 to fe04b0e Compare August 19, 2022 16:04

andersy005 approved these changes Aug 19, 2022

View reviewed changes

keewis commented Aug 19, 2022

View reviewed changes

jupyter_forward/core.py Outdated Show resolved Hide resolved

keewis commented Aug 19, 2022

View reviewed changes

tests/test_core.py Outdated Show resolved Hide resolved

keewis and others added 3 commits August 19, 2022 22:11

convert to normal string

b7de500

actually use the default shell

07dd3db

define the auth handlers as normal functions

4df86bb

andersy005 merged commit cd1eefe into ncar-xdev:main Aug 23, 2022

keewis deleted the file-transfer branch August 24, 2022 08:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

transfer files using a separate `paramiko` channel #171

transfer files using a separate `paramiko` channel #171

keewis commented Aug 17, 2022 •

edited

Loading

andersy005 commented Aug 17, 2022

keewis commented Aug 17, 2022 •

edited

Loading

keewis commented Aug 18, 2022 •

edited

Loading

andersy005 commented Aug 18, 2022

keewis commented Aug 19, 2022 •

edited

Loading

keewis commented Aug 19, 2022

andersy005 left a comment

andersy005 commented Aug 19, 2022

mnlevy1981 commented Aug 19, 2022

keewis commented Aug 19, 2022 •

edited

Loading

mnlevy1981 commented Aug 19, 2022

keewis commented Aug 19, 2022 •

edited

Loading

transfer files using a separate paramiko channel #171

transfer files using a separate paramiko channel #171

Conversation

keewis commented Aug 17, 2022 • edited Loading

andersy005 commented Aug 17, 2022

keewis commented Aug 17, 2022 • edited Loading

keewis commented Aug 18, 2022 • edited Loading

andersy005 commented Aug 18, 2022

keewis commented Aug 19, 2022 • edited Loading

keewis commented Aug 19, 2022

andersy005 left a comment

Choose a reason for hiding this comment

andersy005 commented Aug 19, 2022

mnlevy1981 commented Aug 19, 2022

keewis commented Aug 19, 2022 • edited Loading

mnlevy1981 commented Aug 19, 2022

keewis commented Aug 19, 2022 • edited Loading

transfer files using a separate `paramiko` channel #171

transfer files using a separate `paramiko` channel #171

keewis commented Aug 17, 2022 •

edited

Loading

keewis commented Aug 17, 2022 •

edited

Loading

keewis commented Aug 18, 2022 •

edited

Loading

keewis commented Aug 19, 2022 •

edited

Loading

keewis commented Aug 19, 2022 •

edited

Loading

keewis commented Aug 19, 2022 •

edited

Loading