Adds `-m` option to saved .pex file interpreter emulation #418

markyen · 2017-09-27T20:15:26Z

This allows you to invoke a module from a saved PEX file with app.pex -m <module>, in addition to invoking a script with app.pex <file> and a REPL with app.pex, as could already be done previously.

This makes a saved PEX file more suitable as a basic replacement for the python binary on platforms that allow you to do that but that don't have good support for Python dependency management themselves, such as PySpark.

kwlzn

thanks for the PR, this looks reasonable to me w/ one comment.

would also like to see an integration test exercising the new feature before landing.

kwlzn · 2017-09-27T20:28:11Z

pex/pex.py

+      import code
+      code.interact()
+    elif sys.argv[1] == '-m':
+      mod = sys.argv[2]


seems like this unchecked access of index position 2 would cause an unhandled IndexError for the case of ./a.pex -m.

lorencarvalho · 2017-10-02T00:48:15Z

isn't this what PEX_MODULE=foo.bar:main and PEX_INTERPRETER=1 already do?

This allows you to invoke a module from a saved PEX file with `app.pex -m <module>`, in addition to invoking a script with `app.pex <file>` and a REPL with `app.pex`, as could already be done previously. This makes a saved PEX file more suitable as a basic replacement for the `python` binary on platforms that allow you to do that but that don't have good support for Python dependency management themselves, such as PySpark.

markyen · 2017-10-02T07:42:58Z

Added an extra check to avoid possibleIndexErrors and added integration tests.

It's true that this new flag does something you could already accomplish by setting PEX_MODULE: instead of running ./app.pex -m <module>, you could run PEX_MODULE=<module> ./app.pex and get the same result.

But my use case involves trying to use the built PEX file as the Python binary with a framework that doesn't know about PEX_MODULE; this framework just takes a configurable Python binary tries to invoke it with $PYTHON <script> or $PYTHON -m <module>. I'd like to be able to point $PYTHON at a built PEX file.

lorencarvalho · 2017-10-02T16:41:48Z

it might be worth noting that the .pexrc feature can inject environment variables for you if your environment doesn't support that. Placing a .pexrc file in the same CWD (or in $HOME) the pex file will read it.

Another way of achieving what you want without modifying pex source is to simply have a wrapper script that handles this sys.argv inspection and then runs an os.execve with environment variables set. Or even just a bash script, here's an example of one a tool I use builds:

#!/bin/bash

exec /usr/bin/env PEX_ROOT="$BASEDIR/libexec" PEX_MODULE="apollo:main" $BASEDIR/bin/server.pex "$@"

It should also be noted that you can use pex files as the shebang of a script a la #!/usr/bin/env yourpex.pex.

Not trying to discourage this change, just offering potential solutions that might work w/o cutting a new feature.

kwlzn · 2017-10-02T19:04:34Z

my understanding is that the -m <module> mode is specifically to smooth pex usage in PySpark. it's not the first time I've heard this request and since the change is limited to the default entrypoint handler it seems perfectly fine to me.

markyen · 2017-10-02T19:39:55Z

Indeed, in order to get pex working with PySpark now, I'm doing something very similar to what @sixninetynine suggested: with every PySpark job we write, I include a bash script that contains ./app.pex interpreter.py $@, and an interpreter.py that contains a copy of the pex.execute_interpreter function with the added ability of handling the -m flag, just as it appears in this PR.

This has been working fine for us for the last couple months, but as the number of PySpark jobs grows, so too does the number of places where this set of shim scripts needs to be duplicated, and shipped along with every run of every job.

I feel it would be cleaner to move the functionality into the upstream interpreter logic, but of course I'll leave the final design decision to you!

kwlzn

thanks for adding tests - one last comment and we should be good to merge.

kwlzn · 2017-10-03T15:56:33Z

pex/pex.py

+      code.interact()
+    elif len(sys.argv) > 2 and sys.argv[1] == '-m':
+      mod = sys.argv[2]
+      sys.argv = sys.argv[2:]


sys.argv should not be modified here.

noting that sys.argv is modified below in the existing code too.. but I don't think that's necessary either. mind yanking it too while you're in here?

actually.. scratch that.. I see why the sys.argv modification is needed now (sorry about the thrash).

I'm actually not sure what the reason for sys.argv modification was, so I was just trying to match the behavior below. If a different modification is needed here, let me know and I can make that change.

markyen · 2018-01-05T19:33:40Z

@kwlzn If you let me know what the desired behavior is for modifying sys.argv, I'm happy to make a change to match it.

markyen · 2018-02-12T20:53:11Z

If anything else needs to be changed before this can be merged, please let me know and I'll be happy to change it.

markyen · 2018-03-12T20:43:09Z

@kwlzn Any thoughts on what further changes are needed?

markyen · 2020-11-09T01:22:30Z

Superseded by #563

kwlzn suggested changes Sep 27, 2017

View reviewed changes

markyen force-pushed the module branch from 3770bbc to 98ab6ef Compare October 2, 2017 07:41

kwlzn reviewed Oct 3, 2017

View reviewed changes

markyen force-pushed the module branch from 857b173 to 98ab6ef Compare October 3, 2017 18:30

markyen closed this Nov 9, 2020

markyen deleted the module branch November 9, 2020 01:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds `-m` option to saved .pex file interpreter emulation #418

Adds `-m` option to saved .pex file interpreter emulation #418

markyen commented Sep 27, 2017

kwlzn left a comment

kwlzn Sep 27, 2017

lorencarvalho commented Oct 2, 2017

markyen commented Oct 2, 2017

lorencarvalho commented Oct 2, 2017

kwlzn commented Oct 2, 2017

markyen commented Oct 2, 2017

kwlzn left a comment

kwlzn Oct 3, 2017

kwlzn Oct 3, 2017

kwlzn Oct 3, 2017

markyen Dec 22, 2017

markyen commented Jan 5, 2018

markyen commented Feb 12, 2018

markyen commented Mar 12, 2018

markyen commented Nov 9, 2020

Adds -m option to saved .pex file interpreter emulation #418

Adds -m option to saved .pex file interpreter emulation #418

Conversation

markyen commented Sep 27, 2017

kwlzn left a comment

Choose a reason for hiding this comment

kwlzn Sep 27, 2017

Choose a reason for hiding this comment

lorencarvalho commented Oct 2, 2017

markyen commented Oct 2, 2017

lorencarvalho commented Oct 2, 2017

kwlzn commented Oct 2, 2017

markyen commented Oct 2, 2017

kwlzn left a comment

Choose a reason for hiding this comment

kwlzn Oct 3, 2017

Choose a reason for hiding this comment

kwlzn Oct 3, 2017

Choose a reason for hiding this comment

kwlzn Oct 3, 2017

Choose a reason for hiding this comment

markyen Dec 22, 2017

Choose a reason for hiding this comment

markyen commented Jan 5, 2018

markyen commented Feb 12, 2018

markyen commented Mar 12, 2018

markyen commented Nov 9, 2020

Adds `-m` option to saved .pex file interpreter emulation #418

Adds `-m` option to saved .pex file interpreter emulation #418