Switch to using a simpler-but-still-hacky way of invoking stata #6

bloodearnest · 2021-05-07T16:29:14Z

We create a wrapper script around the supplied script that writes a file
if it successfully runs the script. We use the presence of this file as
indication of success or failure, and exit appropriately.

We can then use the regular stata (without -b), which gives us a much simpler streaming stdout.

We also refactor the Dockerfile to our new approach, and add a very basic test suite

We create a wrapper script around the supplied script that writes a file if it successfully runs the script. We use the contents of this file as indication of success or failure, and exit appropriately. We can then use the regular stata (without -b), which gives us stdout much more simply. Also, add an experimental feature to auto-import ado files in the ./libraries directory of a study.

- use buildkit with tags
- add packages.txt with base image tooling
- remove python3 dep
- have the renew-license script install expect

evansd

Ah, this is both so great and so terrible that it needs to exist. Only real question for me is over the wrapper= thing.

evansd · 2021-05-10T15:57:43Z

entrypoint.sh

+test -z "${STATA_LICENSE:-}" && { echo "No STATA_LICENSE environment variable found"; exit 1; }
+echo "$STATA_LICENSE" > /usr/local/stata/stata.lic
+
+script=$1


Would shellcheck tell us to use script="$1" here as, theoretically, the argument could contain spaces?

So shellcheck doesn't complain, as it knows $1 must be space-less for it to be parsed as $1 on the cli! But quoting it does no harm, so I'll add it.

Worth adding shellcheck to the lint command though.

Args are definitely not guaranteed to be space-less, unless I'm misunderstanding you

dave@carnap:~/tmp$ cat test.sh
#!/bin/bash
echo "\$1 is $1"
dave@carnap:~/tmp$ ./test.sh 'hello there'
$1 is hello there

right, fair point. I've already fixed it anyway, though.

I guess my point was that the first argument an action passes is designed to be interpreted as a path to pass to stata, but there's no harm in being explicit about it.

Also, foo=$1 as an assignment is fine in bash; it's only an issue if you use it unquoted in other commands (which, in hindsight, I think is the real reason shellcheck didn't complain).

> Also foo=$1 as assignment is fine in bash

Ah, so it is! I've always been so paranoid about quoting everything I never realised that.
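For anyone reading this thread later, the distinction can be checked directly. This is a minimal sketch, assuming bash with the default IFS; `count_args` is a hypothetical helper for illustration and is not part of entrypoint.sh.

```shell
#!/bin/bash
# Hypothetical helper: reports how many arguments it received.
count_args() {
    echo "$#"
}

# Simulate a script invoked with one argument that contains a space.
set -- 'hello there'

# Plain assignment: the right-hand side is not word-split,
# so the space survives even without quotes.
script=$1

# Unquoted use in a command: the value is split into two words.
# shellcheck disable=SC2086
unquoted_count=$(count_args $script)

# Quoted use in a command: the value stays a single word.
quoted_count=$(count_args "$script")

echo "assigned value: $script"
echo "unquoted expansion: $unquoted_count argument(s)"
echo "quoted expansion: $quoted_count argument(s)"
```

So `script=$1` and `script="$1"` behave the same; the quoting only matters where `$script` is later passed to another command.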

evansd · 2021-05-10T16:00:50Z

entrypoint.sh

+# actual script
+
+tmp=$(mktemp)
+wrapper=wrapper.$script


I don't think this transformation will work in general. $script will usually contain slashes so analysis/my_script.do becomes wrapper.analysis/my_script.do which will give you an error when you try to write to it. Maybe just use mktemp again?

arg, good point.

So, I didn't want to use temp, as that made for ugly logs for users to see (I want to link the wrapper name to their name as much as possible). It's easy enough to fix though, with ${script%.do}.wrapper.do
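A quick sketch of the difference between the two transformations, using the example path from the comment above. The variable names `broken_wrapper` and `fixed_wrapper` are only for illustration.

```shell
#!/bin/bash
# Example value of the first argument, as in the review comment.
script=analysis/my_script.do

# Original approach: prefixing the whole path points into a directory
# ("wrapper.analysis") that does not exist.
broken_wrapper=wrapper.$script

# Suggested fix: strip the trailing ".do" suffix, then append ".wrapper.do",
# so the wrapper sits next to the original script.
fixed_wrapper=${script%.do}.wrapper.do

echo "$broken_wrapper"   # wrapper.analysis/my_script.do
echo "$fixed_wrapper"    # analysis/my_script.wrapper.do
```

`${script%.do}` removes the shortest match of `.do` from the end of the value, so a script whose name does not end in `.do` would keep its name unchanged and simply gain the `.wrapper.do` suffix.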

Bash string transformations will never cease to amaze me!

I agree it makes sense to try to keep the name sensible-looking, I just wonder if there's any way to do that while keeping it out of /workspace. The trap .. rm solution is neat but there's still something a bit yucky about writing temporary stuff as root in the working directory. What about something like replacing the workspace path segment with wrapped so you'd get e.g. /wrapped/analysis/script.do? Not saying that's a great solution either but feels worth thinking through the solution space a bit further.

I think we need to be explicit about the fact there is a wrapper, but obvious about what it's wrapping.

Which for me means the basename not being the same as the script name (e.g. I expect most people would not notice /wrapped/ above and would just read /workspace/ there, which risks confusion).

E.g. if their action is stata analysis/model.do, the log files will talk about analysis/model.wrapper.do, i.e. the same path prefix.

Also, I'm wary of putting it in a different directory, in case there are some odd CWD semantics in stata.

We could do analysis/wrapper-model.do or something similar, but stata seems to require a .do extension.

> Also, I'm wary of putting it in a different directory, in case there are some odd CWD semantics in stata.

That is a fair worry. Fine to leave as is I think. Just felt worth checking there wasn't a good alternative.
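For reference, the `trap .. rm` approach of keeping the wrapper next to the script and cleaning it up afterwards could look roughly like this. It is a sketch only: the scratch directory stands in for the workspace, and the wrapper contents are a placeholder rather than what entrypoint.sh actually writes.

```shell
#!/bin/bash
# Scratch directory standing in for the study workspace (an assumption).
workdir=$(mktemp -d)
mkdir -p "$workdir/analysis"
script="$workdir/analysis/model.do"
wrapper="${script%.do}.wrapper.do"

(
    # Remove the wrapper whenever this subshell exits, success or failure.
    trap 'rm -f "$wrapper"' EXIT
    echo "placeholder wrapper contents" > "$wrapper"
    [ -f "$wrapper" ] && echo "wrapper exists while running"
    # ... the wrapped script would be run here ...
)

# After the subshell has exited, the trap should have removed the wrapper.
if [ -e "$wrapper" ]; then
    wrapper_left_behind=yes
else
    wrapper_left_behind=no
fi
echo "wrapper left behind: $wrapper_left_behind"
rm -rf "$workdir"
```

The wrapper still exists in the working directory while the script runs, which is the trade-off discussed above, but nothing is left behind afterwards.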

evansd · 2021-05-10T16:02:46Z

entrypoint.sh

+# stata is super odd in its cli interactions and behaviour. So we wrap up the
+# actual script


I reckon future us would be grateful for more detail here. Specifically the fact it always exits 0, that it doesn't stop on error if you pipe commands in, and that it doesn't log to stdout if you use batch mode.

Yep, already had a change pending for this, and a README update
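To illustrate why the wrapper is needed at all, here is a rough sketch of the marker-file idea in plain shell. It does not invoke stata: `always_exits_zero` is a hypothetical stand-in for a tool that exits 0 whatever happens, and `wrapped` and `run_with_marker` are illustrative names, not functions from entrypoint.sh. It assumes success is signalled by the presence of the marker file.

```shell
#!/bin/bash
# Hypothetical stand-in for a tool that always exits 0, whatever the
# command it runs does.
always_exits_zero() {
    "$@" || true
    return 0
}

# The "wrapper": run the real command and write the marker file only if
# the command succeeded.
wrapped() {
    marker_path=$1
    shift
    "$@" && : > "$marker_path"
}

# Run a command through the stand-in tool, then recover the real outcome
# from whether the marker file exists.
run_with_marker() {
    tmpdir=$(mktemp -d)
    always_exits_zero wrapped "$tmpdir/success" "$@"
    if [ -e "$tmpdir/success" ]; then
        rm -rf "$tmpdir"
        return 0
    fi
    rm -rf "$tmpdir"
    return 1
}

run_with_marker true
ok_status=$?
run_with_marker false
fail_status=$?

echo "successful script -> exit $ok_status"
echo "failing script -> exit $fail_status"
```

The exit status of the tool itself is ignored entirely; only the marker file decides whether the run counts as a success.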

evansd · 2021-05-10T16:04:51Z

tests/run.sh

@@ -0,0 +1,55 @@
+#!/bin/bash


Tests! 🎉


bloodearnest force-pushed the taming-stata-cli branch from 8ae43ea to 5674e66 Compare May 10, 2021 15:34

bloodearnest marked this pull request as ready for review May 10, 2021 15:35

bloodearnest added 3 commits May 10, 2021 16:40

Refactor Dockerfile to use new build style.

35d60ec

- use buildkit with tags
- add packages.txt with base image tooling
- remove python3 dep
- have the renew-license script install expect

Add very basic tests

d698c76

bloodearnest force-pushed the taming-stata-cli branch from 5674e66 to d698c76 Compare May 10, 2021 15:40

evansd requested changes May 10, 2021


Add more documentation around this change

fcc5e4c

Also, clean up some files left over from test runs, and add shellcheck linting

evansd approved these changes May 10, 2021


bloodearnest merged commit 95d6e97 into main May 10, 2021

bloodearnest deleted the taming-stata-cli branch May 10, 2021 20:02
