feat(builtin): split node_loader into node_patcher #1003

Toxicable · 2019-08-13T09:58:41Z

PR Checklist

Tests for the changes have been added (for bug fixes / features)
Docs have been added / updated (for bug fixes / features)

PR Type

Feature (please, look at the "Scope of the project" section in the README.md file)

What is the current behavior?

Unable to use bazels module resolution under a child_process or worker_threads

Issue Number: #312 #227

What is the new behavior?

This change splits up node_launcher into node_loader.js + node_patcher.js
node_loader.js - job is to run any bootstrap modules, setup source map support and finally call the users main entry point.
node_patcher - job is to only patch the module resolution logic of require so it can resolve modules under bazels runfile layout.
Part of this change also adds logic so that any worker (child process or thread) also has their require patched correctly so that they can require modules just the same as in the main thread.
Meaning we'll modify node_launcher.sh to call node with --require=${target}node_patcher.js.
In my testing, this change alone will enable works for child_process on nodejs 10.x and greater, however, we will need nodejs 12.x to enable this fix for worker_threads.
With this in mind, we'll make a test for either case but allow the worker_threads one to fail untill 12.x is LTS and we can enable it as the default.

Does this PR introduce a breaking change?

Yes
No
Even those i'd consider it an "internal" api, if anyone is relying on the current behaviour of node_loader then this change will break it.

Other information

soldair

I'm still getting up to speed here but these are my notes on this change. Please note that some of these comments are more about the project and maybe out of scope for this pr.

the upgrade to node 12 LGTM except we need to pick up 12.8.1 which has some critical security fixes. If there are any existing docs on how we've decided to manage the nodejs version for users that would be useful. If there is a default we should discuss the merits of pinning it to a semver range so users by default get patch level fixes.

I'm not sure why we refactored out patcher from loader aside from the good reason that i could be and it's cleaner.

Some background in the issue description or link to a related issue would be really useful.

The effect is that the entry point gets a lot more boostrap modules loaded than child processes or worker threads. For things like source-map-support this means inconsistent debugging experience for those that compiled from typescript etc but maybe we shouldn't be helping them with that in the first place.

internal/node/node_patcher.js

internal/node/test/BUILD.bazel

internal/node/node_loader.js

soldair · 2019-08-19T16:10:25Z

internal/node/node_loader.js

+// we patch require here
+module.constructor._load(patchModulePath)
+// then we also patch process.execArgs sothat any child process also does this patch
+process.execArgv = process.execArgv.concat([`--require=${patchModulePath}`]);


Will nodejs will already have a --require=...loader.js arg in execArgv?
So when a child process or worker executes it again i would expect this logic to run node with node --require=loader.js --require=patcher.js and add another --require=patcher.js
the extra --require=...patcher.js are a no op because loader requires it but the failure mode here is using up the processes argv space unexpectedly.

Yeah, this isn't right at all, I was just testing what would happen if you modify the execArgv from inside the program - turns out it's ignored by node in the child process (maybe it should be marked as readonly then?)
Either way, we'll move this into the node_launcher.sh script

migrated to the --require from launcher implementation, so we don't need this anymore

Toxicable · 2019-08-19T20:35:45Z

@soldair Thanks for the review, i've updated the description with a better explaination of the goals here. Which should answer some of the questions you had around here, will try get some time tonight to make some changes so it reflects the description :P

Toxicable · 2019-08-24T21:01:10Z

The only failure now is because of this logic

if [[ $patcher = /* ]]
then
  NODE_OPTIONS+=( "--require=$patcher" )
else
  NODE_OPTIONS+=( "--require=./$patcher" )
fi

Is there a proper way to check for absolute/relative or to normalize this path in node_laucher?

soldair · 2019-08-27T16:25:22Z

support for spawning node.

This is a really long answer to the question about the patcher file path handling.

I Think we need to add support for a few other ways nodejs can be spawned.
for example.

As a process looked up in the PATH

child_process.spawnSync('node',['-e','require("lib")'])

As a process invoked indirectly from the shell (very common with npm scripts)

child_process.execSync('node -e \'require("lib")\'')

The thing I think is missing to make these examples work is node needs to be resolved to a script from the PATH. We add directory containing an executable script called exactly node and (node.bat for windows) that has the absolute path to the nodejs binary and the file we intend to --require as our loader.

I think that it should be pretty much exactly this.

#!/bin/sh
export RULES_NODE_PROXY_SCRIPT=$0
exec $(templated_absolute_path_to_real_node)/bin/node -r $(templated absolute path to loader) "$@"

exec to drop the sh process, the absolute path to nodejs, the absolute path to the loader, and forward all other arguments.

Inside of the loader we set

process.execPath = path.resolve(process.env.RULES_NODE_PROXY_SCRIPT)

this supports programs that attempt to preserve execPath themselves (like npm), child_process.fork, and worker_threads i guess

Then we make sure the directory that contains our patched bin is in the path.

let binDir = path.dirname(process.env.RULES_NODE_PROXY_SCRIPT);
if(process.env.PATH.split(path.delimeter).indexOf(binDir) === -1){
  process.env.PATH = binDir+path.delimiter+process.env.PATH;
}

Toxicable · 2019-08-27T20:19:19Z

@soldair Those look like some pretty easy changes to make. Although I don't think ill get any time on this week till the weekend to make those changes. If you want to pick it up however im always on slack.

Toxicable · 2019-08-28T08:19:44Z

I've addressed those points and looks like it's working.

It'll still fail some tests since I don't quite know how to resolve the issue I mentioned above about the patcher path.

It's also failing in a few places with /usr/bin/env: 'bash': No such file or directory

Not sure what hashbang I should be using for the node_proxy script.

alexeagle · 2019-09-03T19:37:06Z

Note, with the upcoming linker approach, the node_modules directory on disk when the action starts should already be in the state all node programs expect, so we may not need this.

(remains to be seen whether all features of node_loader.js can be replaced by the linker, eg. runfiles, but I think it's eminently possible)

Toxicable · 2019-09-03T20:21:38Z

Sounds good to me. If we're able to remove th need for node_patcher.js then im happy

googlebot added the cla: yes label Aug 13, 2019

alexeagle requested a review from soldair August 15, 2019 21:15

soldair reviewed Aug 19, 2019

View reviewed changes

Toxicable marked this pull request as ready for review August 23, 2019 23:03

Toxicable requested review from alexeagle, gregmagolan and kyliau as code owners August 23, 2019 23:03

Toxicable force-pushed the workers branch from 4aed8f5 to 14c7af3 Compare August 24, 2019 01:02

Fabian Wiles added 2 commits August 24, 2019 13:03

feat(builtin): split node_loader into node_patcher

14c7af3

fix(builtin): account for absolute

d52d55a

soldair self-assigned this Aug 27, 2019

Toxicable force-pushed the workers branch 3 times, most recently from ff10a5a to 006325d Compare August 28, 2019 08:19

Toxicable force-pushed the workers branch 4 times, most recently from b087991 to 64772d8 Compare August 28, 2019 08:44

fix(builtin): remove debug

64772d8

Toxicable closed this Sep 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(builtin): split node_loader into node_patcher #1003

feat(builtin): split node_loader into node_patcher #1003

Toxicable commented Aug 13, 2019 •

edited

Loading

soldair left a comment

soldair Aug 19, 2019

Toxicable Aug 19, 2019

Toxicable Aug 25, 2019

Toxicable commented Aug 19, 2019

Toxicable commented Aug 24, 2019

soldair commented Aug 27, 2019 •

edited

Loading

Toxicable commented Aug 27, 2019

Toxicable commented Aug 28, 2019 •

edited

Loading

alexeagle commented Sep 3, 2019

Toxicable commented Sep 3, 2019

feat(builtin): split node_loader into node_patcher #1003

feat(builtin): split node_loader into node_patcher #1003

Conversation

Toxicable commented Aug 13, 2019 • edited Loading

PR Checklist

PR Type

What is the current behavior?

What is the new behavior?

Does this PR introduce a breaking change?

Other information

soldair left a comment

Choose a reason for hiding this comment

soldair Aug 19, 2019

Choose a reason for hiding this comment

Toxicable Aug 19, 2019

Choose a reason for hiding this comment

Toxicable Aug 25, 2019

Choose a reason for hiding this comment

Toxicable commented Aug 19, 2019

Toxicable commented Aug 24, 2019

soldair commented Aug 27, 2019 • edited Loading

support for spawning node.

Toxicable commented Aug 27, 2019

Toxicable commented Aug 28, 2019 • edited Loading

alexeagle commented Sep 3, 2019

Toxicable commented Sep 3, 2019

Toxicable commented Aug 13, 2019 •

edited

Loading

soldair commented Aug 27, 2019 •

edited

Loading

Toxicable commented Aug 28, 2019 •

edited

Loading