
Parallel sampling with multiprocessing #1369

Closed

Conversation


@horizon-blue horizon-blue commented Mar 2, 2022

Summary:
Related feature request: #1350.

Key Changes:

  • This diff implements the multiprocessing logic and introduces a new argument, run_in_parallel, so users can choose to run multi-chain inference in parallel (in subprocesses).
    • For the progress bars, it seems that as long as we pass tqdm a lock and use the position argument, tqdm can correctly update the progress bar for each subprocess, so we don't need a subprocess dedicated to updating progress bars the way Pyro does :) (a minimal sketch of this pattern follows this list). One downside of this approach compared to a dedicated process is that in Jupyter notebooks the bars can end up out of order (the progress bar for the 5th chain can appear on the 1st row, see screenshot below), but that shouldn't matter in our use case.
      [Screenshot: progress bars]
    • (The screenshot is taken from a toy snippet to test the progress bar, not from BM :))
  • We also need to change how samples are gathered, because sending RVIdentifier back and forth between processes can change its hash value. As a result, we can run into KeyError when merging the dictionaries of samples sent to the main process. The solution here is to return a list of Tensors instead and use the order of queries to determine which Tensor corresponds to which RVIdentifier (see the gathering sketch after this list).
  • Users can use the new mp_context argument to control how new subprocesses are created (see the multiprocessing doc for details)
    • Note: for gradient-based methods such as NMC and NUTS, the usual caveats about running autograd with fork-based multiprocessing still apply: https://github.com/pytorch/pytorch/wiki/Autograd-and-Fork. It seems that autograd initializes some internal state the first time it is executed, and fork-based multiprocessing copies that state into subprocesses, which can be problematic, so PyTorch recommends using "spawn" mode for multiprocessing. However, spawn mode doesn't work in interactive environments such as Jupyter notebooks. One way to work around this in a notebook is to keep using the default "fork" mode but never initialize the autograd state in the main process (i.e. always run inference in subprocesses). This is not an elegant solution, but at least it works. From a previous conversation with OpenTeams, it seems that Dask does not trigger PyTorch's autograd warning, so we should still look into whether it would be a better long-term solution.
  • When run_in_parallel is True, we will pre-sample the seed for each chain and pass it to the subprocesses. This ensures that the RNG for each chain is set to a different state (see the seeding sketch after this list).
    • We could use the same mechanism to set the seed for non-parallel inference as well, but doing so would change the stochastic behavior of our existing tutorials and use cases, so I'd rather not do that right now since there have already been a lot of changes in this diff :)
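
As a rough illustration of the progress bar approach (not the BM code itself), here is a minimal sketch of the standard tqdm multiprocessing pattern with a shared lock and the position argument; the 4-chain setup and the run_chain name are just placeholders:

```python
from multiprocessing import Pool, RLock
from time import sleep

from tqdm import tqdm


def run_chain(chain_id: int) -> None:
    # `position` gives each worker its own terminal row for its progress bar.
    for _ in tqdm(range(100), desc=f"chain {chain_id}", position=chain_id):
        sleep(0.01)


if __name__ == "__main__":
    # Share one lock across workers so concurrent bar updates don't clobber
    # each other's output.
    with Pool(4, initializer=tqdm.set_lock, initargs=(RLock(),)) as pool:
        pool.map(run_chain, range(4))
```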
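The sample-gathering change could then be as simple as the following sketch, where queries is the parent process's list of queried random variables and each subprocess returns its samples as a list of tensors in the same order (gather_chain_samples is an illustrative name, not the actual BM internal):

```python
from typing import Dict, List

import torch


def gather_chain_samples(queries: List, tensors: List[torch.Tensor]) -> Dict:
    # Rebuild the {RVIdentifier: samples} mapping in the parent process using
    # the parent's own RVIdentifier objects, so nothing depends on hash values
    # that crossed a process boundary.
    assert len(queries) == len(tensors)
    return dict(zip(queries, tensors))
```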
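And the mp_context plus per-chain seeding pieces might fit together roughly like this; run_single_chain and run_chains_in_parallel are assumed names for a sketch, not the exact implementation in the diff:

```python
import multiprocessing

import torch


def run_single_chain(seed: int, chain_id: int, num_samples: int):
    # Each subprocess seeds its RNG with the value pre-sampled in the parent,
    # so every chain starts from a different random state.
    torch.manual_seed(seed)
    # ... run a single chain of inference here and return its samples ...


def run_chains_in_parallel(num_chains: int, num_samples: int, mp_context: str = "fork"):
    # `mp_context` selects how subprocesses are started ("fork", "spawn", ...).
    ctx = multiprocessing.get_context(mp_context)
    # Pre-sample one seed per chain in the parent process.
    seeds = torch.randint(2**32, (num_chains,)).tolist()
    with ctx.Pool(num_chains) as pool:
        return pool.starmap(
            run_single_chain,
            [(seed, i, num_samples) for i, seed in enumerate(seeds)],
        )
```

With the new arguments, calling inference would presumably look something like infer(..., num_chains=4, run_in_parallel=True, mp_context="spawn"), though the exact signature is whatever this diff defines.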

Differential Revision: D34574082

@facebook-github-bot added the CLA Signed and fb-exported labels Mar 2, 2022
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D34574082

horizon-blue added a commit to horizon-blue/beanmachine that referenced this pull request Mar 4, 2022
Pull Request resolved: facebookresearch#1369

Differential Revision: D34574082

fbshipit-source-id: 175951bac029957712466dbe25e892f32c48e155
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D34574082

Differential Revision: D34569431

fbshipit-source-id: 0fca923708d29df4ee2fa83c665eeb66d0385859
Pull Request resolved: facebookresearch#1369

Differential Revision: D34574082

fbshipit-source-id: 32237561392a0e7b9a4b7392a297fdc35642f331
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D34574082
