Richardson Lucy Parallelization #237

avalluvan · 2024-08-29T16:42:48Z

Code Modifications
Backend (updates to existing files)

Image_deconvolution.py:
New deconvolution algorithm class added to dictionary: “RLparallel”
Parameter filepath is propagated through to deconvolution algorithm
RichardsonLucy.py and RichardsonLucySimple.py:
Tiny changes in init definition to accept parameter filepath string
User Interface
Define number of nodes/processors to use in parameter in config.yml deconvolution:parameter:numproc

RichardsonLucyParallel.py
Maintains standard user interface: initialize() and run_deconvolution()
No changes to initialize() call tree
Run_deconvolution() invokes the following in RichardsonLucyParallel.py:

initialization() → writes event, bg model and response files to disk to prepare for parallel execution. Event and bg model are stored as dense p-vectors in .csv. Response file is stored as p⊗q matrix as .h5.
iteration() → intended functionality has been disabled as MPI script cannot work within the framework. Need a shell call → subprocess.run call to RLparallelscript.py which includes all iteration steps.
finalization() → removes files written to disk.

Employing the Parallel Code
Extremely simple:

Change imagedeconvolution_parfile_gal_511keV.yml deconvolution:algorithm to "RLparallel."
Specify the number of processors/nodes to use through imagedeconvolution_parfile_gal_511keV.yml deconvolution:parameter:numproc to an integer appropriate to your system.
Right now, the tutorial notebook has not been explicitly modified as we do not have plans to deploy this with DC3. Nevertheless, please let me know if that is recommended.

Limitations

No access to interactive parameter overrides yet
Current method is yet to be tested on a supercomputer where environment variables are less flexible making subprocess calls less transferable
This is an alpha version of parallel RL → No smoothing, bg normalization, etc.
Results are not in the same format
Model and delta_model at every iteration are saved in separate .csv files
No direct methods to return result object without saving results to disk. Could potentially load results during finalization() step.
Result format is primarily a product of input format (simple dense vectors vs histogram objects based implementation in serial code).

Next Steps

Test implementation on UCSD supercomputer
Incorporate RL optimizations in parallel version
Add interactive parameter overrides to config.yml
Move iteration loop in run_deconvolution() to respective algos (Thoughts?)

…orithm_classes

…keleton from RichardsonLucy.py

…pt.py

New files: RichardsonLucyParallel.py and RLparallelscript.py Potentially modified files: dataIF_COSI_DC2.py deconvolution_algorithm_base.py image_deconvolution_data_interface_base.py image_deconvolution.py model_base.py

RichardsonLucySimple.py and RichardsonLucy.py were modified to include the propagation of the config file from the user facing image_deconvolution object to the respective deconvolution algorithms

codecov · 2024-08-29T16:45:03Z

Codecov Report

Attention: Patch coverage is 12.01717% with 205 lines in your changes missing coverage. Please review.

Project coverage is 69.17%. Comparing base (d02846a) to head (3036a7a).

Files with missing lines	Patch %	Lines
cosipy/image_deconvolution/RLparallelscript.py	0.00%	156 Missing ⚠️
...sipy/image_deconvolution/RichardsonLucyParallel.py	28.98%	49 Missing ⚠️

Files with missing lines	Coverage Δ
cosipy/image_deconvolution/RichardsonLucy.py	`96.03% <100.00%> (ø)`
cosipy/image_deconvolution/RichardsonLucySimple.py	`94.33% <100.00%> (ø)`
cosipy/image_deconvolution/image_deconvolution.py	`96.15% <100.00%> (+0.19%)`	⬆️
...sipy/image_deconvolution/RichardsonLucyParallel.py	`28.98% <28.98%> (ø)`
cosipy/image_deconvolution/RLparallelscript.py	`0.00% <0.00%> (ø)`

hiyoneda · 2024-09-12T13:44:19Z

Hi, Anaya. Thank you for submitting this PR. I have reviewed the changes and noticed a few issues that I want to discuss with you.

Currently, most of the essential parts of the RL algorithm are performed in RLparallelscript.py. I am concerned about this because it may reduce the flexibility to maintain the code. In actual image analysis, it is very likely to use several different modified RL algorithms, such as standard RL, accelerated RL, MAP RL, maximum entropy, etc, and compare the results. So, the main purpose of DeconvolutionAlgorithmBase is to allow us to use these different algorithms in the same manner to make things easier. But with RLparallelscript.py, we have to duplicate these algorithm classes for the parallel calculation.
The data handling of the response matrix, like preparing intermediate files, is performed in RichardsonLucyParallel.py. I suggest separating these parts from the algorithm classes because I need to modify the algorithm classes every time the response format changes. As for now, we assume that the response matrix is a single matrix, but we may use a different format (neural network, matrix with different parameterization, a combination of several matrices, etc.). So, I prefer to prepare a child class for the data interface class and hide such a data handling part there. Ideally, the parallel calculation can also be implemented in the data interface. I think that such implementation would also be a good first step in thinking about parallel calculation in another case, like model fitting.

Can you please consider these issues first and tell me the potential challenges for them? If you agree with them, we can discuss how to address them together.

avalluvan · 2024-09-30T21:44:20Z

Hi Hiroki,

As we have discussed before, it might not be possible to invoke an MPI-based script without performing a shell execute as it requires an mpiexec python <filename> call as opposed to a python <filename> call. MPI-based code could potentially be run on just the master node, keeping all the child nodes waiting, unless a specific, parallel-code needs to be executed. I think this would require some changes from the ground up such that MPI initializations are a part of the root modules being called (such as init.py in the cosipy/cosipy/* directory.). However, I am not sure how that will interfere (and if there are any workarounds) with Jupyter Notebooks. I can take a look at these aspects and revert back.
I agree on this. Now may be a good time to revisit these sections and generalize them, and I think I will be able to do this myself.
Thanks for your inputs and we can also continue these discussions in future meetings.

…ed to subsequently overwrite remote. Merge remote-tracking branch 'refs/remotes/origin/develop' into develop

Switching to histpy.Histogram()

Can also work with eps-to-Em mapping. Need to generalize

Interpolated scheme in get_point_source_response() tested and works as intended.

…onse

…etectorResponse.

Feature/general response

israelmcmc · 2024-12-03T21:37:02Z

@avalluvan It took me a while (sorry!), but I finally got to look at your code in detail. Good job! I didn't have experience with MPI and I got to learn a lot from your work :)

A couple of things:

I truncated your branch to keep only the MPI RL stuff --i.e. excluding the new response handling-- and open a new PR with it (Auxiliary PR: Richardson Lucy Parallelization #271). It's good to keep one feature per PR. I also removed the parameter_file input (since the parameter Configurator class already keeps track of the file path) and removed some other small changes that were not needed by the parallel code (just to make reviewing easier). Please take a look.
I created this script to illustrate @hiyoneda 's point (which I agree with). It's the parallel versions of this other one that I created to test the current version of the imaging module (PR Major updates on the image deconvolution modules #188). You can execute it with e.g. mpiexec -n 4 python toy_ParallelImageDeconvolutionDataInterfaceBase.py and it will plot the deconvolved model (from random data generated internally). Instead of re-implementing the RichardsonLucy class, you create a ImageDeconvolutionDataInterfaceBase child class that performs the computation in parallel. Instead of spawning multiple mpiexec internally, you just let the user do, as usual when working with clusters --it's OK if this doesn't work within a Jupyter notebook. This allows you to use all current and future improvements to the deconvolution algorithm(s) without duplicating code. Please take a look and let us know what you think.

avalluvan · 2024-12-09T17:02:36Z

Hi Israel. This pull request is unlikely to include any response reparametrization code. However, my response reparametrization pull request unfortunately contains this pull request's content along with response handling modifications.

Thanks for pointing out the parameter.absolute_path feature.

I will work with the script that you have provided and get back on this.

israelmcmc · 2024-12-09T17:09:54Z

Thanks @avalluvan.

To get a branch that contains only the detector response handling and not the MPI RL stuff look into git rebase, in particular around this section:

In your case "Topic A" is the parallelization stuff, and "Topic B" is the respect handling stuff.

avalluvan · 2024-12-13T00:26:17Z

Thanks for the inputs. I have significantly restructured the code and will open a new pull request for it.

avalluvan added 10 commits August 14, 2024 18:59

Added new option "RLparallel" to ImageDeconvolution.deconvolution_alg…

79109f3

…orithm_classes

Renamed RLparallel.py to RichardsonLucyParallel.py and adapted code s…

c918ab5

…keleton from RichardsonLucy.py

Installed mpi4py to cosipy venv and added it to skeleton

015f648

Created subprocess call. Ported MPI to separate script RLparallelscri…

f259d36

…pt.py

Added code to register results if save_results_flag is set to True

4cde4dd

End-to-end working script version commit

e98fb79

Unclear what files are causing conflicts

cd45e44

Add modified files and new RL parallel script

ef279f4

New files: RichardsonLucyParallel.py and RLparallelscript.py Potentially modified files: dataIF_COSI_DC2.py deconvolution_algorithm_base.py image_deconvolution_data_interface_base.py image_deconvolution.py model_base.py

Another tiny change

871c9a8

RichardsonLucySimple.py and RichardsonLucy.py were modified to include the propagation of the config file from the user facing image_deconvolution object to the respective deconvolution algorithms

Modified configuration file for RLparallel

27c8afe

avalluvan added enhancement New feature or request good first issue Good for newcomers labels Aug 29, 2024

avalluvan added this to the v4.0 - DC4 milestone Aug 29, 2024

avalluvan requested review from israelmcmc and hiyoneda August 29, 2024 16:42

avalluvan self-assigned this Aug 29, 2024

avalluvan added 11 commits October 18, 2024 09:15

Merge branch 'cositools:develop' into develop

f079988

git fetch changes unclear. Performing a commit to avoid data loss.

f044589

This merge may result in data losses. Copies of files have been creat…

3036a7a

…ed to subsequently overwrite remote. Merge remote-tracking branch 'refs/remotes/origin/develop' into develop

Create LMDR.ipynb and add response creation attempt with h5py.

35b8293

Switching to histpy.Histogram()

Created general code for multidimensional interpolation

d04f3c3

Attempted listmode response on the fly generation

ac47f23

Add ListModeResponse.py

c257676

Used histpy functions to dramatically simplify the codebase

7de27fd

Modified code to accept len(dr.axes) or len(dr.axes)-1 number of inputs.

4c23188

Can also work with eps-to-Em mapping. Need to generalize

Added comments to get_point_source_response()

71085fd

Tested new interpolation scheme on LMDR.ipynb

dc35034

avalluvan added 6 commits November 11, 2024 08:57

Adhoc fix for interpolated psr calculation with deepcopy

f8bbaa4

Simplified __getitem__ and get_point_source_response()

cdcb9b0

Interpolated scheme in get_point_source_response() tested and works as intended.

Ported ListModeResponse interpolation functionalities to DetectorResp…

aaff648

…onse

Removed ListModeResponse.py. All developed features incorporated in D…

caa2d44

…etectorResponse.

Merge pull request #1 from avalluvan/feature/general_response

2c80caa

Feature/general response

Added code to compress empty axes, i.e., those with axis shape = (1,)

dafd4f8

israelmcmc mentioned this pull request Nov 27, 2024

Auxiliary PR: Richardson Lucy Parallelization #271

Closed

israelmcmc self-assigned this Dec 5, 2024

avalluvan closed this Dec 13, 2024

israelmcmc mentioned this pull request Dec 13, 2024

Richardson Lucy Parallelization V2 #274

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Richardson Lucy Parallelization #237

Richardson Lucy Parallelization #237

avalluvan commented Aug 29, 2024

codecov bot commented Aug 29, 2024 •

edited

Loading

hiyoneda commented Sep 12, 2024

avalluvan commented Sep 30, 2024

israelmcmc commented Dec 3, 2024

avalluvan commented Dec 9, 2024 •

edited

Loading

israelmcmc commented Dec 9, 2024

avalluvan commented Dec 13, 2024

Richardson Lucy Parallelization #237

Richardson Lucy Parallelization #237

Conversation

avalluvan commented Aug 29, 2024

codecov bot commented Aug 29, 2024 • edited Loading

Codecov Report

hiyoneda commented Sep 12, 2024

avalluvan commented Sep 30, 2024

israelmcmc commented Dec 3, 2024

avalluvan commented Dec 9, 2024 • edited Loading

israelmcmc commented Dec 9, 2024

avalluvan commented Dec 13, 2024

codecov bot commented Aug 29, 2024 •

edited

Loading

avalluvan commented Dec 9, 2024 •

edited

Loading