Handle data exported via MetaXpress #201

fstur · 2024-12-10T11:20:19Z

When data is exported via the MetaXpress software, it is structured slightly differently than when the data folder is simply copied. The main differences are:
- Projections are stored in the ZStep_0 folder
- The 'acquisition ID' folder also contains the plate name
- The md_id field is missing
- The extension is .TIF instead of .tif

Changes to accommodate this: I adjusted the regex pattern to match all of these cases, and added a line that switches every z=="0" to z=None.
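To illustrate the idea, here is a minimal sketch of such a pattern. It is not the regex from this PR: the pattern, the `parse_path` helper, and the example file names are hypothetical and simplified, and they only assume the differences listed above (optional ZStep folder, optional md_id, either extension case).

```python
import re

# Hypothetical, simplified pattern; the actual regex in faim_ipa may differ.
FILENAME_PATTERN = re.compile(
    r"(?:ZStep_(?P<z>\d+)[\\/])?"                # optional z-step folder
    r"(?P<name>[^\\/]+)_(?P<well>[A-Z]+\d{2})"   # plate name and well, e.g. E07
    r"_(?P<field>s\d+)_(?P<channel>w\d)"         # field and channel
    r"(?P<md_id>[0-9A-Fa-f-]*)"                  # md_id, empty in exported data
    r"\.(?P<ext>tif|TIF)$"                       # either extension case
)


def parse_path(path: str) -> dict:
    """Extract the named fields from a (relative) image path."""
    match = FILENAME_PATTERN.search(path)
    if match is None:
        raise ValueError(f"Path does not match the expected layout: {path}")
    fields = match.groupdict()
    # Exported projections live in ZStep_0, so z == "0" is treated as "no z".
    if fields["z"] == "0":
        fields["z"] = None
    return fields
```

With this sketch, `parse_path("ZStep_0/plate_E07_s1_w1.TIF")` yields `z=None` and an empty `md_id`, while a path under `ZStep_2` keeps `z="2"`.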

In addition, if the plate is a mixed acquisition, the MetaXpress export fills the incomplete stacks with either the single plane or the MIP (the data is duplicated), so that all channels become full stacks. I decided to process those with the StackAcquisition class only, and not to support the MixedAcquisition class.

I had to add a check in _compute_z_spacing to ensure that the channel from which z is computed is not such a 'filled stack'.
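As a rough sketch of what such a check could look like (not the code in this PR): the function below assumes, hypothetically, that a filled stack can be recognised because all of its planes report the same z position, and it skips those channels. The function name, the input format, and the detection criterion are all assumptions made for illustration.

```python
from __future__ import annotations

from statistics import mean


def compute_z_spacing(z_positions_by_channel: dict[str, list[float]]) -> float | None:
    """Return the mean z step of the first channel that is a real stack."""
    for positions in z_positions_by_channel.values():
        unique = sorted(set(positions))
        # Assumption: a filled stack repeats one plane, so it has a single
        # distinct z position and cannot be used to compute the spacing.
        if len(unique) < 2:
            continue
        steps = [upper - lower for lower, upper in zip(unique, unique[1:])]
        return mean(steps)
    # No channel with more than one distinct z position was found.
    return None
```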

I also added the imagexpress_ZMB_MetaXpress/test_plate_acquisitions.py tests and additional test-data.

If it's ok with you, it would be great if we could incorporate these changes to also handle MetaXpress exported files.

codecov · 2024-12-10T11:22:17Z

Codecov Report

Attention: Patch coverage is 98.50000% with 3 lines in your changes missing coverage. Please review.

Project coverage is 93.16%. Comparing base (1870eee) to head (87ae54a).
Report is 6 commits behind head on main.

Files with missing lines	Patch %	Lines
src/faim_ipa/hcs/imagexpress/acquisition.py	80.00%	0 Missing and 3 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #201      +/-   ##
==========================================
+ Coverage   92.75%   93.16%   +0.41%     
==========================================
  Files          64       65       +1     
  Lines        3920     4128     +208     
  Branches      246      260      +14     
==========================================
+ Hits         3636     3846     +210     
+ Misses        269      265       -4     
- Partials       15       17       +2


imagejan

Thanks @fstur for the pull request!

Is there a chance we can reduce the number of files in the test data to the absolute minimum required, e.g. by reducing to fewer time points and z steps?
Also, we might want to have just empty (0kb) files for those .TIF files that aren't required to load metadata.
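One possible way to do this, as a sketch only: the helper below is hypothetical (not part of faim_ipa) and assumes the file names that are still needed for metadata are passed in explicitly via `keep`.

```python
from __future__ import annotations

from pathlib import Path


def truncate_unneeded_tifs(root: Path, keep: set[str]) -> list[Path]:
    """Replace every .tif/.TIF file under root with a 0-byte file,
    except those whose file name is listed in keep."""
    truncated = []
    for path in sorted(root.rglob("*")):
        is_tif = path.is_file() and path.suffix.lower() == ".tif"
        if is_tif and path.name not in keep:
            path.write_bytes(b"")  # keeps the file name, drops the pixel data
            truncated.append(path)
    return truncated
```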

Regarding the folder organization, I'd suggest keeping the tests in tests/hcs/imagexpress, as that corresponds to the Python package we are testing. How about just a new file test_plate_acquisitions_exported.py or some such?

For the test data, I suggest naming the folder ImageXpress_exported and removing the ZMB. Also, MetaXpress is the acquisition software and ImageXpress the device, right? I suggest we stick with just ImageXpress for the naming, or are there good reasons to make the separation (e.g. alternative software that is used on the ImageXpress systems)?

I added a few (sometimes nitpicky) comments.

imagejan · 2024-12-11T09:25:19Z