New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Colocalization module #17

Merged

tdrose merged 9 commits into master from dev

Feb 6, 2024

Member

tdrose commented Dec 19, 2023

Colocalization is a common metric for analysis of imaging MS/spatial metabolomics data.
In a previous publication from Theo (ColocML), they evaluated metrics/processing steps for computing colocalization. I implemented the best-performing metric.

I think it is a useful addition to the package for users to perform this kind of analysis/processing which cannot be done by other downstream analysis packages such as scanpy or squidpy.

tdrose added 2 commits

December 18, 2023 17:30


          Colocalization module

ac6bcc2


          Documentation for coloc module

tdrose added the enhancement label

tdrose requested a review from aeisenbarth

December 19, 2023 16:15

tdrose self-assigned this


          Docs examples fix.

a12792f

tdrose mentioned this pull request

Implement ColocML processing and colocalization computation #16

Closed

2 tasks

tdrose linked an issue

that may be closed by this pull request

Implement ColocML processing and colocalization computation #16

Closed

2 tasks

Member Author

tdrose commented Jan 11, 2024

Once merged, closes #16 .

aeisenbarth reviewed

View reviewed changes

docs/examples.rst Outdated

+                 # It has the same dimensions as `adata.X`
+                 colocalization.colocML_preprocessing(adata, layer="colocml_preprocessing")
+                 # Compute the pairwise colocalization metrix between all ion images

Collaborator

aeisenbarth Dec 22, 2023

matrix (or metrics?)

Member Author

tdrose Feb 5, 2024

Done.

metaspace_converter/colocalization.py Outdated

		from metaspace_converter.constants import COLOCALIZATION


		def colocML_preprocessing(

Collaborator

aeisenbarth Dec 22, 2023

I would rename to coloc_ml_preprocessing, closer to Python naming conventions.

Member Author

tdrose Feb 5, 2024

Done.

metaspace_converter/colocalization.py Outdated

+                  """
+                  # Select data
+                  if layer == None or layer not in adata.layers.keys():

Collaborator

aeisenbarth Jan 2, 2024

I think the second condition is quite permissive. If an AnnData has both a default X and some layers, a user providing an unknown layer name would unexpectedly get the default layer, but since no error is raised, assume they got the requested layer.

When doing data analysis, is this level of permissiveness useful (because some datasets may have the layer, others not)?

Member Author

tdrose Feb 5, 2024

Agree. Now raising an error if layer not available.

metaspace_converter/colocalization.py Outdated

+                  coloc = _pairwise_cosine_similarity(data)
+                  # Save colocalization
+                  adata.uns[COLOCALIZATION] = coloc

Collaborator

aeisenbarth Jan 2, 2024

It would also fit well as adata.varp[COLOCALIZATION] = coloc since it is pairwise on features.

Member Author

tdrose Feb 5, 2024

True. Good idea. Will move it to anndata.varp

metaspace_converter/tests/colocalization_test.py Outdated

Comment on lines 86 to 90

+                      # Layer exists
+                      assert COLOCML_LAYER in adata.layers.keys()
+                      # Layer sizes match
+                      assert adata.X.shape == adata.layers[COLOCML_LAYER].shape

Collaborator

aeisenbarth Jan 2, 2024

I would move these above # Test median filtering because the other assertions would have failed anyways. That way, we first have the check that preprocessing was done, and then that it was done correctly.

Member Author

tdrose Feb 5, 2024

Done.

metaspace_converter/tests/colocalization_test.py Outdated

+                      assert observed == expected_preprocessing
+                      # Quantile thresholding
+                      adata.X[0].reshape(-1)

Collaborator

aeisenbarth Jan 2, 2024

This doesn't modify the receiver. You can remove the line adata.X[0].reshape(-1).

Member Author

tdrose Feb 5, 2024

True. Probably just an artifact from my internal testing. Removed it.

metaspace_converter/tests/colocalization_test.py Outdated

Comment on lines 41 to 43

+                      dict(num_ions=5, height=10, width=10),
+                      dict(num_ions=0, height=2, width=3),
+                      dict(num_ions=4, height=4, width=4),

Collaborator

aeisenbarth Jan 2, 2024

To me, the third and first case seem to cover the same classes of equivalent inputs, that means I would not expect the third one fails without the first one failing as well.

But the second one covers two different edge cases (no ions, rectangular image). I would separate them like this:

        dict(num_ions=5, height=10, width=10),
        dict(num_ions=0, height=10, width=10),
        dict(num_ions=5, height=3, width=4),

Member Author

tdrose Feb 5, 2024

Good idea. Done.

metaspace_converter/tests/colocalization_test.py Outdated

Comment on lines 69 to 75

+                      expected_preprocessing = np.median(actual[0][:3, :3])
+                      observed = adata.layers[COLOCML_LAYER][adata.uns[METASPACE_KEY]["image_size"][X] + 1, 0]
+                      assert observed == expected_preprocessing
+                      expected_preprocessing = np.median(actual[1][:3, :3])
+                      observed = adata.layers[COLOCML_LAYER][adata.uns[METASPACE_KEY]["image_size"][X] + 1, 1]
+                      assert observed == expected_preprocessing

Collaborator

aeisenbarth Jan 2, 2024 •

edited

Loading

These assertions are not obvious to me as a reader. I would extract the inline expression to a variable width and explain the indexing. For example:

        # Check that preprocessing was done correctly.
        # Probe a single pixel of the resulting array and compute the expected median filter.
        x = 1
        y = 1
        width = adata.uns[METASPACE_KEY]["image_size"][X]

        expected_pixel_value = np.median(actual[0][y-1:y+2, x-1:x+2])
        actual_pixel_value = adata.layers[COLOCML_LAYER][y * width + x, 0]
        assert actual_pixel_value == expected_pixel_value

One can then even check all ions in one go with

        expected_pixel_features = np.median(actual[:, y - 1:y + 2, x - 1:x + 2], axis=[1, 2])
        actual_pixel_features = adata.layers[COLOCML_LAYER][y * width + x, :]
        np.testing.assert_allclose(actual_pixel_features, expected_pixel_features)

Member Author

tdrose Feb 5, 2024

Good idea, done.

metaspace_converter/tests/colocalization_test.py Outdated

Comment on lines 77 to 84

+                      # Quantile thresholding
+                      adata.X[0].reshape(-1)
+                      colocML_preprocessing(
+                          adata, median_filter_size=(3, 3), quantile_threshold=0.5, layer=COLOCML_LAYER
+                      )
+                      assert all(np.sum(adata.layers[COLOCML_LAYER] == 0, axis=0) <= 0.5 * adata.X.shape[0])

Collaborator

aeisenbarth Jan 2, 2024

It is often preferrable to have a single test case in a test function. For a second case, you could parametrize the function with @pytest.parametrize("quantile_threshold", [0, 0.5]) or, if the call or assertions are too different, write a separate test function.

Collaborator

aeisenbarth Jan 2, 2024

I would make the assertion a bit clearer by extracting the parameter to a variable and computing the assertion values in separate lines.

        quantile_threshold = 0.5
        colocML_preprocessing(
            adata, median_filter_size=(3, 3), quantile_threshold=quantile_threshold, layer=COLOCML_LAYER
        )
        quantile_value = quantile_threshold * adata.X.shape[0]
        actual_thresholded = np.sum(adata.layers[COLOCML_LAYER] == 0, axis=0)
        assert all(actual_thresholded <= quantile_value)

Member Author

tdrose Feb 5, 2024

Done. Added quantile_threshold to parameterization.

tdrose added 5 commits

February 5, 2024 10:56


          renaming of coloc_ml_processing, testing, and moving colocs to varp

1de5c51


          Changed test scenarios.

7bc4dca


          Testing median filter for all ions of one pixel.

94a2203


          Separated quantile thresholding and median filtering in test.


          Added layer argument to anndata_to_image_array function.

178e56c

tdrose commented

View reviewed changes

metaspace_converter/anndata_to_array.py

@@ @@ -16,13 +17,15 @@ def _check_pixel_coordinates(adata: AnnData) -> bool: @@
                   return np.all(np.equal(pixel_list, required_pixels))
-              def anndata_to_image_array(adata: AnnData) -> np.ndarray:
+              def anndata_to_image_array(adata: AnnData, layer: Optional[str]=None) -> np.ndarray:

Member Author

tdrose Feb 5, 2024

Also added layer argument to the anndata_to_image_array function. With this also the processed images from layers can be extracted.

tdrose requested a review from aeisenbarth

February 5, 2024 12:30

aeisenbarth approved these changes

View reviewed changes

Collaborator

aeisenbarth left a comment

Besides one typo ready to merge!

metaspace_converter/colocalization.py Outdated

+                          ion image and all pixels below the quantile threshold will be set to 0.
+                  Returns:
+                      None. The processed data is saved in ``layer``. If layer is net to None, ``adata.X`` will

Collaborator

aeisenbarth Feb 5, 2024

~~net~~ set


          Fixed typo

9d09565

tdrose merged commit 9c7e08d into master

3 checks passed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels