update harmony to new implementation #308

Intron7 · 2024-12-11T19:23:18Z

Fixes #299

flying-sheep · 2024-12-16T16:27:00Z

I’ll check this out tomorrow, it’s too big to start now!

flying-sheep

The harmonize function has a really nice layout. It’s easy to follow what it does, nice!

However, it seems like you’re establishing a bunch of parameters that get reused and never changed after initialization. You could e.g.

make it a frozen dataclass with methods so you can access the parameters as self.<name>, or
create a NamedTuple containing the parameters that you can pass around
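
To illustrate the first option, here is a minimal sketch. The class name `HarmonyParams`, the method `clustering_budget`, and all values are hypothetical; only the field names are taken from the `_clustering` call quoted in this review, and the real implementation may group its parameters differently.

```python
from dataclasses import FrozenInstanceError, dataclass


@dataclass(frozen=True)
class HarmonyParams:
    """Hypothetical container for settings that are fixed after initialization."""

    n_clusters: int
    theta: float
    sigma: float
    tol_clustering: float
    max_iter_clustering: int

    def clustering_budget(self) -> int:
        """Illustrative method that reads parameters through self.<name>."""
        return self.n_clusters * self.max_iter_clustering


# Values are made up for the example.
params = HarmonyParams(
    n_clusters=10,
    theta=2.0,
    sigma=0.1,
    tol_clustering=1e-5,
    max_iter_clustering=20,
)

try:
    params.theta = 3.0  # frozen: assignment raises instead of mutating
except FrozenInstanceError:
    print("parameters are read-only")
```

A NamedTuple would give the same read-only access with tuple semantics; the dataclass variant is shown because it can also carry methods.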

flying-sheep · 2024-12-17T10:15:25Z

pyproject.toml

 "src/rapids_singlecell/preprocessing/_harmonypy_gpu.py" = ["PLR0917"]
 "src/rapids_singlecell/decoupler_gpu/_method_mlm.py" = ["PLR0917"]
 "src/rapids_singlecell/decoupler_gpu/_method_wsum.py" = ["PLR0917"]
+"src/rapids_singlecell/preprocessing/_harmony/__init__.py" = ["PLR0917"]


You should ignore these inline (# noqa: PLR0917) instead of per file.

Also, only do so if absolutely necessary; I think it’s one of the best rules there is. I understand that numba doesn’t respect *, but that’s why it should be done inline.

flying-sheep · 2024-12-17T10:19:46Z

src/rapids_singlecell/preprocessing/_harmony/_fuses.py

don’t name variables LIKE_CONSTANTS

don’t name variables with single letter names (except for a, b for binary operators, i for enumerate and similar conventions)

flying-sheep · 2024-12-17T10:24:15Z

src/rapids_singlecell/preprocessing/_harmony/__init__.py

+        X (cp.ndarray): Input 2D array.
+
+    Returns:
+        cp.ndarray: Row-normalized 2D array.


No need to duplicate the types here. (This also applies to other places where you might have done that.)

        X: Input 2D array.

    Returns:
        Row-normalized 2D array.

flying-sheep · 2024-12-17T12:44:20Z

src/rapids_singlecell/preprocessing/_harmony/_kernels/_normalize.py

+    int tid = threadIdx.x;  // Thread index within the block
+
+    // Ensure we're within matrix bounds
+    if (row >= rows) return;


no error? so is that a convolution that’s expected to run with invalid arguments?

Intron7: Yes, kind of. That comes from blocks that extend past the end of the data, e.g. a block of 32 but only 29 cells.
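
The arithmetic behind that answer can be sketched in plain Python. This assumes one thread per row and a block size of 32, which the excerpt does not confirm; `launch_shape` and `simulate_kernel` are hypothetical helpers, not part of the PR.

```python
def launch_shape(rows: int, threads_per_block: int = 32) -> tuple[int, int]:
    """Return (number of blocks, number of surplus threads) for `rows` rows."""
    blocks = (rows + threads_per_block - 1) // threads_per_block  # ceiling division
    surplus = blocks * threads_per_block - rows
    return blocks, surplus


def simulate_kernel(rows: int, threads_per_block: int = 32) -> list[int]:
    """Mimic the guard `if (row >= rows) return;` with plain Python loops."""
    blocks, _ = launch_shape(rows, threads_per_block)
    touched = []
    for block_idx in range(blocks):
        for thread_idx in range(threads_per_block):
            row = block_idx * threads_per_block + thread_idx
            if row >= rows:  # surplus thread: exits quietly, no error
                continue
            touched.append(row)
    return touched


print(launch_shape(29))
```

Because whole blocks are launched, some threads have no row to work on; the guard is the usual way to let them exit, so it is not an error path.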

flying-sheep · 2024-12-17T12:59:13Z

src/rapids_singlecell/preprocessing/_harmony/__init__.py

+    return X
+
+
+def _normalize_cp(X: cp.ndarray, p: int = 2) -> cp.ndarray:


why name it “p”?

Intron7: That’s the name of the parameter in torch.

flying-sheep · 2024-12-17T13:07:22Z

src/rapids_singlecell/preprocessing/_harmony/__init__.py

+        _clustering(
+            Z_norm,
+            Pr_b,
+            Phi,
+            R,
+            E,
+            O,
+            n_clusters,
+            theta,
+            tol_clustering,
+            objectives_harmony,
+            max_iter_clustering,
+            sigma,
+            block_proportion,
+        )


OK, this is the reason why PLR0917 exists. Don’t suppress it, specify everything by name instead.

flying-sheep · 2024-12-17T13:08:34Z

src/rapids_singlecell/preprocessing/_harmony/__init__.py

+        Z_hat = _correction(Z, R, Phi, O, ridge_lambda, correction_method)
+        Z_norm = _normalize_cp(Z_hat, p=2)
+        if verbose:
+            print(f"\tCompleted {i + 1} / {max_iter_harmony} iteration(s).")


don’t you have some logging infra you should use instead?

I don’t want to see a single print statement in any library I use (if it has a CLI, that one may have print statements)
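
One way this could look with the standard library's logging module. The logger name `harmony_sketch` and the function `run_iterations` are hypothetical, and the project may have its own logging setup instead.

```python
import logging

logger = logging.getLogger("harmony_sketch")  # hypothetical logger name


def run_iterations(max_iter_harmony: int) -> int:
    """Report progress through the logger instead of print."""
    for i in range(max_iter_harmony):
        # Lazy %-formatting: the message is only built if a handler wants it.
        logger.info("Completed %d / %d iteration(s).", i + 1, max_iter_harmony)
    return max_iter_harmony
```

An application that wants the progress messages can opt in, for example with `logging.basicConfig(level=logging.INFO)`; otherwise the library stays silent.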

flying-sheep · 2024-12-17T13:31:01Z

src/rapids_singlecell/preprocessing/_harmony/__init__.py

+        if verbose:
+            print(f"\tCompleted {i + 1} / {max_iter_harmony} iteration(s).")
+
+        if _is_convergent_harmony(objectives_harmony, tol=tol_harmony):


so this one is an out parameter of _clustering? what else is being modified? You should make that clear
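
Two ways to make the data flow explicit, as a sketch. Both helpers are hypothetical and only show the pattern; they say nothing about what `_clustering` actually modifies.

```python
def record_in_place(objectives: list[float], value: float) -> None:
    """Append `value` to `objectives`.

    Note: `objectives` is modified in place (it acts as an out parameter).
    """
    objectives.append(value)


def record_pure(objectives: list[float], value: float) -> list[float]:
    """Return a new list with `value` appended; the input is left untouched."""
    return [*objectives, value]


history = [1.0]
record_in_place(history, 0.5)          # history is now [1.0, 0.5]
extended = record_pure(history, 0.25)  # history is unchanged by this call
```

Either a docstring note or a returned value tells the caller which arguments change; the second form also makes that visible at the call site.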

Co-authored-by: Philipp A. <flying-sheep@web.de>

Intron7 · 2024-12-17T14:24:52Z

Removed all prints and added a warning if harmony didn’t converge.
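
A sketch of how such a warning could be raised. The relative-improvement check, the helper names, the warning text, and the default tolerance are all assumptions; the actual `_is_convergent_harmony` may differ.

```python
import warnings


def _is_convergent(objectives: list[float], tol: float) -> bool:
    """Hypothetical check: the last step improved by less than tol, relatively."""
    if len(objectives) < 2:
        return False
    previous, current = objectives[-2], objectives[-1]
    return (previous - current) < tol * abs(previous)


def run(objective_values: list[float], tol: float = 1e-4) -> bool:
    """Consume precomputed objective values and warn if convergence never triggers."""
    objectives: list[float] = []
    for value in objective_values:
        objectives.append(value)
        if _is_convergent(objectives, tol):
            return True
    warnings.warn(
        "Harmony did not converge; consider increasing the iteration limit.",
        RuntimeWarning,
        stacklevel=2,
    )
    return False
```

Unlike a print, the warning can be filtered, turned into an error, or captured in tests by the caller.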

add basic code

00c8846

Intron7 requested review from ilan-gold and removed request for ilan-gold December 11, 2024 19:23

Intron7 marked this pull request as draft December 11, 2024 19:23

Intron7 added 3 commits December 16, 2024 10:36

add typing

8c80ab3

add returns

5c7061e

add typing to fuse

eaf3014

Intron7 added the run-gpu-ci label Dec 16, 2024

github-actions bot removed the run-gpu-ci label Dec 16, 2024

Intron7 requested a review from flying-sheep December 16, 2024 09:55

add comments and starting point

a20472d

Intron7 added the run-gpu-ci label Dec 16, 2024

github-actions bot removed the run-gpu-ci label Dec 16, 2024

remove fuse clip

36b0d94

Intron7 added the run-gpu-ci label Dec 16, 2024

github-actions bot removed the run-gpu-ci label Dec 16, 2024

Intron7 marked this pull request as ready for review December 16, 2024 10:35

update releases

80d7ab9

Intron7 added the run-gpu-ci label Dec 16, 2024

github-actions bot removed the run-gpu-ci label Dec 16, 2024

Merge branch 'main' into update-harmony

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

d98c1fd

Intron7 added the run-gpu-ci label Dec 16, 2024

github-actions bot removed the run-gpu-ci label Dec 16, 2024

flying-sheep requested changes Dec 17, 2024

Intron7 and others added 2 commits December 17, 2024 14:58

Intron7 added 2 commits December 17, 2024 15:26

update partial fixes

389fe27

update

3668726

Intron7 added the run-gpu-ci label Dec 17, 2024

github-actions bot removed the run-gpu-ci label Dec 17, 2024

Intron7 and others added 2 commits December 17, 2024 16:41

update kwags

5570ef2

Merge branch 'main' into update-harmony

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

880fbaa

Intron7 added the run-gpu-ci label Dec 17, 2024

add comment for _clustering

028393a

github-actions bot removed the run-gpu-ci label Dec 17, 2024

Intron7 added the run-gpu-ci label Dec 17, 2024

add release note

faa93ff

github-actions bot removed the run-gpu-ci label Dec 17, 2024

Intron7 requested a review from flying-sheep December 17, 2024 15:55

Intron7 added the run-gpu-ci label Dec 17, 2024

Intron7 enabled auto-merge (squash) December 17, 2024 15:55

github-actions bot removed the run-gpu-ci label Dec 17, 2024
