leiden nmi ari #24

adamgayoso · 2022-10-09T04:24:43Z

Adds Leiden NMI and ARI scores to more closely match scIB (which uses Louvain). This searches over 10 Leiden clustering resolutions and picks the one with the optimal NMI (as in scIB, except that scIB uses 20 resolution parameters).
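
To illustrate the selection step described above, here is a minimal pure-Python sketch, not the code in this PR. It assumes the candidate clusterings have already been computed and are passed in as a hypothetical `candidates` dict mapping resolution to cluster assignments. The names `nmi`, `ari`, and `pick_best_resolution` are made up for the example, and the NMI uses arithmetic-mean normalization, which is an assumption about the intended variant.

```python
from collections import Counter
from math import comb, log


def nmi(a, b):
    """Normalized mutual information (arithmetic-mean normalization, an assumption)."""
    n = len(a)
    ca, cb, cab = Counter(a), Counter(b), Counter(zip(a, b))
    h_a = -sum(c / n * log(c / n) for c in ca.values())
    h_b = -sum(c / n * log(c / n) for c in cb.values())
    mi = sum(c / n * log(c * n / (ca[x] * cb[y])) for (x, y), c in cab.items())
    denom = (h_a + h_b) / 2
    # Both labelings trivial (single cluster each): treat as perfect agreement.
    return mi / denom if denom > 0 else 1.0


def ari(a, b):
    """Adjusted Rand index from pair counts; assumes at least two samples."""
    n = len(a)
    sum_ab = sum(comb(c, 2) for c in Counter(zip(a, b)).values())
    sum_a = sum(comb(c, 2) for c in Counter(a).values())
    sum_b = sum(comb(c, 2) for c in Counter(b).values())
    expected = sum_a * sum_b / comb(n, 2)
    max_index = (sum_a + sum_b) / 2
    return (sum_ab - expected) / (max_index - expected) if max_index != expected else 1.0


def pick_best_resolution(labels, candidates):
    """Return (resolution, nmi, ari) for the candidate with the highest NMI.

    `candidates` is a hypothetical dict: resolution -> cluster assignment list.
    """
    best = max(candidates, key=lambda r: nmi(labels, candidates[r]))
    return best, nmi(labels, candidates[best]), ari(labels, candidates[best])


# Toy usage: three hand-written clusterings standing in for Leiden output.
labels = [0, 0, 0, 1, 1, 1]
candidates = {
    0.2: [0, 0, 0, 0, 0, 0],  # under-clustered
    1.0: [1, 1, 1, 0, 0, 0],  # matches labels up to renaming
    2.0: [0, 0, 1, 2, 2, 3],  # over-clustered
}
best_res, best_nmi, best_ari = pick_best_resolution(labels, candidates)
```

Selecting on NMI and then reporting the ARI of that same clustering mirrors the description above. Whether the PR reports ARI at the NMI-optimal resolution or optimizes it separately is not stated here.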

codecov · 2022-10-09T04:26:49Z

Codecov Report

Merging #24 (318467f) into main (a258e0b) will decrease coverage by 0.51%.
The diff coverage is 90.24%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #24      +/-   ##
==========================================
- Coverage   93.75%   93.23%   -0.52%     
==========================================
  Files           9        9              
  Lines         288      325      +37     
==========================================
+ Hits          270      303      +33     
- Misses         18       22       +4

| Flag | Coverage Δ | |
|---|---|---|
| unittests | `93.23% <90.24%> (-0.52%)` | ⬇️ |

| Impacted Files | Coverage Δ | |
|---|---|---|
| `src/scib_metrics/_ari_nmi.py` | `92.15% <90.00%> (-7.85%)` | ⬇️ |
| `src/scib_metrics/__init__.py` | `100.00% <100.00%> (ø)` | |

justjhong · 2022-10-11T03:00:09Z

tests/test_basic.py

+
+def test_nmi_ari_cluster_labels_leiden_parallel():
+    X, labels = dummy_x_labels(return_symmetric_positive=True)
+    nmi, ari = scib_metrics.nmi_ari_cluster_labels_leiden(X, labels, optimize_resolution=True, n_jobs=2)


Is it possible to check this against some Leiden implementation to make sure the clusters are the same? Is this reliable for different random seeds?

Ah, never mind, I see it's Leiden vs. Louvain.

Leiden is faster, and the igraph implementation is even faster than the one scanpy uses.

justjhong · 2022-10-11T03:05:29Z

src/scib_metrics/_ari_nmi.py

+            )
+        except ImportError:
+            logger.info("Using for loop over resolutions. pip install joblib for parallelization.")
+            out = [nmi_ari_cluster_labels_leiden(X, labels, False, r) for r in resolutions]


This recursive pattern feels like a recipe for bugs. It would be much cleaner to split the else case out into a helper and then call that helper from both cases (optimize or not optimize).

leiden nmi ari

79a3f20

adamgayoso requested a review from justjhong October 10, 2022 17:23

justjhong reviewed Oct 11, 2022

no recursion

aaf7e90

justjhong approved these changes Oct 11, 2022

Merge branch 'main' into leiden

318467f

adamgayoso enabled auto-merge (squash) October 11, 2022 03:20

adamgayoso merged commit f88d530 into main Oct 11, 2022

adamgayoso deleted the leiden branch October 14, 2022 02:02
