Paths playground #15

flying-sheep · 2020-03-18T12:42:18Z

not a PR, just a place to comment and play around before this gets merged into AnnData or scanpy.

Done:

both obs_df and get_obs_vector accept paths in shorthand form ("X/Actb") or in long form (("obs", "A/B"))
obs_df returns a DataFrame with unique column names and resolves collisions smartly: "obsm/X_pca/0" becomes "X_pca1" while "obsm/protein/CD11b_TotalSeqB" becomes "CD11b_TotalSeqB"

TODO:

allow leaving things out when not ambiguous ("X_pca/0" should work)
allow sequences: [("obsm", "X_pca", [0, 1])] would expand to [("obsm", "X_pca", 0), ("obsm", "X_pca", 1)]
obsp support
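
The sequence expansion from the TODO list could look roughly like the sketch below. The helper names expand_path and expand_paths are made up for illustration and are not part of the PR; the sketch assumes that only list-valued components are expanded.

```python
from itertools import product
from typing import Any, Iterable, List, Tuple


def expand_path(path: Tuple[Any, ...]) -> List[Tuple[Any, ...]]:
    """Expand list-valued components into one path per combination.

    Hypothetical helper, not part of the PR.
    """
    # Wrap scalar components so every position is a list of alternatives.
    choices = [part if isinstance(part, list) else [part] for part in path]
    return [tuple(combo) for combo in product(*choices)]


def expand_paths(paths: Iterable[Tuple[Any, ...]]) -> List[Tuple[Any, ...]]:
    """Expand every path in a sequence and concatenate the results in order."""
    return [expanded for path in paths for expanded in expand_path(path)]


print(expand_paths([("obsm", "X_pca", [0, 1])]))
# [('obsm', 'X_pca', 0), ('obsm', 'X_pca', 1)]
```

Paths without a list component pass through unchanged, so mixed inputs keep their order.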

review-notebook-app · 2020-03-18T12:42:24Z

Check out this pull request on ReviewNB.

You'll be able to see Jupyter notebook diffs and discuss changes. Powered by ReviewNB.

ivirshup · 2020-03-18T12:59:16Z

paths_get.py

+    # path is shorthand: "obs/Foo" and "o/Foo"
+    if not isinstance(path, str):
+        return path
+    path = path.split("/")


Watch out for this, since column names in dataframes can currently have a "/" in them.

Also, we don't stop dataframe indices from having slashes. We're only expecting two "/" unless the value is in uns, right?

So if we use slashes we need actual parsing then? I mean things are pretty deterministic, so we could set the limit for splitting depending on the attribute. That would be super headachy when combined with fallbacks and shorthand though.

If I'm thinking about the same thing, I think we always needed actual parsing. Dataframe indices and columns can be arbitrary strings.
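
A minimal sketch of the "limit the split depending on the attribute" idea, for illustration only. The MAX_SPLITS table and the parse_path name are assumptions, not code from the PR, and the per-attribute limits are a guess at what "only two slashes unless it's in uns" would mean.

```python
from typing import Any, Dict, Optional, Tuple, Union

# Hypothetical table: how many "/" separators are honoured for each attribute.
# None means "split on every slash" (arbitrarily nested dicts in uns).
MAX_SPLITS: Dict[str, Optional[int]] = {
    "obs": 1,
    "var": 1,
    "X": 1,
    "layers": 2,
    "obsm": 2,
    "varm": 2,
    "uns": None,
}


def parse_path(path: Union[str, Tuple[Any, ...]]) -> Tuple[Any, ...]:
    """Turn a shorthand string into a tuple; long-form paths pass through."""
    if not isinstance(path, str):
        return tuple(path)
    attr, sep, rest = path.partition("/")
    if attr not in MAX_SPLITS:
        raise ValueError(f"unknown attribute {attr!r}")
    if not sep:
        return (attr,)
    limit = MAX_SPLITS[attr]
    # One separator was already consumed by partition(), hence limit - 1.
    parts = rest.split("/") if limit is None else rest.split("/", limit - 1)
    return (attr, *parts)


print(parse_path("obs/n_genes/raw"))  # ('obs', 'n_genes/raw')
print(parse_path("obsm/X_pca/0"))     # ('obsm', 'X_pca', '0')
```

This keeps a slash inside an obs column name intact, but it still cannot tell apart an obsm key that itself contains a slash, which is the part that would need real parsing or the long tuple form.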

ivirshup · 2020-03-18T13:01:22Z

I don't think we should allow access strings like: "obsm/X_pca/0", since the integer is ambiguous for dataframes. Is it meant to be the position, or the value?

flying-sheep · 2020-03-18T13:01:51Z

that’s why I didn’t want data frames in .obsm.

It’s unambiguous when X_pca is an array, and named stuff should go into dedicated anndata modes (which don’t exist yet)

we can fix that in AnnData by not allowing data frames with int colnames or colnames that look like ints…
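
For illustration, the check suggested here could be sketched as below. The function names are hypothetical and nothing like this is claimed to exist in AnnData; the sketch treats anything that int() can parse as "looking like an int".

```python
from numbers import Integral
from typing import Iterable


def looks_like_int(name: object) -> bool:
    """True for integer labels and for strings that int() can parse."""
    if isinstance(name, Integral):  # note: this also covers bool
        return True
    if isinstance(name, str):
        try:
            int(name)
        except ValueError:
            return False
        return True
    return False


def check_column_names(columns: Iterable[object]) -> None:
    """Raise if any column name could be confused with a positional index."""
    bad = [col for col in columns if looks_like_int(col)]
    if bad:
        raise ValueError(f"integer-like column names are ambiguous: {bad!r}")


check_column_names(["CD11b_TotalSeqB", "X_pca1"])  # passes silently
```

With such a rule in place, the trailing component of "obsm/X_pca/0" could only ever be a position.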

ivirshup · 2020-03-18T13:10:32Z

that’s why I didn’t want data frames in .obsm

They're so useful though. I think it'd be fine to just have something like ad.Ref("obsm/X_pca", 0)

flying-sheep · 2020-03-18T13:12:51Z

That’s just inconsistent and more verbose than a tuple. Why separate the first descent by a slash in a string and the second by a comma?

This seems simpler: ("obsm", "X_pca", 0)

Or this if we have an array: "obsm/X_pca/0"

paths_get.py

ivirshup · 2020-03-18T13:20:46Z

I was thinking more separating the container and the index. This could also allow:

ad.Ref("obsm/X_pca", (0, 1, 2))
ad.Ref("obs", r"leiden*")

Of course, the init could have a signature like __init__(*args) and pass off to a recursive parsing function.
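
One possible shape for that, as a rough sketch. Ref is only a proposal in this thread and does not exist in anndata; the convention that the last positional argument is the index and everything before it names the container is an assumption made here for illustration.

```python
from typing import Any, List, Sequence


def _flatten(parts: Sequence[Any]) -> List[str]:
    """Recursively flatten slash-separated strings and nested tuples/lists."""
    keys: List[str] = []
    for part in parts:
        if isinstance(part, str):
            # Simplification: assumes container keys never contain "/".
            keys.extend(part.split("/"))
        elif isinstance(part, (tuple, list)):
            keys.extend(_flatten(part))
        else:
            raise TypeError(f"container keys must be strings, got {part!r}")
    return keys


class Ref:
    """Hypothetical reference: a container path plus an index into it."""

    def __init__(self, *args: Any) -> None:
        if len(args) < 2:
            raise TypeError("Ref needs a container and an index")
        # Assumption: the last argument is the index, the rest is the container.
        *container, index = args
        self.container = tuple(_flatten(container))
        self.index = index

    def __repr__(self) -> str:
        return f"Ref({self.container!r}, {self.index!r})"


print(Ref("obsm/X_pca", (0, 1, 2)))  # Ref(('obsm', 'X_pca'), (0, 1, 2))
print(Ref("obs", r"leiden*"))        # Ref(('obs',), 'leiden*')
```

Because the index is kept apart from the container keys, it can stay an int, a tuple of ints, or a pattern string without being run through the slash splitting.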

flying-sheep · 2020-03-18T13:58:25Z

What’s the difference between a key and an index? Is thing in obs[thing] a key or an index? I’d say it’s both.

We used to call that index “component”, but that falls flat when using a data frame and specifying a column name …

Zethson · 2023-04-27T12:30:53Z

Given that this includes the citeseq tutorials I don't think that this is relevant anymore

ivirshup and others added 4 commits March 17, 2020 20:47

initial multiomics

e146095

Update with joint clustering + geometric normalization

bbef674

Add protein plots

e3ff044

First version of path getting module

5e882ac

flying-sheep mentioned this pull request Mar 19, 2020

Ref paths scverse/anndata#342

Closed

24 tasks

Zethson closed this Apr 27, 2023
