Add neuron name/type to neuprint_connection_table #132

jefferis · 2020-08-16T19:34:15Z

No description provided.

* no changes to logic, just (a bit) easier to read

* neurons without types were being dropped * proper tests for new functionality

romainFr

That looks good. Do we want to also consider returning even more metadata fields, like "notes", "status", "pre", "downstream" and the likes? They're usually helpful in subsequent analysis together with the type information.

jefferis · 2020-08-17T18:24:36Z

I guess the alternative is to merge in all metadata. I thought this was a good compromise for many purposes. I will likely add another function that adds or updates metadata for an existing data.frame. Something like this:

neuprint_add_meta <- function(x, idname='bodyid', ignore.case = TRUE, ...) {
  if(!is.data.frame(x)) stop("I expect a data frame")
  cx=colnames(x)
  if(isTRUE(ignore.case)) {
    cx=tolower(cx)
    idname=tolower(idname)
  }
  matchcol=stats::na.omit(colnames(x)[match(idname, cx)])
  if(length(matchcol)!=1)
    stop("id column:", idname, " not present exactly once in input data frame!")
  
  # nb only check unique ids
  meta=neuprint_get_meta(unique(x[[matchcol]]), ...)
  # just merge the body id column
  merged=merge(x[matchcol], meta, by.x=matchcol, by.y='bodyid', all.x = T, sort=F)
  # make sure we have same number of rows in both tables
  stopifnot(isTRUE(all.equal(nrow(x),nrow(merged))))
  # make sure that the id orders match exactly
  merged=merged[match(x[[matchcol]], merged[[1]]), ]
  # and then check that ids are identical
  stopifnot(isTRUE(all.equal(x[[matchcol]], merged[[1]])))
  # now set columns that are present in meta (overwriting dups)
  x[colnames(merged)]=merged
  x
}

You would then use it like this:

mbon01ds=neuprint_connection_table("MBON01", threshold=5)
mbon01ds=neuprint_add_meta(mbon01ds, idname="partner")
# do your analysis

romainFr · 2020-08-17T19:52:45Z

Yes, that's basically what our workflows looks like right now. So pulling it right when pulling the connections would save the overhead of finding them in the database twice. But I suppose it is a matter what the most common workflows are?

On a related topic, we usually reformat our connection tables into a to/from (name.from/name.to, type.fom/type.to...) format to not be dependent on the "prepost" column. Would such a reformatting function be of interest for neuprintr?

jefferis · 2020-08-17T21:56:36Z

Do you want to sketch out your format?

romainFr · 2020-08-17T22:12:39Z

Yes, starting from a connection table with added metadata for both the partners and the "source" neurons, I do something like :

   connectionTable <- connectionTable %>% mutate(from = ifelse(prepost==1,bodyid,partner),
                                                  to = ifelse(prepost==1,partner,bodyid),
                                                  name.from = as.character(ifelse(prepost==1,name,partnerName)),
                                                  name.to = as.character(ifelse(prepost==1,partnerName,name)),
                                                  type.from = as.character(ifelse(prepost==1,type,partnerType)),
                                                  type.to = as.character(ifelse(prepost==1,partnerType,type))
    ) %>%
      select(-bodyid,-partner,-name,-partnerName,-partnerType,-type,-prepost)
    return(connectionTable)

I'm thinking that to put the connections in context it would then make sense to add to that downstream.from(or post.from) and upstream.to (or pre.to) and their ROI specific equivalents if the request is ROI specific.

The other potential fields (status.from and status.to, notes.from and to) may also come in handy in some analysis/brain regions.

I'd be happy to make a PR for that if that's useful.

jefferis · 2020-08-22T14:43:29Z

@romainFr I'm merging this, but I'd be very happy to see a PR along the lines that you suggest so long as it stays as lean as possible.

jefferis added 3 commits August 10, 2020 02:34

refactor neuprint_connection_table cypher

302f045

* no changes to logic, just (a bit) easier to read

Add more detailed connection table

45536d1

fix new bug dropping rows in connection table

22e7fca

* neurons without types were being dropped * proper tests for new functionality

jefferis requested a review from romainFr August 16, 2020 22:14

romainFr reviewed Aug 17, 2020

View reviewed changes

jefferis merged commit 217d0a9 into master Aug 22, 2020

jefferis deleted the feature/richer-conn-table branch May 21, 2022 06:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add neuron name/type to neuprint_connection_table #132

Add neuron name/type to neuprint_connection_table #132

jefferis commented Aug 16, 2020

romainFr left a comment

jefferis commented Aug 17, 2020 •

edited

Loading

romainFr commented Aug 17, 2020

jefferis commented Aug 17, 2020

romainFr commented Aug 17, 2020

jefferis commented Aug 22, 2020

Add neuron name/type to neuprint_connection_table #132

Add neuron name/type to neuprint_connection_table #132

Conversation

jefferis commented Aug 16, 2020

romainFr left a comment

Choose a reason for hiding this comment

jefferis commented Aug 17, 2020 • edited Loading

romainFr commented Aug 17, 2020

jefferis commented Aug 17, 2020

romainFr commented Aug 17, 2020

jefferis commented Aug 22, 2020

jefferis commented Aug 17, 2020 •

edited

Loading