-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support transposed assay data #42
Comments
Works fine as far as
(I still need to work on the sessionInfo():
|
FWIW, starting with HDF5Array 1.19.7,
|
Should be fixed in devel with v1.1.5 |
I have the same issue with |
@royfrancis Can you please provide some code/file to reproduce the issue? I'm hoping this exact issue is solved but you could be getting the same warning for other reasons. |
Many hours later... Now I have a docker image to reproduce the error.
The h5ad file that I am converting is unfortunately huge. Extended GBmap (11GB) from here. Rename
It worked with a few other smaller files that I tried. For example 1295 cells or 7999 cells or 141401 cells. Maybe it's an issue with the h5ad file? 🤔 To build image from scratch, place the three files below into a directory: Dockerfile
env.yml
convert.R
Run this in that directory to build the docker image. |
I have had a quick look at this. I think what is happening is that the dataset is too big and transposing the |
I don't know the context but if the goal is to support Bioconductor work flows does it make sense to
which has done the transformation anyway? And is more-or-less instant, since it does not actually load the entire data into memory? Oops I see a similar comment from @hpages upstream. And also perhaps that the problem is in |
Yeah I think this would be the correct approach for the R reader but not sure it helps with the Python reader where we already have the data in memory. This kind of scalability is probably another reason to work on the R reader more, at the very least you avoid having two copies of everything. |
Was this issue ever solved? I have the same problem:
And the resulting counts matrix is all zeroes. |
Zellkonverter didn't work for my file of interest, so my solution was simply to export as text and convert into R format. The code I used is below. This solution is extremely memory intensive depending on the size of your dataset. There are probably better ways to do this. H5AD -> CSV -> R
|
@royfrancis Thanks for your response! I ended up being able to just download the raw data I needed from a public repository into R format so skipping the h5 step. Hope this issue will be fixed soon! |
@jenellewallace It seems like you came up with a solution for your case but for anybody else we made updates to the R reader in the |
Hi everyone, I would like to jump in on this. I am working with the dataset from GSE174188 ( In
However, after switching to
It turns out that that the warning "raw 'X' matrix does not support transposition and has been skipped" is the underlying cause of the assays and rowdata dimensions being mismatched, which finally results in SummarizedExperiment complaining. In the light of this thread, I also tried with |
|
Note that I am still able to load the data with older versions on the same machine, so I do not expect memory issues to be the culprit here... One more hint: it seems that this line
results in a 32K x 1.2M matrix in the old versions, but now only returns a 1999 x 1.2M matrix. Let me know if I can provide any more info to resolve this! |
My understanding (from @hpages) is that transposed assay data matrices should be supported in HDF files, but after downloading Human Cell Atlas h5ad data at https://data.humancellatlas.org/explore/projects/f83165c5-e2ea-4d15-a5cf-33f3550bffde/expression-matrices?catalog=dcp2 (look for the h5ad files) I see
and indeed the matrix is all zeros.
This can be reproduced programmatically with
The text was updated successfully, but these errors were encountered: