-
Notifications
You must be signed in to change notification settings - Fork 166
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
output files "infercnv.references.txt" and "infercnv.observations.txt " not found #520
Comments
Hi @wcl2260 , 1- This is most likely related to 4-, the number of subclusters generated is too high, and because CNVs are called per subcluster, there are a lot more CNVs called and they are of lower confidence. This makes the Bayesian network step take a long time to run. 2- They used to be only 21 steps, but one of them has been split in two to enable changing the Bayesian filtering threshold without having to rerun the whole model, so the number increased to 22. There are some more changes coming so I was planning to update the wiki when those are released. 3- The output of those files when plotting has been disabled by default, but you can easily enable it again by adding the option "write_expr_matrix=TRUE" to 4- There are a few things here. First the "cluster_by_groups" option means that cells will be clustered first by the annotations provided as input before subclustering as opposed to combining all of them together before doing the subclustering. In your case, there is only 1 annotation for observation cells, "Epithelial_cells", so the option does not effectively change anything. Regards, |
@GeorgescuC Hi Christophe,
if (!requireNamespace("BiocManager", quietly = TRUE)) The newly installed package was turned out to be inferCNV (version 1.14.2). Could you please tell me how to update inferCNV to the latest version of inferCNV? |
@GeorgescuC Hi Christophe,
Error in seq_len(max(obs_annotations_groups)) : How to fix it? library(parallel) future::plan("multicore",workers=40) |
@GeorgescuC Hi Christophe, This is my infercnv code: future::plan("multicore",workers=40) Any ideas why this might be the case? |
Hi @wcl2260 , First, for the install issue, version 1.15.2 is on BioConductor's devel so you need to specify "BiocManager::install(version='devel')" before the install line. If you are unable to install from there, it might be due to the devel version using R version 4.3 as well, in which case you can install infercnv from this Github directly instead. For the subclustering, the results using "leiden_resolution=0.001" seem good based on the branches of the dendrogram, where the top level splits form a descending stair (like can be on the figure in your first post). The legend you see at the bottom with only "Epithelial_cells" will always display the annotations you provided as input, not the subclustering, except if specifically using the "plot_subclusters()" method. For the first error ("Error in seq_len(max(obs_annotations_groups)) :"), I encountered it while looking at a different post and fixed it in the latest commit. The issue happens when running plot_subclusters() on an object that was analyzed with cluster_by_groups=FALSE, but does not affect the rest of the run, and does not rerun if the subclustering finished, which is why when you rerun infercnv, a different error appeared. For the second issue, could you post the full log? Regards, |
Dear team,
Thanks for the great package!
I have several problems with running infercnv::run() function.
1-first, it has been running for over 1 week for about 8000 observations and 9221 refernces (Observation data size: Cells= 8389 Genes= 6045, Reference data size: Cells= 9221 Genes= 6045 ). Is this normal?
2-second, the run step jumped directly from 20 (STEP 20: Converting HMM-based CNV states to repr expr vals) to 22 (STEP 22: Denoising step) while the official website shows only 21 steps.
This is the script:
infercnv_obj = CreateInfercnvObject(raw_counts_matrix="D:/TBNC/12.inferCNV_scCNV/expFile.txt",
annotations_file="D:/TBNC/12.inferCNV_scCNV/metaFiles.txt",
delim="\t",
gene_order_file="D:/TBNC/12.inferCNV_scCNV/geneFile.txt",
ref_group_names=c("T_cells")
infercnv_obj = infercnv::run(infercnv_obj,
cutoff=0.1,
out_dir='infercnv_out_subclusters_mode_20230301/',
cluster_by_groups=TRUE,
analysis_mode="subclusters",
denoise=TRUE,
HMM=TRUE,
output_format = "pdf")
3-third, output files "infercnv.references.txt" and "infercnv.observations.txt " were not found. What might be the cause of the failure to output these files?
4-fourth, is it possible to discriminate which are malignant and which are not from this graph. Also, why is the graph did not show anysubcluster as option "cluster_by_group" was "TRUE"? Is any parameter missed?(please see image)
The text was updated successfully, but these errors were encountered: