2023-cytomarkerpaper-analysis

Code to reproduce the figures and results from the cytomarker paper.

Data directory configuration

Prior to running, the following data files need to be placed in the data directory:

singlecells.csv from the cytomarker paper data (download from zenodo here: https://zenodo.org/records/13891857)
The nygc multimodal pbmc data set from: https://datasets.cellxgene.cziscience.com/de42a173-458a-429c-b129-c26bcd3adb3b.h5ad, named as nygc-pbmc.h5ad
The transcriptome and proteome data from Nicolet et al. 2022 from https://doi.org/10.1371/journal.pone.0276294.s006
The protein-RNA correlation table from the Gygi Lab here: https://gygi.hms.harvard.edu/data/ccle/Table_S4_Protein_RNA_Correlation_and_Enrichments.xlsx, put into the depmap sub-directory in data

To run all:

cd 2023-cytomarkerpaper-analysis
snakemake --cores all # user can specify number of cores

data/sce_screen_full.rds and data/sce_screen_subsample.rds
/figs/screen_cluster_heatmap_unscaled.png and /figs/screen_cluster_heatmap_scaled.png to be used in cluster interpretation. User should then create data/cluster-interpretation-nov23.xlsx (what clusters are what cell types)
results/nygc_pbmc_subsampled.rds
figs/screen-vs-scrna.pdf
figs/rna-protein-scatter.pdf and figs/cytomarker_sens_spec_no_ms.pdf
figs/heatmap_mammary_single_cell.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
barcoding-scripts		barcoding-scripts
data		data
imc-analysis		imc-analysis
notebooks		notebooks
scripts		scripts
.gitignore		.gitignore
README.md		README.md
Snakefile		Snakefile
cytomarker-paper-analysis.Rproj		cytomarker-paper-analysis.Rproj