This repository outlines the work completed as part of the study "Limiting the impact of protein leakage in single-cell proteomics" (Leduc et al. 2024).
If you are performing a single-cell proteomics study, a permeability stain such as Sytox Green can help exclude damaged cells from downstream analysis. If permeabilized cells were not excluded during sample preparation, they can still be identified computationally to avoid artifacts in your analysis. To facilitate this task, we developed a classifier based on proteomic profiles from experimental data from intact and permeabilized cells.
To use the classifier, install the QuantQC package:
devtools::install_github("https://github.com/Andrew-Leduc/QuantQC")
library(QuantQC)
Once the package is installed, the relevant function is FindPermeableCells(). The input is a matrix (Protein X single cells) with UniProt identifiers as row names. An optional argument is the species: the default is species = "Mouse", and human is also supported with species = "Human".
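Before calling the classifier, it can help to confirm that the input has the expected shape. The sketch below is a minimal check, not part of QuantQC: `check_protein_matrix` is a hypothetical helper, the toy matrix is made up, and the accession pattern is the general UniProt accession format (isoform suffixes such as `-2` would be flagged).

```r
# General UniProt accession pattern (6 or 10 character accessions).
uniprot_pattern <- "^([OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9]([A-Z][A-Z0-9]{2}[0-9]){1,2})$"

# Hypothetical helper (not a QuantQC function): checks that the input is a
# numeric matrix with row names and reports row names that do not look like
# UniProt accessions.
check_protein_matrix <- function(mat) {
  stopifnot(is.matrix(mat), is.numeric(mat), !is.null(rownames(mat)))
  bad <- rownames(mat)[!grepl(uniprot_pattern, rownames(mat))]
  if (length(bad) > 0) {
    warning(length(bad), " row name(s) do not look like UniProt accessions: ",
            paste(head(bad, 5), collapse = ", "))
  }
  invisible(bad)
}

# Toy Protein X single cell matrix; "Actb" is a gene symbol, not an accession.
toy <- matrix(c(1.2, 0.8, NA, 1.1, 0.9, 1.0), nrow = 3, ncol = 2,
              dimnames = list(c("P60710", "P16858", "Actb"),
                              c("cell_1", "cell_2")))

bad_ids <- suppressWarnings(check_protein_matrix(toy))
print(bad_ids)
```

Rows flagged this way would need to be mapped to UniProt accessions before the matrix is passed to the classifier.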
The output is a vector with one value per single cell. Each value is the probability that the cell is permeabilized. The distribution may differ between data sets, but cells with probabilities above 0.2 were generally observed to be permeable.
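As a rough illustration of how the probabilities might be used, the sketch below flags and removes cells above the 0.2 cutoff. The matrix and the probability values are placeholders so that the example runs without QuantQC. With real data, the vector would come from the commented-out call. The cutoff is only a starting point and should be checked against the distribution in your own data.

```r
# Toy Protein X single cell matrix with UniProt accessions as row names.
set.seed(1)
prot_mat <- matrix(rnorm(3 * 6), nrow = 3, ncol = 6,
                   dimnames = list(c("P60710", "P16858", "P63017"),
                                   paste0("cell_", 1:6)))

# With real data the probabilities would come from the classifier:
# perm_prob <- FindPermeableCells(prot_mat, species = "Mouse")
# Placeholder values (assumed, one per cell) are used here instead.
perm_prob <- c(0.03, 0.45, 0.10, 0.81, 0.19, 0.22)
names(perm_prob) <- colnames(prot_mat)

# Approximate cutoff from the text; inspect hist(perm_prob) before fixing it.
cutoff <- 0.2
is_permeable <- perm_prob > cutoff

# Keep only cells classified as intact for downstream analysis.
intact_mat <- prot_mat[, !is_permeable, drop = FALSE]
print(table(is_permeable))
```

Keeping `is_permeable` as a metadata column, rather than dropping cells immediately, makes it easier to revisit the cutoff later.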
To reproduce the analysis from the paper, download this repository and run the scripts described below. The scripts are listed in order of analysis, but users can start from any point in the processing pipeline.
All data are read in from Google Drive links, so no manual data download is needed: just clone the scripts and run the analysis!
The first step uses QuantQC functions for initial processing and QC of the LC-MS/MS data and for metadata mapping. Outputs are Protein X single cell matrices for the Frozen and Fresh experiments and the associated metadata files.
Protein X single cell matrices are integrated via the LIGER algorithm (Welch et al. 2019) with mRNA-seq data from the same tissue to transfer cell type identification. Outputs are metadata files with an additional column for cell type identity.
The final script takes in the metadata and protein data matrices and performs the analysis of protein abundance differences between permeable and intact cells shown in Figures 1d,e and Figure 2 of Leduc et al. 2024.
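To give a sense of what a comparison between permeable and intact cells can look like, here is a generic sketch on simulated data. It is not the script from this repository and not necessarily the statistical approach used in the paper: the matrix, the group labels, the simulated depletion of `prot_1`, and the choice of a Welch t-test with Benjamini-Hochberg correction are all assumptions made for illustration.

```r
set.seed(42)
n_prot <- 4
n_cell <- 20

# Simulated log2 Protein X single cell matrix (placeholder data).
log2_mat <- matrix(rnorm(n_prot * n_cell), nrow = n_prot,
                   dimnames = list(paste0("prot_", 1:n_prot),
                                   paste0("cell_", 1:n_cell)))

# Assumed labels: first half of the cells permeable, second half intact.
permeable <- rep(c(TRUE, FALSE), each = n_cell / 2)

# Simulate leakage by lowering one protein in the permeable cells.
log2_mat["prot_1", permeable] <- log2_mat["prot_1", permeable] - 2

# Per-protein difference in mean log2 abundance (permeable minus intact).
diff_log2 <- rowMeans(log2_mat[, permeable, drop = FALSE], na.rm = TRUE) -
  rowMeans(log2_mat[, !permeable, drop = FALSE], na.rm = TRUE)

# Per-protein Welch t-test, then Benjamini-Hochberg adjustment.
p_val <- apply(log2_mat, 1, function(x) t.test(x[permeable], x[!permeable])$p.value)

result <- data.frame(protein = rownames(log2_mat),
                     diff_log2 = diff_log2,
                     p_adj = p.adjust(p_val, method = "BH"))
print(result[order(result$diff_log2), ])
```

Proteins with strongly negative differences would be candidates for leakage from permeable cells, subject to checking cell type composition in both groups.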
- Preprint: Leduc A, Xu Y, Shipkovenska G, Dou Z, Slavov N. Limiting the impact of protein leakage in single-cell proteomics. bioRxiv. doi: 10.1101/2024.07.26.605378
- nPOP sample preparation: Website | Download data from Leduc et al., 2022 | Download data from protocol