Scripts for regenerating figures in the paper entitled:"Enterocyte-like differentiation defines the metabolic phenotype of CMS3 colorectal cancers and provides a therapeutic vulnerability." Download data from https://www.synapse.org/#!Synapse:syn53040345/ to run the scripts.
- organoids.RData: in vitro mouse models generated in this study and available at GSE223323
- invivo.organoids.RData (in vivo mouse models generated in this study and available at GSE270929)
- CellLine.GSE248558.RData (CRC cell lines treated with CPS1 inhibition and untreated controls available at GSE248558)
- Synapse.RData (human gene expression datasets proflied using U133 Plus 2.0 platform collected in the work of Guinney et al.)
- Synapse.High.RData (human gene expression datasets proflied using U133 Plus 2.0 platform collected in the work of Guinney et al. + TCGA CRC samples)
- scRNAseq.colon.GSE125970.RData (single cell data of normal colon collected from GSE125970)
- msigdb.v7.4.symbols.gmt (gene signatures obtained from Broad Institute)
- Synapse.genesets.gmt (gene signatures obtained from work of Guinney et al.)
- jem_20191130_tables2.xlsx (gene signatures of human cell types developed by Wang et al.)
- Haber.Combine.All.xlsx (gene signatures of mouse SI cell types developed by Haber et al.)
- GSE146476_Intestinal-region-specific-Wnt-signalling_gene_counts.txt (count data directly obtained from publicly available data at GSE146476)
- fastq2count.organoids.GSE223323.txt (GSE223323 and GSE270929)
- fastq2count.cellline.GSE248558.txt (GSE248558)
In the final files both count data and normalized data are available:
- data.handling.organoids.R --> organoids.RData (GSE223323)
- data.handling.invivo.organoids.R --> invivo.organoids.RData (GSE270929)
- data.handling.cellLine.GSE248558.R --> CellLine.GSE248558.RData (GSE248558)
- Scripts: data.handling.Synapse.R
- Scripts: Fig.4C.Synapse.Enterocyte.R, Fig.S4C.GSEA.KRAS.MUTvsWT.R
- Datasets: Synapse.RData, msigdb.v7.4.symbols.gmt
- Scripts: Fig.6E.S5A.GSEA.CPS1.Inhibition.R
- Datasets: CellLine.GSE248558.RData, msigdb.v7.4.symbols.gmt, jem_20191130_tables2.xlsx
- Scripts: Fig.4B.CellType.Metabolism.R
- Datasets: scRNAseq.colon.GSE125970.RData, msigdb.v7.4.symbols.gmt
- Scripts: Fig.4A.Genotype.Cell.All.R, Fig.3C.gsplot.cms.R
- Datasets: organoids.RData, Haber.Combine.All.xlsx, Synapse.genesets.gmt, msigdb.v7.4.symbols.gmt
- Scripts: Fig.S2.R
- Datasets: GSE146476_Intestinal-region-specific-Wnt-signalling_gene_counts.txt, organoids.RData
- Scripts: First.CMS.Stratification.R, Training.Mouse.Classifier.R, Fig.S3DE.Classifier.Coef.R, https://github.com/atorang/mouseCMS
- Datasets: organoids.RData, Synapse.genesets.gmt, msigdb.v7.4.symbols.gmt
- Scripts: Fig.3B.classification.R, https://github.com/atorang/mouseCMS
- Datasets: organoids.RData
- Scripts: Fig.4D.Met.Ent.Scores.R
- Datasets: Synapse.High.RData, msigdb.v7.4.symbols.gmt
- Scripts: Fig.S4B.ssGSEA.organoids.R, Fig.6B.ssGSEA.CellLines.R
- Datasets: organoids.RData, Synapse.genesets.gmt, msigdb.v7.4.symbols.gmt