Skip to content

Shamir-Lab/Multi-Omics-Cancer-Benchmark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

Multi-Omics-Cancer-Benchmark

This repository contains the code that was used for the benchmark on TCGA data. All preprocessed datasets, survival data and clinical labels that were used in the analysis are available here: http://acgt.cs.tau.ac.il/multi_omic_benchmark/download.html.

To run the benchmark, some configuration is needed, mainly settings paths to the datasets, survival and clinical data. Additionally, paths to binaries are required for some methods (MultiNMF, rMKL-LPP).

The expected directory structure is: for the clinical data, a single directory with all clinical data files. The datasets and survival data are organized as follows: a single root directory, with subdirectories for every cancer type. The directory for each cancer type contains a file for every omic, and a file called "survival" with survival data. Omic files, survival data files and clinical label files should be formatted as the files in http://acgt.cs.tau.ac.il/multi_omic_benchmark/download.html.

NOTE: we discovered an error we made in the benchmark's code, where our choice of the number of clusters used by MCCA and LRACluster was not the one suggested by the authors. After changing the criterion to choose the number of clusters, the performance of both methods became worse. We therefore kept our version to choose the number of clusters. See a more detailed explanation in http://acgt.cs.tau.ac.il/multi_omic_benchmark/silhouette_error.html.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages