Skip to content

File Format: Mutation Data (Samples)

Mark Keller edited this page Jan 31, 2019 · 1 revision

The samples file is used to map samples to patients, for example when mapping clinical data (stored by patient) to mutation data (stored by sample).

A samples file must contain two columns:

  • Patient: The patient ID. Should match the patient ID in the clinical file.
  • Sample: The sample ID. Should match the sample ID in the mutation counts files.

Multiple sample IDs may map to the same patient ID.

An example of this file is as follows:

Patient Sample
TCGA-OR-A5J1 TCGA-OR-A5J1-01A-11D-A29I-10
TCGA-OR-A5J2 TCGA-OR-A5J2-01A-11D-A29I-10
... ...