forked from evarol/HYDRA
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME.txt
143 lines (92 loc) · 4.85 KB
/
README.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
HYDRA
Section of Biomedical Image Analysis
Department of Radiology
University of Pennsylvania
Richard Building
3700 Hamilton Walk, 7th Floor
Philadelphia, PA 19104
Web: https://www.med.upenn.edu/sbia/
Email: sbia-software at uphs.upenn.edu
Copyright (c) 2018 University of Pennsylvania. All rights reserved.
See https://www.med.upenn.edu/sbia/software-agreement.html or COPYING file.
Author:
Erdem Varol
===============
1. INTRODUCTION
===============
This software performs clustering of heterogenous disease patterns within patient group. The clustering is based on seperating the patient imaging features from the control imaging features using a convex polytope classifier. Covariate correction can be performed optionally.
===============
2. TESTING & INSTALLATION
===============
This software has been primarily implemented in MATLAB for Linux operating systems.
----------------
Requirements
----------------
- Matlab optimization toolbox
- Matlab version >2014
----------------
Installation
----------------
Hydra can be run directly in a matlab environment without compilation.
OPTIONAL:
If the user wants to run hydra as a standalone executable, then it must be compiled as following (using the additionally obtained matlab compiler "mcc"):
Run the following command in a MATLAB environment:
mcc -m hydra.m
-----------------
Test
-----------------
We provided a test sample in the test folder.
To test in matlab enviroment, use the command:
hydra('-i','test.csv','-o','.','-z','test_covar.csv','-k',3,'-f',3)
To test in command line using the compiled executable, use the command:
hydra -i test.csv -o . -z test_covar.csv -k 3 -f 3
This runs a HYDRA experiment which may take a few minutes. The test case contains a subset of a functional MRI study dataset by T. Satterwaithe comprising 100 subjects and their functional ROI's. The output is the clustering labels of the input subjects (only patients are clustered) at varying clustering levels. Also, the clustering stability at varying levels is output to show the rationale for choosing the clustering level.
-----------------
Test Verification
-----------------
Pre-computed HYDRA results have been included in directory "Pre_computed_test_results". The user may verify that their test results match the pre-computed results to confirm proper set-up. If the clustering occurred properly, ARI for clustering level k=3 should be greater than that of clustering level k=2.
==========
3. USAGE
==========
I. Running "HYDRA":
Here is a brief introduction to running HYDRA. For a complete list of parameters, see --help option.
To run this software, you will need an input csv file, with the following mandatory fields in the following column order:
(Column 1) ID: ID for subject
(Column 2---(last minus 1)) features: features to be used for clustering
(Column (last)) groups: label whether the subject is control (-1) or patient (1)
NOTE: Controls must be strictly -1 and patients must be 1 label.
NOTE: Label headers names are not strict.
An example input csv file looks as following:
ID, feature_1, feauture_2, feature_3, group
subject_1, 5, 1, 79.3, -1
subject_2, 10, 1, 71.4, 1
subject_3, 3, 1, 82.7, -1
Optionally, you can provide a covariate file that will be used to remove covariate effects from imaging features before HYDRA analysis. The covariate file has the following format:
(Column 1) ID: ID for subject
(Column 2---(last)) covariates: covariates of subjects
An example covariate csv file looks as following:
ID, age, sex
subject_1, 29, 1
subject_2, 35, 1
subject_3, 51, 0
If you install the package successfully, there will be two ways of running HYDRA:
1. Running HYDRA in a matlab environment, a simple example:
hydra('-i','test.csv','-o','.','-z','test_covar.csv','-k',3,'-f',3)
2. Running matlab compiled HYDRA executables in the command line, a simple example:
hydra -i test.csv -o . -z test_covar.csv -k 3 -f 3
The software returns:
1. HYDRA_results.mat in the specified output directory.
This mat file stores the following variables
CIDX - clustering indices for subjects (rows) at varying levels (columns)
ARI - adjusted rand index of clustering at varying levels, clustering level at the highest ARI should be selected
ID - subject ID of rows
===========
4. REFERENCE
===========
If you find this software useful, please cite:
Varol, Erdem, Aristeidis Sotiras, Christos Davatzikos, and Alzheimer's Disease Neuroimaging Initiative. "HYDRA: Revealing heterogeneity of imaging and genetic patterns through a multiple max-margin discriminative analysis framework." NeuroImage 145 (2017): 346-364.
===========
5. LICENSING
===========
See https://www.med.upenn.edu/sbia/software-agreement.html or COPYING.txt file.