Rf parallel #20

yael1994 · 2021-03-14T15:38:46Z

No description provided.

yael1994 · 2021-04-21T08:00:41Z

https://trello.com/c/xWGS9AxJ/11-parallel-random-forests

yael1994 · 2021-04-21T08:03:03Z

https://trello.com/c/dVXOJFRg/13-stop-random-forest-on-a-value-threshold

yael1994 · 2021-04-21T08:04:34Z

https://trello.com/c/eMb9vmYj/16-stop-phase-3-before-random-forest

shacharmo

Was this thoroughly tested? Are we getting the same or better results than before, and faster?
Are the results reproducible, i.e. does running the same experiments lead to the same results? (Hint: check for a seed in train_random_forest.)
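
One way to answer the reproducibility question is to train twice with the same seed and compare the outputs. The sketch below is a stand-in, not the code in this PR: `train_one_model` and `train_all` are hypothetical names, and the "training" is simulated with Python's `random` module. It only illustrates the pattern of giving each parallel job its own seed derived from a single base seed.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def train_one_model(seed):
    """Hypothetical stand-in for one stochastic training run.

    A private random.Random(seed) keeps the job independent of the global
    random state, so jobs running in parallel cannot disturb each other.
    """
    rng = random.Random(seed)
    # Pretend these numbers are feature importances produced by the fit.
    return [round(rng.random(), 6) for _ in range(5)]


def train_all(base_seed, n_models, max_workers=4):
    """Run n_models jobs in parallel, each with its own derived seed."""
    seeds = [base_seed + i for i in range(n_models)]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() yields results in input order, regardless of finish order.
        return list(pool.map(train_one_model, seeds))


if __name__ == "__main__":
    first = train_all(base_seed=42, n_models=8)
    second = train_all(base_seed=42, n_models=8)
    print("reproducible:", first == second)
```

If the two runs ever differ, the usual suspects are a job that reads the global random state or results that are collected in completion order instead of submission order.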

model_fitting/random_forest.py

shacharmo

Also, fix conflicts

IgOmeProfiling_pipeline.py

model_fitting/random_forest.py

shacharmo · 2021-05-03T18:08:16Z

model_fitting/train_random_forest.py

+
+
+def generate_heat_map(df, number_of_features, hits_data, number_of_samples, output_path):
+    train_data = np.log2(df+1) if hits_data else df 


Same comment as above: this doesn't handle p-values correctly.

yael1994 · 2021-05-04

I thought that, after we showed the client the logs, it was decided to keep the log transform only for hits.

shacharmo · 2021-05-10

Even if that is true (and I'm not sure they wanted it for p-values), p-values run in the opposite direction, i.e. 0 is the "best" value.

model_fitting/train_random_forest.py

shacharmo

The p-value case still needs to be addressed properly.
Note that the values are reversed: a value of 0 means the number of hits is greater than in all shuffles.

Regarding the log scale, maybe add it as a controllable parameter.
If so, do it in another PR.
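
To make the p-value point concrete: for hits, larger is stronger, while for p-values 0 is the strongest value, so the two cannot share one color scale unless one of them is flipped. The sketch below shows one possible way to handle both cases. `prepare_heat_map_values`, the `log_scale` flag, and the `1 - p` flip are assumptions for illustration, not what this PR implements.

```python
import numpy as np


def prepare_heat_map_values(values, hits_data, log_scale=True):
    """Return a NumPy array oriented so that larger always means "stronger".

    Hypothetical helper: the name, the log_scale flag, and the 1 - p flip
    are illustrative assumptions, not code from this PR.
    """
    values = np.asarray(values, dtype=float)
    if hits_data:
        # Hits: larger is already better; optionally compress with log2(x + 1).
        return np.log2(values + 1) if log_scale else values
    # P-values: 0 is the best value, so flip the direction.
    return 1.0 - values
```

An alternative would be to leave p-values untouched and reverse the color map instead; either way, the direction has to be handled explicitly for the non-hits case.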

IgOmeProfiling_pipeline.py

model_fitting/random_forest.py

yael1994 added 10 commits November 26, 2020 11:24

fix the color heatmap at random forest

216147d

fix the commit from the pull request

b65ff16

add a flag for stop before run random forest

ec5944d

strat to build run parallel random forest

68aaa18

random forest parallel with new file train_random forest

0200272

spaces

2fe6ba7

save results in new file at each model

c89503d

Arranging the code

73ef454

fix prints to summary file

8a4c6e8

add flages to module wraper

94dd7ec

yael1994 requested a review from shacharmo March 14, 2021 15:38

change the position of break

f552698

shacharmo requested changes Apr 25, 2021

model_fitting/random_forest.py (outdated, resolved)

model_fitting/random_forest.py (outdated, resolved)

model_fitting/random_forest.py (outdated, resolved)

yael1994 and others added 5 commits April 27, 2021 22:34

change the imports

375ac90

add seed for train random forest

766a435

Merge branch 'reads_filtration_change_seq_length' into RF_parallel

a72f396

move new params out to all pipeline

409d658

move new params out to all pipeline

952bd61

shacharmo requested changes May 3, 2021

yael1994 and others added 2 commits May 4, 2021 20:13

resolve the comments

c31ce03

Merge branch 'reads_filtration_change_seq_length' into RF_parallel

f803d87

shacharmo requested changes May 10, 2021

IgOmeProfiling_pipeline.py (outdated, resolved)

model_fitting/random_forest.py (outdated, resolved)

Fixed typeos

63112a8

shacharmo approved these changes May 19, 2021

Merge branch 'reads_filtration_change_seq_length' into RF_parallel

9525e37

shacharmo merged commit f29afc4 into reads_filtration_change_seq_length May 19, 2021
