How can I generate the training datasets? #14

llfzllfz · 2022-09-19T10:41:45Z

I've download the RNAStralign from the mxfold2, and it has 8 subfolders. With your code in process_data_newdataset.py, I just find the os.listdir(), and it can't solve the subfolders. So what should I do to generate the training datasets?
Thanks.

sperfu · 2022-09-21T02:52:21Z

Hi there,

It depends on how you would like to deal with these data. In our work, we merged all these files in the RNAStralign dataset into one folder and use all the dataset for training. If you choose to check the performance on various species, you may need to use these separated subfolders as illustrated in e2efold paper. So all in all, it depends on how you would like to operate.

Thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How can I generate the training datasets? #14

How can I generate the training datasets? #14

llfzllfz commented Sep 19, 2022

sperfu commented Sep 21, 2022

How can I generate the training datasets? #14

How can I generate the training datasets? #14

Comments

llfzllfz commented Sep 19, 2022

sperfu commented Sep 21, 2022