Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

recoverDir option doesn't appear to work #21

Closed
zrlewis opened this issue Jan 17, 2019 · 10 comments
Closed

recoverDir option doesn't appear to work #21

zrlewis opened this issue Jan 17, 2019 · 10 comments
Assignees

Comments

@zrlewis
Copy link

zrlewis commented Jan 17, 2019

Hi-

I had a a RepeatModeler (version DEV) job time out and I am trying to restart using this command:

RepeatModeler -pa 16 -recoverDir RM_28668.SunJan131256362019 -database repeatDB >& repeatModel_restart3.out

However, RM doesn't seem to recognize completed rounds in the directory. Here's the error:

RepeatModeler Version DEV
================================
Search Engine = ncbi
Random Number Seed: 1547740043
Database = repeatDB ......
  - Sequences = 54313
  - Bases = 2854780185


Oops...the ./RM_28668.SunJan131256362019 run did not get passed round-1.
It makes more sense to restart this run from the beginning.
Remove the -recoverDir option and rerun the program.

Yet there are completed rounds in the directory. See below:

ls ./RM_28668.SunJan131256362019
consensi.fa  families.stk  round-1  round-2  round-3  round-4  round-5

Any ideas on how to get the job restarted while recognizing completed rounds?

Thanks!
Zack

@rmhubley
Copy link
Member

rmhubley commented Apr 4, 2019

The DEV release is not meant to be run by users -- it's the active development branch of RepeatModeler. What you want is the latest release from the "releases" link instead.

@rmhubley rmhubley closed this as completed Apr 4, 2019
@rmhubley rmhubley reopened this Apr 4, 2019
@rmhubley
Copy link
Member

rmhubley commented Apr 4, 2019

I misspoke about the "DEV" release. If you did in fact, download this release from the RepeatMasker website or from the "releases" link on github then you may have the full released version after all. I just realized that during our switch from CVS to GIT the RepeatMasker & RepeatModeler version designation was incorrectly set in the downloads.

Back to your issue. The way RepeatModeler determines if a round completed is to look for particular files in each of the round directories:

  • round-1/consensi-refined.fa or, if this file
    is empty/missing it looks for round-1/sampleDB-1/fa.rscons.filtered
  • round-2/consensi.fa
    ...
  • round-n/consensi.fa

Are these files there and non-empty?

@rmhubley rmhubley self-assigned this Apr 5, 2019
@zrlewis
Copy link
Author

zrlewis commented Apr 9, 2019

Hi,

Thanks for taking a look at this.

round-1:

-rw-r--r-- 1 zrl3 dunn  39M Feb  5 16:14 sampleDB-1.fa
-rw-r--r-- 1 zrl3 dunn  80M Feb  5 16:14 sampleDB-1.fa.lfreq
-rw-r--r-- 1 zrl3 dunn 2.6M Feb  5 16:14 sampleDB-1.fa.rscons
-rw-r--r-- 1 zrl3 dunn    0 Feb  5 16:14 sampleDB-1.fa.rscons.filtered

However, consensi.fa from subsequent rounds are empty:

-rw-r--r-- 1 zrl3 dunn 0 Feb  5 16:14 round-2/consensi.fa
-rw-r--r-- 1 zrl3 dunn 0 Feb  5 16:11 round-3/consensi.fa
-rw-r--r-- 1 zrl3 dunn 0 Feb  5 16:11 round-4/consensi.fa

By the way, I have now gotten full runs to finish when starting from scratch. However, making sure that the recoverDir option is working appropriately still seems useful.

@rmhubley
Copy link
Member

rmhubley commented Apr 9, 2019

Right...so this looks like Round-1 didn't even complete. The directory doesn't contain the "consensi-refined.fa" and the sampleDB-1.fa.rscons.filtered is empty. I suspect there is something strange with the round-1/sampleDB-1.fa.rscons file. If you send me that file I could take a look further. The bottom line is that it failed on the first step so even if we find the problem, the recover mechanism is still going to have to start from the very beginning.

@Juke34
Copy link

Juke34 commented Dec 4, 2019

Hi I have exactly the same problem. Wanted to resume a job (stopped due to time limit). Apparently it was working on round 6 when it stopped but when I try to resume I get this messge: Oops...the /scratch/jacda119/RM_12063.FriNov291313512019 run did not get passed round-1

In the round1 folder I have:

-rw-rw-r-- 1 jacda119 jacda119 45093970 Nov 29 13:18 sampleDB-1.fa
-rw-rw-r-- 1 jacda119 jacda119 73244770 Nov 29 13:18 sampleDB-1.fa.lfreq
-rw-rw-r-- 1 jacda119 jacda119  1454634 Nov 29 13:24 sampleDB-1.fa.rscons
-rw-rw-r-- 1 jacda119 jacda119        0 Nov 29 13:24 sampleDB-1.fa.rscons.filtered

In the subsequent round (2 to 6 ) the consensi.fa file is empty.

I use RepeatModeler version open-1.0.8.
Here is the sampleDB-1.fa.rscons file from round1.
sampleDB-1.fa.rscons.zip

@jebrosen
Copy link
Member

jebrosen commented Dec 4, 2019

@Juke34 How did you install and configure RepeatScout and RepeatModeler? Is the path you configured for RepeatScout missing the program filter-stage-1.prl?

@Juke34
Copy link

Juke34 commented Dec 4, 2019

I use repeatmodeler 1.0.8 from bioconda.
There is RepeatScout in the path but nothing called filter-stage-1.prl.

@jebrosen
Copy link
Member

jebrosen commented Dec 4, 2019

Yes, the bioconda package is broken - in particular that issues looks like bioconda/bioconda-recipes#9988.

For this reason we still recommend manual installation (http://www.repeatmasker.org/RepeatModeler/), or out new containerized version that is suitable for some users (https://github.com/Dfam-consortium/TETools).

@Juke34
Copy link

Juke34 commented Dec 4, 2019

I need Conda recipe, I will try to fix it. thank you for your help.

@Juke34
Copy link

Juke34 commented Dec 6, 2019

I have fixed the Conda recipe, now `filter-stage-1.prl is accessible in the path but my output is still the same for round1:

-rw-rw-r-- 1 jacda119 jacda119 44977382 Dec  6 09:54 sampleDB-1.fa
-rw-rw-r-- 1 jacda119 jacda119 72311486 Dec  6 09:54 sampleDB-1.fa.lfreq
-rw-rw-r-- 1 jacda119 jacda119  1419475 Dec  6 10:01 sampleDB-1.fa.rscons
-rw-rw-r-- 1 jacda119 jacda119        0 Dec  6 10:01 sampleDB-1.fa.rscons.filtered

any idea?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants