-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Repeatmodeler package not working #9988
Comments
|
Exactly same issue here. Manually installed RepeatModeler worked well on the same assembly.
|
coda Repeatmodeler package still not working properly |
I believe the underlying issue is here: The RepeatScout package in bioconda does not include |
I'm working on it with this PR #19088. I guess it could work like it is in this PR (without nseg) but to be perfect it would be nice to include nseg. But there is currently no recipe for it. nseg is available here ftp://ftp.ncbi.nih.gov/pub/seg/nseg. |
In the build3 of RepeatScout nseg and filter-stage-1.prl are now included, but the result seems still the same. How can we check manually which step is failing? Maybe one of this tool do not behaves like they should |
I believe at this point the remaining bug is in the There could be other problems in the custom |
Thank you for your help, I succeeded to fix that today. Now this step is working fine.
|
@Juke34 Does that happen for every model or only some of them? I can try to reproduce that in a clean environment... probably next week. |
I found! It is because the PATH to RepeatClassifier, Refiner and TRFMask are wrong. They are called directly in the RepeatModeler folder (in share), that cause an error e.g: while they have to be called as the other tools by the bin folder where there is a wrapper to call them like that: But I'm getting close, I found a (nasty) way to fix the problem by modifying the code during installation with a sed command. I'm trying locally and it seems to work ... will see if I face up another problem |
Yes... the |
It run until the end now. Except at the very end I have this message
Is this RepeatPeps.lib.psq important? What it is use for? Can we skip this step? If we need this file how can we include it in the package? I don't know from where it is supposed to come |
I think that message is wrong - it should say to re-run |
That means I need to fix the repeatmasker recipe too to create this file. Could you tell me the steps needed to create this file without using the configure? There is a file called RepeatPeps.lib so I guess there is one step to create the RepeatPeps.lib.psq from it. |
It used to be done by the package - https://github.com/bioconda/bioconda-recipes/blob/master/recipes/repeatmasker/build.sh#L22. Like RepeatModeler, the latest version of RepeatMasker has a |
Actually is just a file made by makeblastdb command. No need to touch the RepeatMasker recipe for that, I can fix it directly from the RepeatModeler recipe then. |
The recipe is now fixed in repeatmodeler-1.0.11 build pl526_2 (see #19137). The only thing remaining is the last step (RepeatClassifier) that use RepeatMasker that will be skipped by default. You will get this message:
This is because no nucleotide repeat library is included in RepeatMasker. So it is recommended to download the DB of your choice (get licence to use RepBase) to get this working properly.
|
@helltish you can close the issue now. |
Hello, I just got this bug
How exactly should I fix it? Should I run this command:
So, running the above command generated the following three files:
And then restarting using the command (-recoverDir) results in this:
So...am I good? |
Do The RepeatMasker.lib should be there $CONDA_PREFIX/share/RepeatMasker/Libraries/ |
Hello, I had the exact same issue.
It should look like this
Hope it helps. (I'm using RepeatMasker 2.0.1) repeatmasker 4.0.9_p2 pl526_2 bioconda |
I checked repeatmasker 4.0.9_p2 and 4.1.0 and indeed the RepeatMaskerLib db in not set properly (on OSX at least)...
|
For RepeatMasker version 4.0.9_p2 the easiest would be to do For version 4.1.0 we use the following command: |
I am still getting this error when trying to generate the I don't quite follow all the above discussion on how to fix this. Where in the conda bin directory should the .lib.nsq file be placed? Nathan |
Can you please describe this line? I'm new to this. |
So i ran the above code you suggest in the miniconda share dir. ./configure downloads required databasesecho -e "\n2\n$(dirname $(which rmblastn))\n\n5\n" > tmp && ./configure < tmp But the directory still only looks like this. |
I configured it to use HMMER3.1 however. Would this be the problem? |
@mudithekanayake, For the first Of course, you can do it in the interactive mode by setting one by one, but I was suggesting a kind of shortcut. |
@insectnate I don't think that would be a problem. |
I did that and still get a Library directory of |
Can you do the |
I did the ./configure where I am prompted to confirm the $PATH to each of the dependencies. I have tried it both with specifying RMBlast and HMMER3.1 but never get the makeblastdb files shown above. Is there another way to do configure that I am missing where there are more settings? |
|
Ok I will try that. By this do you mean deleting the conda install of RepeatMasker and installing again? Thanks for all your help. Nathan |
Yes. Or, you could create another environment name. |
Possibly related, the script If I manually add that to
Note also for the above comments, I'm pretty sure it's not kosher to modify files/dirs in the conda tree after install by conda. I think this breaks conda. |
I had a same issue that RepeatMasker.lib and its blastdb were not provided by RepeatMasker configure. The solution I could was downloading the previous version-4.1.0 of RepeatMasker independently to conda then configured. Hope it works for you guys. |
Now that #25163 is complete, I was able to download There are still a few reasons one might prefer a manual installation, for example the LTR structural search method introduced RepeatModeler 2.0 requires some dependencies that are not yet in bioconda. |
I recently had the same issue as described above, with the error message "missing /RepeatMasker/Libraries/RepeatPeps.lib.psq" when I tried to run RepeatClassifier. Just in case anyone else experiences this issue, I solved it by re-configuring RepeatMasker and manually downloading all the libraries from https://home.cc.umanitoba.ca/~psgendb/doc/BIRCH/doc/local/pkg/RepeatMasker/Libraries/ to the directory RepeatMasker/Libraries. |
@JuliaLopezDelgado Please specify the version you have used. Because depending the version the problem is solved. |
I am using RepeatMasker v.4.1.2 and RepeatModeler v.2.0.2 |
@JuliaLopezDelgado Bioconda had packages for RepeatMasker 4.1.1 (broken) and RepeatMasker 4.1.2-p1 (fixed), skipping over RepeatMasker 4.1.2 (FWIW, also fixed). Did you mean 4.1.2-p1? The libraries available at home.cc.umanitoba.ca are over 10 years old. It is only by chance that they could even work today - and worse, old libraries might appear to work and silently fail or produce wrong/misleading results. For that reason I generally recommend against trying to "mix" files from new and old versions of RepeatMasker and RepeatModeler. |
Thanks for letting me know @jebrosen ! I have rerun the RepeatMasker configure script and also ran: This is my RepeatMasker/Libraries/ contents: Do you have any suggestions on how to fix this? I have manually installed RepeatMasker v.4.1.2 and RepeatModeler v.2.0.2. Thanks in advance, |
@JuliaLopezDelgado It looks like you are missing the file I noticed in your file list that there are files from multiple versions of RepeatMasker. Mixing files from different versions can cause confusion and sometimes errors, so we usually recommend installing updates to RepeatMasker and RepeatModeler in new directories. If you continue experiencing issues with a manual installation of RepeatMasker/RepeatModeler, please see also our GitHub pages and issue trackers for these projects, which are not specific to bioconda: https://github.com/rmhubley/RepeatMasker/issues, https://github.com/Dfam-consortium/RepeatModeler/issues. |
Thank you @jebrosen! The makeblastdb command generated the .psq file and RepeatClassifier now works. I didn't install multiple versions of RepeatMasker, so I'm unsure why there are files from different versions. These are the commands I used to install it: Does this look okay? Thank you! |
Sorry @JuliaLopezDelgado, after a re-read I think I got a few version numbers confused. That looks fine to me, although the latest version of Dfam is now up to Dfam 3.5. |
but when i type makeblastdb -dbtype prot -in RepeatMasker/Libraries/RepeatPeps.lib, it shows that "Error: NCBI C++ Exception: |
The path to the lib is wrong try: Before you run the command check if the path is correct: |
Thank you so much for letting me know !! now repeatmodeler can work properly |
Dear developer Team,
we have compared the bioconda Repeatmodeler package with a local installation of Repeatmodeler (with all dependencies).
The local installation of Repeatmodeler produced an output, which is reasonable for our input file. The conda package does not produce any output. Please find our logs to compare. The input data was the same in both cases.
Thanks a lot in advance!
The text was updated successfully, but these errors were encountered: