-
Notifications
You must be signed in to change notification settings - Fork 199
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
mmseqs splitdb does not preserve annotation of originating fasta files #376
Comments
You should probably use the MPI support within MMseqs2 to do this: MMseqs2 MPI will automatically split either the query or target database to fit within memory and will produce a single result database. You'll have to compile MMseqs2 with MPI support though (
|
I was trying to avoid MPI so far... Mostly because I run mmseqs as part of a nextflow pipeline, and I'm not even sure if it is possible to use MPI from there, because nextflow usually takes care of the parallelization. |
I added the line to create all the necessary symlinks for |
This works 🎉 |
Expected Behavior
I use
splitdb
to runmmseqs search
in parallel on a HPC (SGE).For each match, I would like to retrieve the name of the original fasta file with
mmseqs convertalis --format-output "...,qset,tset,..."
.Current Behavior
Specifying
qset
ortset
leads to a segmentation fault. Runningsearch
andconvertalis
on the full db works without issues.Steps to Reproduce (for bugs)
MMseqs Output (for bugs)
Your Environment
Include as many relevant details about the environment you experienced the bug in.
45c4de7f1daefa06b45688195305eadedaea4d97
CentOS Linux 7 64bit / Linux 3.10.0-1127.13.1.el7.x86_64
The text was updated successfully, but these errors were encountered: