Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

blastn 2.15 support #105

Closed
muffato opened this issue Jul 15, 2024 · 5 comments
Closed

blastn 2.15 support #105

muffato opened this issue Jul 15, 2024 · 5 comments
Assignees
Labels
bug Something isn't working question Further information is requested

Comments

@muffato
Copy link
Member

muffato commented Jul 15, 2024

When switching to blastn 2.15 (tried both 2.15.0--pl5321h6f7f691_0 and _1), there is this warning on stderr:

The -taxids command line option requires additional data files. Please see the section 'Taxonomic filtering for BLAST databases' in https://www.ncbi.nlm.nih.gov/books/NBK569839/ for details.

which suggests we could be doing something wrong, but I can't find on https://www.ncbi.nlm.nih.gov/books/NBK569839/#_usrman_BLAST_feat_Taxonomic_filtering_fo_ any information on making the option work.

In the source code c++/src/algo/blast/blastinput/blast_args.cpp I can find that the warning is printed whenever an exception is raised

        try{ 
            tb.reset(new CTaxonomy4BlastSQLite());
        }    
        catch(CException &){
            LOG_POST(Warning << "The -taxids command line option requires additional data files. Please see the section 'Taxonomic filtering for BLAST databases' in https://www.ncbi.nlm.nih.gov/books/NBK569839/ for details.");
        }    

so it seems there really is something wrong, but I don't know what. It'd be great to get more information about that exception, what it means, where exactly it's raised, etc/.

I also tried downloading a recent NT databases and faced the same problem. Maybe we're missing some files ?

@muffato muffato converted this from a draft issue Jul 15, 2024
@muffato muffato added bug Something isn't working question Further information is requested labels Jul 15, 2024
@tkchafin
Copy link

tkchafin commented Oct 1, 2024

Just posting as I think this covers embedding the taxids in custom blastdbs, I can look in more detail when I get a chance later
https://www.ncbi.nlm.nih.gov/books/NBK569841/
https://www.ncbi.nlm.nih.gov/books/NBK569839/#usrman_BLAST_feat.Taxonomic_filtering_fo

@muffato
Copy link
Member Author

muffato commented Oct 2, 2024

We download the NR database from the NCBI and it should already contain the taxon IDs. The problem is that I had the impression that the error was only showing up with 2.15 (same database and same command-line otherwise). Let me find an example

@muffato
Copy link
Member Author

muffato commented Oct 3, 2024

Yes, here is an example

$ cd ~mm49/link/scratch123/blastn/214
$ singularity exec -B /lustre,/nfs /nfs/treeoflife-01/teams/shared/nextflow/cache/nxf_singularity/depot.galaxyproject.org-singularity-blast-2.14.1--pl5321h6f7f691_0.img /bin/bash $PWD/command.sh
Using ./current/nt

$ singularity exec -B /lustre,/nfs /nfs/treeoflife-01/teams/shared/nextflow/cache/nxf_singularity/quay.io-singularity-blast-2.15.0--pl5321h6f7f691_1.img /bin/bash $PWD/command.sh
Using ./current/nt
The -taxids command line option requires additional data files. Please see the section 'Taxonomic filtering for BLAST databases' in https://www.ncbi.nlm.nih.gov/books/NBK569839/ for details.

If I remember correctly, same problem with a fresh download of the NT database at ~mm49/link/scratch123/blastn/db

@tkchafin tkchafin self-assigned this Nov 20, 2024
@tkchafin
Copy link

tkchafin commented Nov 21, 2024

tracking progress in JIRA but adding the solution here for posterity

Apparently we need to stage the tax* files into the working dir. This seems to conflict with what the online docs say, but at least the solution on our end is simple. I'll patch the blastn module to copy these over

(base) tc25@farm22-head2:~/blast$ cp current/tax* .
(base) tc25@farm22-head2:~/blast$ singularity exec -B /lustre,/nfs https://depot.galaxyproject.org/singularity/blast:2.16.0--hc155240_2 /bin/bash $PWD/command.sh
Using ./current/nt

@muffato
Copy link
Member Author

muffato commented Nov 21, 2024

🙌🏼

@github-project-automation github-project-automation bot moved this from Todo to Done in Genome After Party Nov 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working question Further information is requested
Projects
Status: Done
Development

No branches or pull requests

2 participants