Make list of all GenBank accessions for type "environmental sample" or "ENV" #11

Open
max-mapper opened this issue Sep 8, 2022 · 1 comment

@max-mapper

I think we need to download all gbenv* files from https://ftp.ncbi.nlm.nih.gov/genbank/gbenv1 (e.g. gbenv1.seq.gz through gbenv72.seq.gz), extract them, and parse the VERSION line of each record (e.g. VERSION AB000684.1) to get a list of accessions like

AB000684.1
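
A minimal sketch of the parsing step, assuming the gbenv*.seq.gz files have already been downloaded locally and that each record has a VERSION line starting in the first column with the accession.version as its second whitespace-separated token. The function names `iter_versions` and `versions_from_gz` are made up for illustration.

```python
import gzip
from typing import Iterator, TextIO


def iter_versions(handle: TextIO) -> Iterator[str]:
    """Yield the accession.version token from every VERSION line."""
    for line in handle:
        # Assumption: VERSION lines start in column 1 of the flat file.
        if line.startswith("VERSION"):
            fields = line.split()
            # fields[0] is the keyword, fields[1] is e.g. "AB000684.1".
            if len(fields) >= 2:
                yield fields[1]


def versions_from_gz(path: str) -> Iterator[str]:
    """Stream accessions from one gzipped GenBank flat file."""
    # Text mode so lines arrive as str; the file is never fully unpacked.
    with gzip.open(path, "rt", encoding="ascii", errors="replace") as handle:
        yield from iter_versions(handle)
```

Reading through gzip directly avoids extracting each file to disk first. The results from every file could be collected into one set or written out one accession per line.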

Then we can use that list to filter all the environmental samples out of the NT fasta files the next time we run BWA.
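
The filtering step could look roughly like this. It is a sketch that assumes each NT fasta header starts with ">" followed directly by the accession.version (e.g. ">AB000684.1 some description"), which has not been checked against the actual NT chunks. `filter_fasta` is a hypothetical helper name.

```python
from typing import Iterable, Iterator, Set


def filter_fasta(lines: Iterable[str], excluded: Set[str]) -> Iterator[str]:
    """Yield fasta lines, skipping records whose ID is in `excluded`."""
    keep = True
    for line in lines:
        if line.startswith(">"):
            # Assumption: the first token after ">" is accession.version.
            parts = line[1:].split(None, 1)
            seq_id = parts[0] if parts else ""
            keep = seq_id not in excluded
        # Sequence lines inherit the decision made at their header.
        if keep:
            yield line
```

Because it works line by line, it can be run over an open file handle and written straight to a new fasta file without holding a whole NT chunk in memory.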

@max-mapper

I am assuming AB000684.1 shows up in one of the NT chunks somewhere; we might want to validate that.
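
One way to validate it, sketched under the same unverified assumption that NT fasta headers begin with ">" plus the accession.version. `missing_from_fasta` is a hypothetical helper that reports which of the accessions we care about never appear as a header ID.

```python
from typing import Iterable, Set


def missing_from_fasta(wanted: Set[str], lines: Iterable[str]) -> Set[str]:
    """Return the accessions in `wanted` not seen in any fasta header."""
    remaining = set(wanted)  # copy so the caller's set is untouched
    for line in lines:
        if not remaining:
            break  # everything was found, no need to read further
        if line.startswith(">"):
            parts = line[1:].split(None, 1)
            if parts:
                remaining.discard(parts[0])
    return remaining
```

An empty result for {"AB000684.1"} on a chunk would confirm the accession is present there in that exact form. A non-empty result on every chunk would suggest the header format differs from what is assumed here.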
