provide pema main data product in a 7-level taxonomy format #52

Open
hariszaf opened this issue Apr 17, 2023 · 3 comments
Labels
enhancement New feature or request FAIR improvement LW mid priority priority for the LW developers

Comments

@hariszaf
Owner

It would be super useful to return the main PEMA output (the OTU/ASV table) in a 7-level taxonomy format, meaning every taxonomy assignment is written as:

d__Bacteria; p__Abyssubacteria; c__SURF-5; o__SURF-5; f__SURF-5; g__SURF-5; s__SURF-5 sp003598085
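
To make the target format concrete, here is a minimal sketch of a parser and formatter for such strings. It assumes the seven rank prefixes appear in the fixed order shown in the example (`d`, `p`, `c`, `o`, `f`, `g`, `s`) and are separated by semicolons; the function names are hypothetical and not part of PEMA.

```python
# Rank prefixes in the order used by the example above (assumed to be fixed).
RANK_PREFIXES = ["d", "p", "c", "o", "f", "g", "s"]


def parse_seven_level(lineage: str) -> dict:
    """Split a 'd__...; p__...; ...' string into a {prefix: name} dict.

    Raises ValueError if the string does not contain exactly seven ranks
    in the expected order.
    """
    parts = [part.strip() for part in lineage.split(";")]
    if len(parts) != len(RANK_PREFIXES):
        raise ValueError(f"expected 7 ranks, got {len(parts)}")
    ranks = {}
    for prefix, part in zip(RANK_PREFIXES, parts):
        tag = prefix + "__"
        if not part.startswith(tag):
            raise ValueError(f"expected rank starting with {tag!r}, got {part!r}")
        ranks[prefix] = part[len(tag):]
    return ranks


def format_seven_level(ranks: dict) -> str:
    """Write a {prefix: name} dict back out; missing ranks become empty."""
    return "; ".join(f"{p}__{ranks.get(p, '')}" for p in RANK_PREFIXES)


example = (
    "d__Bacteria; p__Abyssubacteria; c__SURF-5; o__SURF-5; "
    "f__SURF-5; g__SURF-5; s__SURF-5 sp003598085"
)
ranks = parse_seven_level(example)
```

A check like this could be run over every row of the OTU/ASV table to confirm that all assignments really have seven levels before the table is released.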
@hariszaf hariszaf added the enhancement New feature or request label Apr 17, 2023
@hariszaf
Owner Author

hariszaf commented Jun 13, 2023

Regarding COI, this is now covered under #56: the outputs are already in the required 7-level format.

Regarding 16S, we are still waiting for the Silva update. However, we have been waiting for a while and are getting a bit fed up, so it would be useful to do this ourselves. For advice on how to do this (and whether it is feasible), consult @hariszaf and @cpavloud.

@hariszaf
Owner Author

Regarding the ITS gene and the Unite database: one thing you could do is download the General FASTA release file and extract the sequence IDs from its headers.

For example:
>Glomeraceae|AM076560|SH146432.05FU|refs|k__Fungi;p__Glomeromycota;c__Glomeromycetes;o__Glomerales;f__Glomeraceae;g__;s__uncultured_Glomus

Here, AM076560 is the sequence ID.

Using that ID, you can look up in NCBI the organism the sequence comes from:
https://www.ncbi.nlm.nih.gov/nuccore/AM076560
and, from there, its NCBI taxonomy ID.
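
As a rough sketch of the first step, the header can be split on `|` to pull out the sequence ID and the lineage. This assumes every header has the same five pipe-separated fields as the example above; the function names and the field labels in the returned dict are hypothetical. Note that the Unite lineage starts with `k__` rather than `d__`, so a mapping step would still be needed to reach the 7-level format requested in this issue.

```python
def parse_unite_header(header: str) -> dict:
    """Split a Unite General FASTA header into its pipe-separated fields.

    Assumes five fields, as in the example header:
    name | sequence ID | SH code | type | lineage
    """
    fields = header.lstrip(">").strip().split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields, got {len(fields)}")
    name, accession, sh_code, entry_type, lineage = fields
    ranks = {}
    for part in lineage.split(";"):
        # 'g__' has an empty value; partition keeps it as an empty string.
        prefix, _, value = part.partition("__")
        ranks[prefix] = value
    return {
        "name": name,
        "accession": accession,
        "sh_code": sh_code,
        "type": entry_type,
        "ranks": ranks,
    }


def nuccore_url(accession: str) -> str:
    """Build the NCBI nuccore page URL for a sequence ID (pattern from the link above)."""
    return f"https://www.ncbi.nlm.nih.gov/nuccore/{accession}"


header = (
    ">Glomeraceae|AM076560|SH146432.05FU|refs|k__Fungi;p__Glomeromycota;"
    "c__Glomeromycetes;o__Glomerales;f__Glomeraceae;g__;s__uncultured_Glomus"
)
record = parse_unite_header(header)
url = nuccore_url(record["accession"])
```

The sketch only builds the URL; retrieving the organism and taxonomy ID from NCBI in bulk is left out here.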

@kmexter
Collaborator

kmexter commented Jun 14, 2023

There may be some interplay with #29 here.

@kmexter kmexter added update update a resource used in pema to a more recent version LW mid priority priority for the LW developers and removed update update a resource used in pema to a more recent version labels Jun 14, 2023