
What does the 'cds-nr98-core.fa' mean #142

Open
lychen83 opened this issue Jan 29, 2020 · 1 comment

@lychen83

Dear all,

In the default config file ptx.cfg, there is one line: --ref-cluster %bin%/../data/cds-nr98-core.fa
Does this mean that cds-nr98-core.fa is used as the reference for read extraction or for cp assembly?
It looks like the file cds-nr98-core.fa includes some cp gene sequences.

I have extracted cp reads from total RNA-seq reads. I just want to assemble the cp genes and use them for phylogenetic analysis. Can chloroExtractor generate oriented contigs from these extracted cp reads?

I appreciate any help.

Best regards,
Lingyun

@iimog
Member

iimog commented Jan 31, 2020

Hi Lingyun,

The file cds-nr98-core.fa does indeed contain 24 chloroplast genes from various species: accD, atpA, atpB, atpI, cemA, matK, ndhB, ndhK, petA, petB, psaA, psaB, psbA, psbB, psbC, psbD, rbcL, rpl2, rpoA, rpoB, rpoC1, rps12, rps2, rps4.
These are used in the scale_reads step to stop screening reads once a certain number of chloroplast reads has been found. They are also used to detect chloroplast contigs in the final assembly (if it is not already a single circular chloroplast). They are not used as references in the sense of a reference-guided assembly.
For your specific task: it might be possible to use chloroExtractor to get contigs for the separate genes, but this is clearly outside its scope (the scope is de novo assembly of the chloroplast genome from genomic reads), and I would not expect good results.
Feel free to try. Additionally, you can use any part of chloroExtractor that you think might be useful, including cds-nr98-core.fa.
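
If you want to reuse cds-nr98-core.fa on its own, it may help to check what is in it first. The following is only a minimal sketch, not part of chloroExtractor: it assumes the file is ordinary plain-text FASTA (header lines starting with ">"), and the function names read_fasta and summarize are made up for this example. It makes no assumption about how the headers encode gene or species names, so it only prints them as they are.

```python
import sys


def read_fasta(path):
    """Yield (header, sequence) pairs from a plain-text FASTA file."""
    header, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if line.startswith(">"):
                # A new record starts: emit the previous one, if any.
                if header is not None:
                    yield header, "".join(chunks)
                header, chunks = line[1:], []
            elif header is not None:
                chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def summarize(path):
    """Return (number of records, total sequence length) for a FASTA file."""
    count = total = 0
    for _, seq in read_fasta(path):
        count += 1
        total += len(seq)
    return count, total


if __name__ == "__main__":
    fasta_path = sys.argv[1]  # e.g. the path given to --ref-cluster
    for header, seq in read_fasta(fasta_path):
        print(f"{len(seq)}\t{header}")
    n_records, n_bases = summarize(fasta_path)
    print(f"{n_records} records, {n_bases} residues in total")
```

Saved under any name you like, it would be run with the path to cds-nr98-core.fa in your own installation as the only argument.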
Maybe @greatfireball has additional ideas?

Best regards,
Markus

@iimog iimog added the question label Jan 31, 2020