Generate both gene statistics & sequences from GeenuFF db #96

Open
alisandra opened this issue Aug 7, 2019 · 1 comment
Labels: enhancement (New feature or request)

Comments

@alisandra
Contributor

Be able to generate both length statistics (N50, total, longest, shortest, etc.) and the actual sequences for at least:

  • mRNA
  • protein
  • pre-mRNA
  • introns
  • exons

and break the above down into all transcripts vs. only the longest transcript (or protein?) per super locus.
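As a rough illustration of the length statistics and the all-vs-longest breakdown, here is a minimal sketch. It assumes transcripts are already available as flat `(super_locus_id, transcript_id, length)` tuples; that layout and the function names `length_stats` and `longest_per_super_locus` are hypothetical and not part of the GeenuFF schema or API.

```python
def length_stats(lengths):
    """Summarize a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("need at least one length")
    total = sum(ordered)
    # N50: the length L such that sequences of length >= L
    # together cover at least half of the total length.
    running = 0
    n50 = ordered[-1]
    for length in ordered:
        running += length
        if 2 * running >= total:
            n50 = length
            break
    return {
        "count": len(ordered),
        "total": total,
        "longest": ordered[0],
        "shortest": ordered[-1],
        "N50": n50,
    }


def longest_per_super_locus(transcripts):
    """Keep the longest transcript for each super locus.

    `transcripts` is an iterable of (super_locus_id, transcript_id, length)
    tuples. This flat layout is an assumption for illustration only.
    Ties keep the transcript seen first.
    """
    best = {}
    for locus, transcript, length in transcripts:
        if locus not in best or length > best[locus][1]:
            best[locus] = (transcript, length)
    return best


# Hypothetical example data
transcripts = [("sl1", "t1", 900), ("sl1", "t2", 1500), ("sl2", "t3", 600)]

all_stats = length_stats(length for _, _, length in transcripts)
longest = longest_per_super_locus(transcripts)
longest_stats = length_stats(length for _, length in longest.values())
```

The same `length_stats` call would apply unchanged to intron, exon, or protein lengths once those ranges can be extracted.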

@alisandra added the enhancement (New feature or request) label on Aug 7, 2019
@alisandra self-assigned this on Aug 7, 2019
@alisandra
Contributor Author

Still to do:

ranges:

  • UTRs
  • protein
  • CDS with phase accounted for
  • intergenic
  • upstream & downstream?
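To make a couple of these ranges concrete, here is a small sketch of deriving intron ranges from exon coordinates and of trimming a CDS fragment by its phase. It assumes 0-based, half-open, plus-strand coordinates and the GFF3 meaning of phase (the number of bases to remove from the start of the feature to reach the first complete codon). The function names are hypothetical, and nothing here reflects how GeenuFF stores features.

```python
def introns_from_exons(exons):
    """Return the gaps between consecutive exons as (start, end) ranges.

    Assumes 0-based, half-open coordinates on the plus strand and
    exons that belong to a single transcript.
    """
    ordered = sorted(exons)
    introns = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end:  # skip touching or overlapping exons
            introns.append((prev_end, next_start))
    return introns


def cds_in_frame(cds_seq, phase):
    """Trim a CDS fragment so it starts and ends on codon boundaries.

    `phase` follows the GFF3 meaning: the number of bases to drop from
    the start of the fragment to reach the first complete codon.
    """
    if phase not in (0, 1, 2):
        raise ValueError("phase must be 0, 1 or 2")
    trimmed = cds_seq[phase:]
    # Drop any trailing partial codon as well.
    return trimmed[: len(trimmed) - len(trimmed) % 3]


# Hypothetical example data
exons = [(0, 100), (250, 400), (150, 200)]
introns = introns_from_exons(exons)
in_frame = cds_in_frame("GATGAAATAGC", phase=1)
```

Minus-strand features would need the coordinates (and the end from which phase is counted) flipped, which this sketch deliberately leaves out.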

general functionality:

  • filter masked
  • statistical summary of lengths
  • post-processing, e.g. k-mer counting within the extracted sequences
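For the masked filter and the k-mer post-processing, a possible shape is sketched below. It assumes soft-masking is encoded as lowercase bases and hard-masking as `N`, and that extracted sequences arrive as a plain `{name: sequence}` dict. The names `masked_fraction` and `count_kmers` and the 0.5 threshold are hypothetical choices for illustration.

```python
from collections import Counter


def masked_fraction(seq):
    """Fraction of bases that are soft-masked (lowercase) or N."""
    if not seq:
        return 0.0
    masked = sum(1 for base in seq if base.islower() or base in "Nn")
    return masked / len(seq)


def count_kmers(seq, k):
    """Count overlapping k-mers, ignoring case and skipping k-mers with N."""
    if k < 1:
        raise ValueError("k must be at least 1")
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if "N" not in kmer:
            counts[kmer] += 1
    return counts


# Hypothetical extracted sequences
sequences = {"t1": "ATGATG", "t2": "atgnnnnnTTA"}

# Keep only sequences that are at most half masked (arbitrary threshold).
kept = {name: seq for name, seq in sequences.items()
        if masked_fraction(seq) <= 0.5}

kmer_counts = {name: count_kmers(seq, 3) for name, seq in kept.items()}
```

Keeping the filter and the counting as separate steps means either could be applied to any of the extracted range types without change.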
