-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Gff annotations #354
Gff annotations #354
Conversation
…ncluding specifying strand as +/- and going to 1-based locations.
…in translate. It might be more sensible to move this into export in case no translations are done.
…d by the accession number
Output from `augur translate` and `augur export v2` is GFF-like. `augur export v1` produces BED-like coordinates. See JSON schemas for details.
with a few minor changes, we can also make augur use annotations like this:
This already works for vcf, but it would be quite straight forward to also allow this for fasta alignments. The reference sequence could then be supplied as fasta or simply by name and we don't need to mess around with genbank files anymore. this gff/tsv is much easier to edit. |
I'd def like to keep supporting GenBank though - at least for me that's the fastest way to get annotations for something new. Genbank's GFF export doesn't seem to capture all features. But agree that if you're working on something long-term (or sufficiently in need of editing) this would be an easier format to edit and maintain. Definitely try out any new GFF parser on the TB GFF & others before dumping BioGFF for them - they can be much less tidy than your example! 😅 |
Moves feature annotation to a subset of GFF syntax. Closes #187, as future extensions can be added by extending the GFF features we support as needed. This results in changes to the output of
augur translate
andaugur export v2
(see below). The format of mutations is unchanged.annotations
start
end
strand
seqid
type
"+"
or"-"
"+"
or"-"
augur translate
output"+"
or"-"
augur export v1
output1
or-1
augur export v2
output"+"
or"-"
mutations
nt & aa are both 1-based