Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Other file formats not directly supported by htslib #68

Closed
charlesgregory opened this issue Apr 26, 2021 · 2 comments
Closed

Other file formats not directly supported by htslib #68

charlesgregory opened this issue Apr 26, 2021 · 2 comments
Labels
enhancement New feature or request question Further information is requested

Comments

@charlesgregory
Copy link
Contributor

I think we should support bed, gtf, gff3 and other bioinformatics formats (we already support fastq). We are already past the point of being bindings for htslib. This also aligns well with #55. Conversion from bed coordinates to sam or vcf coordinates should be supported directly. Potentially also supporting PAF?

@charlesgregory charlesgregory added enhancement New feature or request question Further information is requested labels Apr 26, 2021
@jblachly
Copy link
Member

Agree in principle

I have a repo GFF3D (with a sweet 3D logo 🚀 ) -- this is a good candiate for inclusion, although we need a GFF3 writer as well.

What do you envision for , say, a BED reader? it is pretty simple.

@charlesgregory
Copy link
Contributor Author

charlesgregory commented Apr 26, 2021

Pretty simple. We should conform with bed format, at least the 12 columns that UCSC and Ensemble agree on. Or we could build as we go kinda similar to the way gff3d does. Get fields if they exist. Other than that use a BGZFile for reading (gets us bgzf, gzip, and network support) and parse each line into a set of our coordinate system. Other fields are optional. Bedgraph could be another format to support. Though we could go crazy trying to support all the formats.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants