-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Example code for reading GATK4 GenomicsDBImport #12
Comments
The new query_variants_calls function is nice and does almost all I need. This is the code I came up with:
One issue I have is that Mutect2 generates a DP field in both FORMAT and INFO, one is the unfiltered one the filtered. Looks like the unfiltered is picked. That's fine, but AD is a vector with the filtered REF and ALT reads, but only the REF reads make it into the data.frame. That's a bug, right? And quick question, is there an easy way to cycle through the available arrays + columns, say in batches of 100,000 columns? My database is small, so reading by chromosome is fine, but wondering if there is a cleaner way. Best, Update iterate through all arrays:
|
@lima1, sorry I was not watching this repository. Just wanted to let you know this is work in progress. Please let me know if you have solved your issue. Otherwise I will get back with answers soon.
Are you open to using spark or mpi or any parallel processing paradigm? |
Thanks @nalinigans . The code works well enough for my purpose, but would be curious to hear your suggestions once the code matured! Thanks for all your work, it’s really nice. |
Hi Nalini,
I was playing around with this package. Thanks a lot providing these R bindings! Much appreciated.
Do you have by any chance an example for iterating through a GATK4 GenomicsDBImport database?
I'm trying to build something similar to CreateSomaticPanelOfNormals which tries to fit beta-binomial distributions to all heterozygous variants. So simply extracting the AD field for all variants and all samples.
I'll figure it out if you don't have anything in place, but I thought I ask as it seems like a fairly standard task.
Thanks a lot,
Markus
The text was updated successfully, but these errors were encountered: