-
Notifications
You must be signed in to change notification settings - Fork 594
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CalculateGenotypePostiors minor updates to javadoc and logger type #5601
Conversation
- Clarify tool documentation: - Remove statistical notes and provide link to GATK Article#11074 for background and math - Consolidate Notes and Caveats sections - Clarify at top the three different sources of priors and tool behavior regarding these - Clarify for family priors the tool only considers trio groups - Add Laura's comment that recent updates allow the tool to appropriately apply priors to indels - Change logger.info to logger.warn for situation where trio pedigree file is incomplete - Note that in this situation, in the absence of other refinement, the results are identical to the input
Thank you at @ldgauthier for volunteering to review and for all of the feedback. The Travis checks are still in progress but the only code change is with the logger type. |
Codecov Report
@@ Coverage Diff @@
## master #5601 +/- ##
===============================================
- Coverage 87.047% 87.021% -0.026%
- Complexity 31525 31624 +99
===============================================
Files 1928 1934 +6
Lines 145343 145816 +473
Branches 16090 16107 +17
===============================================
+ Hits 126517 126891 +374
- Misses 12966 13048 +82
- Partials 5860 5877 +17
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some clarifications.
...va/org/broadinstitute/hellbender/tools/walkers/variantutils/CalculateGenotypePosteriors.java
Show resolved
Hide resolved
* independently. This assumption of independent draws follows from the assumption of Hardy-Weinberg equilibrium (HWE). | ||
* Thus, HWE is imposed on the likelihoods as a result of CalculateGenotypePosteriors.</p> | ||
* <p> | ||
* The tool uses priors from three different data sources: (i) one or more supporting germline population callsets |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we say something more like "The tool can use priors from three different data sources"? Most cases won't use all three.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What are the annotations, specifically?
The tool will use MLEAC if available or AC if MLEAC is not provided. AN is also required unless genotypes are provided for all samples.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ldgauthier, the tool document already covers the MLEAC/AC annotations in the bullet point on L51.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changing 'The tool uses priors from' to 'The tool can use priors from'. Will add in the specific annotations exactly as you word it in a section below as it is much clearer than the current bullet point.
...va/org/broadinstitute/hellbender/tools/walkers/variantutils/CalculateGenotypePosteriors.java
Show resolved
Hide resolved
* | ||
* <h3>Inputs</h3> | ||
* <p> | ||
* <ul> | ||
* <li>A VCF with genotype likelihoods, and optionally genotypes, AC/AN fields, or MLEAC/AN fields.</li> | ||
* <li>(Optional) A PED pedigree file containing the description of the relationships between individuals.</li> | ||
* <li>(Optional) A PED pedigree file containing the description of the relationships between individuals. Only | ||
* trio groups are considered in the calculations.</li> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we be absolutely specific that we mean mother-father-offspring trios?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is meant by trios is I think clear. However, I see now we do not mention this anywhere and so I will add this.
@@ -80,6 +86,17 @@ | |||
* For any non-SNP sites in the input callset, flat priors are applied. | |||
* </p> | |||
* | |||
* <p> | |||
* For the latest versions of the tool in which this message appears, the tool appropriately applies priors to indels. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
as of 4.0.5.0
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added. Thank you.
Back to you @sooheelee |
- Add tool order of ingestion of annotations MLEAC vs AC and detail on AN requirement - Specifically define that trios are mother-father-offspring - Add version that appropriately applies priors to indels (4.0.5.0) - Smooth writing
I've incorporated your feedback @ldgauthier. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 Sorry I missed this in the flurry of github emails yesterday.
No worries Laura. Thanks again for the feedback. |
joint posteriors (JL)
tojoint posteriors (JP)