Conditional code paths for ReadLikelihoods method that always returns true #4865

davidbenjamin · 2018-06-08T15:54:18Z

ReadLikelihoods has the following method:

public boolean hasFilledLikelihoods() { return true; }

The annotations engine has several code paths that could only be taken if this were to return false. For example:

In DepthPerAlleleBySample

if (likelihoods.hasFilledLikelihoods()) {
    counts = annotateWithLikelihoods(vc, g, alleles, likelihoods);
 } else { 30 lines of code}

In RankSumTest

if (likelihoods.hasFilledLikelihoods()) {
    fillQualsFromLikelihood(vc, likelihoods, refQuals, altQuals, refLoc);
} else { 11 quintuply-nested lines of code}

There's also a bunch of code in all the StrandBiasTest implementations.

Can this method, and all the unused code paths, just be deleted? @droazen @ldgauthier

The text was updated successfully, but these errors were encountered:

droazen · 2018-06-08T15:58:01Z

@davidbenjamin I don't think it can be removed. That method (ReadLikelihoods.hasFilledLikelihoods()) is overridden in UnfilledReadsLikelihoods to return false. It was deliberately added by @jamesemery in his port of VariantAnnotator -- the else clauses you mention are effectively the "VariantAnnotator" code path.

@jamesemery Can you comment?

jamesemery · 2018-06-08T21:07:35Z

@droazen That is correct. It was a necessary step to avoid having to make a class equality check on the likelihoods object itself. @davidbenjamin I am open to suggestions if you have an idea of how better to encapsulate the separation between these two likelihood objects.

davidbenjamin · 2018-06-11T20:07:48Z

Thanks for indulging me on this. To me it seems like UnfilledReadsLikelihoods diverges too much from ReadsLikelihoods to extend it. In effect it's letting ReadsLikelihoods sometimes be a wrapper for something that is not a ReadsLikelihoods.

I haven't worked this out but I would hope that it's possible to construct a ReadsLikelihoods from a pileup. I mean, the idea of pileup calling is that you use just a single base for the likelihoods and not the whole read (via Pair-HMM), so we should be able to fill the likelihoods from the base qualities.

davidbenjamin · 2018-06-12T03:55:30Z

Another point is that the code is already implicitly assigning likelihoods to reads in order to determine which allele each read supports. A method to create a ReadsLikelihoods from a pileup would just make this explicit.

magicDGS · 2018-06-12T14:02:23Z

I have some code to do that in a test project - if you are interested on it, I can submit a PR with my proposal. It will be nice for an idea that I have in mind, and if it is used also in GATK it would have more support (as a single developer, my reviews are not as good as in a team). Just let me know if you wanna port the code to some utility class!

droazen · 2018-06-12T16:36:36Z

@jamesemery Can you weigh in with your thoughts? I seem to recall that there were a lot of nuances surrounding this issue (for example, consistency issues between VariantAnnotator and other annotation-producing tools).

jamesemery · 2018-06-12T22:03:15Z

@davidbenjamin That would work and is already more or less what we are doing for VariantAnnotator. The reason we went through trouble of making two classes in the first place was because the pileup was not entirely adequate without actually doing the assembly and genotyping as indels and a number of other effects cause problems that become expensive unless we are in a tool like haplotype caller.
Currently, UnfilledReadsLiklihoods represents the "likelihood" of each allele by doing just what you said, looking at only the base pileup. This means that VariantAnnoator is currently only able to add annotations to SNPs as a result, just like it did in gatk3 (indeed including some of the same UnifiedGenotyper bugs as well).

davidbenjamin · 2020-01-08T19:04:29Z

Closed by #6172.

davidbenjamin added question coding labels Jun 8, 2018

droazen assigned jamesemery Jun 8, 2018

davidbenjamin closed this as completed Jan 8, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Conditional code paths for ReadLikelihoods method that always returns true #4865

Conditional code paths for ReadLikelihoods method that always returns true #4865

davidbenjamin commented Jun 8, 2018

droazen commented Jun 8, 2018 •

edited

Loading

jamesemery commented Jun 8, 2018

davidbenjamin commented Jun 11, 2018

davidbenjamin commented Jun 12, 2018

magicDGS commented Jun 12, 2018

droazen commented Jun 12, 2018

jamesemery commented Jun 12, 2018

davidbenjamin commented Jan 8, 2020

Conditional code paths for ReadLikelihoods method that always returns true #4865

Conditional code paths for ReadLikelihoods method that always returns true #4865

Comments

davidbenjamin commented Jun 8, 2018

droazen commented Jun 8, 2018 • edited Loading

jamesemery commented Jun 8, 2018

davidbenjamin commented Jun 11, 2018

davidbenjamin commented Jun 12, 2018

magicDGS commented Jun 12, 2018

droazen commented Jun 12, 2018

jamesemery commented Jun 12, 2018

davidbenjamin commented Jan 8, 2020

droazen commented Jun 8, 2018 •

edited

Loading