Adaptive pruning option for local assembly #5473

davidbenjamin · 2018-12-02T07:43:29Z

Closes #4867.

@takutosato Here it is. I'm not quite ready to make it the M2 default, but it looks really good.

@meganshand I have tested it on every mixture in your workspace and results look very similar to the previous hand-tuned pruning results. I'm hoping it's good enough to become best practices for mitochondria and would appreciate if you gave it a shot. You have the right to review if you wish but there's no pressure to do so.

@ldgauthier HaplotypeCaller might also benefit from this. In particular, I wonder about #3697. I'll test it out.

codecov-io · 2018-12-02T08:26:54Z

Codecov Report

Merging #5473 into master will increase coverage by 0.088%.
The diff coverage is 94.631%.

@@              Coverage Diff               @@
##              master    #5473       +/-   ##
==============================================
+ Coverage     86.982%   87.07%   +0.088%     
- Complexity     31186    31244       +58     
==============================================
  Files           1914     1922        +8     
  Lines         144117   144210       +93     
  Branches       15933    15916       -17     
==============================================
+ Hits          125356   125564      +208     
+ Misses         13006    12872      -134     
- Partials        5755     5774       +19

| Impacted Files | Coverage Δ | Complexity Δ | |
|---|---|---|---|
| ...ools/walkers/haplotypecaller/graphs/BaseGraph.java | `82.49% <ø> (-0.135%)` | `93 <0> (-1)` | |
| ...lotypecaller/readthreading/ReadThreadingGraph.java | `88.608% <ø> (ø)` | `144 <0> (ø)` | ⬇️ |
| ...walkers/haplotypecaller/HaplotypeCallerEngine.java | `78.125% <0%> (ø)` | `74 <0> (ø)` | ⬇️ |
| ...der/tools/walkers/mutect/M2ArgumentCollection.java | `88% <100%> (+0.5%)` | `9 <1> (+1)` | ⬆️ |
| ...kers/haplotypecaller/AssemblyBasedCallerUtils.java | `77.869% <100%> (+1.125%)` | `35 <0> (ø)` | ⬇️ |
| .../readthreading/ReadThreadingAssemblerUnitTest.java | `98.712% <100%> (-0.005%)` | `38 <0> (ø)` | |
| ...der/tools/walkers/haplotypecaller/graphs/Path.java | `95.161% <100%> (+0.424%)` | `26 <2> (+2)` | ⬆️ |
| ...ecaller/AssemblyBasedCallerArgumentCollection.java | `100% <100%> (ø)` | `3 <1> (+2)` | ⬆️ |
| ...hellbender/tools/walkers/mutect/Mutect2Engine.java | `90.116% <100%> (+0.235%)` | `65 <2> (+2)` | ⬆️ |
| ...pecaller/readthreading/ReadThreadingAssembler.java | `68.539% <100%> (+0.041%)` | `51 <2> (-1)` | ⬇️ |

... and 39 more

meganshand · 2018-12-02T13:38:24Z

@davidbenjamin Thank you for doing this! Can you share the results you got on the mixtures? I'd be happy to try out this branch on our technical replicates next week.

ldgauthier · 2018-12-03T14:34:57Z

@davidbenjamin Fantastic! Have you talked to Sarah yet? Should I pass along a jar?

davidbenjamin · 2018-12-03T15:10:52Z

@ldgauthier I gave her one about ten days ago. It looks fine so far on her RNA data.

davidbenjamin · 2018-12-03T15:15:06Z

@meganshand I ran the "Full Pipeline" workflows in a clone of your FC workspace: https://portal.firecloud.org/#workspaces/broad-firecloud-dsde/copy-of-megans-m2-mito-validations. I did not run any of the things that generate graphs because they were harder for me to understand. To compare the new results to your previous ones, I took all variants that were either PASS or had only the contamination filter applied, extracted just the locus and alleles columns, then manually inspected the diff. For the 5% and 50% spike-ins there were usually no differences at all, while for the 1% spike-in the difference was usually 2-5 variants that straddled the LOD threshold.
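
For anyone who wants to repeat this kind of comparison without eyeballing a raw diff, here is a minimal sketch of the procedure described above. It is not the script that was actually used: the class and method names are hypothetical, the filter name `contamination` is an assumption about what appears in the FILTER column, and the input is assumed to be plain tab-delimited VCF text with the standard column order (CHROM, POS, ID, REF, ALT, QUAL, FILTER).

```java
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper, for illustration only; not part of GATK.
final class VariantSetDiff {
    // Assumed filter name; check the FILTER lines in the VCF header.
    static final String CONTAMINATION_FILTER = "contamination";

    // Keep records that are PASS or carry the contamination filter and nothing else.
    static boolean isKept(String filterField) {
        return filterField.equals("PASS") || filterField.equals(CONTAMINATION_FILTER);
    }

    // Reduce each kept record to its locus and alleles, e.g. "chrM:73 A>G".
    static Set<String> keptSites(List<String> vcfLines) {
        Set<String> sites = new TreeSet<>();
        for (String line : vcfLines) {
            if (line.isEmpty() || line.startsWith("#")) {
                continue; // skip header and blank lines
            }
            String[] fields = line.split("\t");
            if (fields.length < 7) {
                continue; // not a complete VCF record
            }
            if (isKept(fields[6])) {
                sites.add(fields[0] + ":" + fields[1] + " " + fields[3] + ">" + fields[4]);
            }
        }
        return sites;
    }

    // Sites present in a but absent from b.
    static Set<String> onlyIn(Set<String> a, Set<String> b) {
        Set<String> result = new TreeSet<>(a);
        result.removeAll(b);
        return result;
    }

    public static void main(String[] args) {
        // Toy records standing in for the old (hand-tuned) and new (adaptive) call sets.
        List<String> oldCalls = Arrays.asList(
                "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
                "chrM\t73\t.\tA\tG\t.\tPASS\t.",
                "chrM\t150\t.\tC\tT\t.\tcontamination\t.");
        List<String> newCalls = Arrays.asList(
                "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
                "chrM\t73\t.\tA\tG\t.\tPASS\t.",
                "chrM\t310\t.\tT\tC\t.\tPASS\t.");
        Set<String> before = keptSites(oldCalls);
        Set<String> after = keptSites(newCalls);
        System.out.println("Only in old: " + onlyIn(before, after));
        System.out.println("Only in new: " + onlyIn(after, before));
    }
}
```

Reducing each record to a locus-plus-alleles key means that annotation changes between runs do not register as differences; only gained or lost calls do.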

ldgauthier · 2018-12-03T15:31:07Z

Great, thanks!

davidbenjamin · 2018-12-05T05:06:58Z

@takutosato Based on all of our validations I added a commit to make this the default for M2. Because M2 shares a nested argument collection with HaplotypeCaller, this was pretty awkward. Louis told me this was the best among bad options.

takutosato

This is a very cool method. Just a couple very minor comments. Looks good otherwise.

takutosato · 2018-12-04T21:15:09Z

.../org/broadinstitute/hellbender/tools/walkers/haplotypecaller/graphs/AdaptiveChainPruner.java

+
+import java.util.*;
+import java.util.stream.Collectors;
+import java.util.stream.IntStream;


some of these imports are not used

takutosato · 2018-12-04T21:15:35Z

.../org/broadinstitute/hellbender/tools/walkers/haplotypecaller/graphs/AdaptiveChainPruner.java

+        return FastMath.max(leftLogOdds, rightLogOdds);
+    }
+
+    // is the chain


either remove or add a doc

takutosato · 2018-12-05T19:17:42Z

...adinstitute/hellbender/tools/walkers/haplotypecaller/graphs/AdaptiveChainPrunerUnitTest.java

+
+import static org.testng.Assert.*;
+
+public class AdaptiveChainPrunerUnitTest {


Either remove or add tests here

Deleted; the tests are all in ChainPrunerUnitTest.

davidbenjamin · 2018-12-05T22:10:41Z

back to @takutosato

davidbenjamin · 2018-12-05T23:21:39Z

Oh wait, approving review, got it.

davidbenjamin added the community-request and Mutect labels Dec 2, 2018

davidbenjamin assigned takutosato Dec 2, 2018

davidbenjamin requested a review from takutosato December 2, 2018 07:43

refactored LowWeightChainPruner for extensibility

5e41bc4

davidbenjamin force-pushed the db_pruning branch from a11aa05 to bb66388 December 3, 2018 15:57

davidbenjamin added 2 commits December 3, 2018 21:42

docs

d1a6ebc

Adaptive pruning

25da765

davidbenjamin force-pushed the db_pruning branch from bb66388 to 56849e1 December 5, 2018 05:03

Command line fun

ce8051c

davidbenjamin force-pushed the db_pruning branch from 56849e1 to ce8051c December 5, 2018 05:04

serial version Id to fix compile warning

21c292f

takutosato approved these changes Dec 5, 2018

code review

2a9ad44

davidbenjamin merged commit 079d34a into master Dec 5, 2018

davidbenjamin deleted the db_pruning branch December 5, 2018 23:21
