Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

More excessively micro micro-optimizations #5616

Merged
merged 4 commits into from
Feb 8, 2019
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -193,18 +193,20 @@ public List<VariantContext> calculateRefConfidence(final Haplotype refHaplotype,
final String sampleName = readLikelihoods.getSample(0);

final int globalRefOffset = refSpan.getStart() - activeRegion.getExtendedSpan().getStart();
for ( final ReadPileup pileup : refPileups ) {
// Note, we use an indexed for-loop here because this method has a large impact on the profile of HaplotypeCaller runtime in GVCF mode
final int refPileupsSize = refPileups.size();
for (int i = 0; i < refPileupsSize; i++) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add comment here that the old-school for loop is intentional (for-each was found to show up on the profiler).

final ReadPileup pileup = refPileups.get(i);
final Locatable curPos = pileup.getLocation();
final int offset = curPos.getStart() - refSpan.getStart();

final VariantContext overlappingSite = GATKVariantContextUtils.getOverlappingVariantContext(curPos, variantCalls);
final List<VariantContext> currentPriors = getMatchingPriors(curPos, overlappingSite, VCpriors);
if ( overlappingSite != null && overlappingSite.getStart() == curPos.getStart() ) {
final List<VariantContext> currentPriors = VCpriors.isEmpty() ? Collections.emptyList() : getMatchingPriors(curPos, overlappingSite, VCpriors);
if (overlappingSite != null && overlappingSite.getStart() == curPos.getStart()) {
if (applyPriors) {
results.add(PosteriorProbabilitiesUtils.calculatePosteriorProbs(overlappingSite, currentPriors,
numRefSamplesForPrior, options));
}
else {
} else {
results.add(overlappingSite);
}
} else {
Expand Down Expand Up @@ -420,9 +422,17 @@ protected static boolean isAltAfterAssembly(final PileupElement element, final b
* @param priorList priors within the current ActiveRegion
* @return prior VCs representing the same variant position as call
*/
List<VariantContext> getMatchingPriors(final Locatable curPos, final VariantContext call, final List<VariantContext> priorList) {
private List<VariantContext> getMatchingPriors(final Locatable curPos, final VariantContext call, final List<VariantContext> priorList) {
final int position = call != null ? call.getStart() : curPos.getStart();
return priorList.stream().filter(vc -> position == vc.getStart()).collect(Collectors.toList());
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add a comment that this method is implemented with an old-school for loop instead of streaming for performance reasons (otherwise, someone will probably come along in 2 years and undo this).

final List<VariantContext> matchedPriors = new ArrayList<>(priorList.size());
// NOTE: a for loop is used here because this method ends up being called per-pileup, per-read and using a loop instead of streaming saves runtime
final int priorsListSize = priorList.size();
for (int i = 0; i < priorsListSize; i++) {
if (position == priorList.get(i).getStart()) {
matchedPriors.add(priorList.get(i));
}
}
return matchedPriors;
}

/**
Expand Down