-
Notifications
You must be signed in to change notification settings - Fork 353
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mutect2 error: java.lang.IllegalArgumentException: log10 p: Values must be non-infinite and non-NAN #2829
Comments
Hello! Sorry, maybe a little on the off-topic side, however, I'm curious:
Sergey |
Hi Sergey, I'm not sure about the polymorphism level in the yeast population. The strain we're using is diploid and we found evidence for high allelism in the two samples that we're trying to analyze. I already ran HC on them and, in addition, I wanted to try out Mutect2 to see if there are any low-frequency variants present in one sample and not the other. Alex |
Hi Alex! Thanks for reporting the issue and expanding the usage of bcbio into yeast genomics! Unfortunately, it is not easy to debug such a unique case: single end reads on a custom yeast reference coupled with somatic(cancer) variant calling with mutect2. (Wow!) Hopefully, somebody will reproduce it in a human context and find the source of the issue. Some thoughts for now - have you tried those methods to increase sensititvity?
However, I hope you remember that gain in sensitivity does not come without loss in specificity (i.e. you will be getting more false positive trash calls). Buy the way, have you done any measurements of variant calling precision and sensitivity in yeast with bcbio? (See the examples here, https://github.com/bcbio/bcbio_validations). If you have done it, we might publish the results among the other validations, underlining this unique use case of bcbio. Sergey |
Thanks much for this report and apologies about the issue. This looks related to #2832 and we were able to reproduce when the MuTect2 associated |
Thanks a lot for the update. The MuTect2 issue went away after upgrading to the latest development version. |
Hi Brad. I updated to the latest version of bcbio and am still getting this error. I am doing tumor only calling using mutect2 on paired-end data. Version of bcbio is 1.1.6a and version of GATK is v4.1.2.0 Here's the variants from the problematic vcf - #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT TUMOR_SAMPLE Here's the output from the stats file for the same - statistic value Let me know if you need any further information to help debug this. Thanks, Moiz |
Hi Moiz, Could you post up the command that is failing along with the broken VCF file so we can have a mini test set while we try to fix this? |
Hi Rory, Here's the command that I see in the bcbio logs - java -Dsamjdk.use_async_io_read_samtools=false -Dsamjdk.use_async_io_write_samtools=true -Dsamjdk.use_async_io_write_tribble=false -Dsamjdk.compression_level=2 -Xms681m -Xmx3181m -XX:+UseSerialGC -Djava.io.tmpdir=bcbio_20180410/work_novoalign/bcbiotx/tmp3gkzrbvm -jar bcbio_1.1.5/anaconda/share/gatk4-4.1.2.0-1/gatk-package-4.1.2.0-local.jar FilterMutectCalls --reference bcbio_1.1.5/genomes/Hsapiens/hs37d5/seq/hs37d5.fa --variant Tumor-raw.vcf.gz --output Tumor-raw-filt.vcf.gz Let me know if you need anything else. Thanks, Moiz |
Thanks, could you post up the VCF file that you sampled from above, but attach the actual gzipped file? That way we can use that to test against. |
Hi Rory, That's all the variants in the vcf. I can attach the actual vcf if you need it. |
Yup! The actual VCF would be helpful, just so we'll have the header and it wil be formatted right and all of that. |
Hi Rory, I have attached the bgzipped vcf file. Let me know if you need anything else. Thanks, Moiz |
Sorry, could you post the error that you are seeing as well? The bcbio-nextgen-debug.log file would be helpful to look at. |
Hi Rory, The sample identifiers in the debug log file have PHI in them that I cannot share. It'll take me a lot of time to de-identify and validate it. I am attaching the stack trace from where the error occurred. Let me know if this is enough. Traceback (most recent call last): |
Thanks, are there errors further up in the debug log? This is the generic
“this command failed” error, the root cause should be further up.
…On Tue, Jun 11, 2019 at 5:15 PM Moiz Bootwalla ***@***.***> wrote:
Hi Rory,
The sample identifiers in the debug log file have PHI in them that I
cannot share. It'll take me a lot of time to de-identify and validate it. I
am attaching the stack trace from where the error occurred. Let me know if
this is enough.
Traceback (most recent call last):
File "bcbio_nextgen/1.1.5/bin/bcbio_nextgen.py", line 238, in
main(**kwargs)
File "bcbio_nextgen/1.1.5/bin/bcbio_nextgen.py", line 46, in main
run_main(**kwargs)
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/bcbio/pipeline/main.py",
line 53, in run_main
fc_dir, run_info_yaml)
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/bcbio/pipeline/main.py",
line 89, in _run_toplevel
for xs in pipeline(config, run_info_yaml, parallel, dirs, samples):
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/bcbio/pipeline/main.py",
line 152, in variant2pipeline
samples = genotype.parallel_variantcall_region(samples, run_parallel)
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/bcbio/variation/genotype.py",
line 208, in parallel_variantcall_region
"vrn_file", ["region", "sam_ref", "config"]))
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/bcbio/distributed/split.py",
line 35, in grouped_parallel_split_combine
final_output = parallel_fn(parallel_name, split_args)
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/bcbio/distributed/ipython.py",
line 137, in run
for data in view.map_sync(fn, items, track=False):
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/client/view.py",
line 344, in map_sync
return self.map(f,*sequences,**kwargs)
File
"<bcbio_1.1.5/anaconda/lib/python3.6/site-packages/decorator.py:decorator-gen-140>",
line 2, in map
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/client/view.py",
line 52, in sync_results
ret = f(self, *args, **kwargs)
File
"<bcbio_1.1.5/anaconda/lib/python3.6/site-packages/decorator.py:decorator-gen-139>",
line 2, in map
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/client/view.py",
line 37, in save_ids
ret = f(self, *args, **kwargs)
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/client/view.py",
line 1114, in map
return pf.map(*sequences)
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/client/remotefunction.py",
line 299, in map
return self(*sequences, __ipp_mapping=True)
File
"<bcbio_1.1.5/anaconda/lib/python3.6/site-packages/decorator.py:decorator-gen-122>",
line 2, in *call*
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/client/remotefunction.py",
line 80, in sync_view_results
return f(self, *args, **kwargs)
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/client/remotefunction.py",
line 285, in *call*
return r.get()
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/client/asyncresult.py",
line 169, in get
raise self.exception()
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/client/asyncresult.py",
line 228, in _resolve_result
results = error.collect_exceptions(results, self._fname)
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/error.py",
line 233, in collect_exceptions
raise e
File
"bcbio_1.1.5/anaconda/lib/python3.6/site-packages/ipyparallel/error.py",
line 231, in collect_exceptions
raise CompositeError(msg, elist)
ipyparallel.error.CompositeError: one or more exceptions from call to
method: variantcall_sample
[20:apply]: CalledProcessError: Command 'set -o pipefail; unset JAVA_HOME
&& export PATH=bcbio_1.1.5/anaconda/bin:"$PATH" && gatk --java-options
'-Xms681m -Xmx3181m -XX:+UseSerialGC
-Djava.io.tmpdir=bcbio_20180410/work_novoalign/bcbiotx/tmp3gkzrbvm' Mutect2
--annotation ClippingRankSumTest --annotation DepthPerSampleHC --reference
bcbio_1.1.5/genomes/Hsapiens/hs37d5/seq/hs37d5.fa --annotation
MappingQualityRankSumTest --annotation MappingQualityZero --annotation
QualByDepth --annotation ReadPosRankSumTest --annotation RMSMappingQuality
--annotation FisherStrand --annotation MappingQuality --annotation
DepthPerAlleleBySample --annotation Coverage --read-validation-stringency
LENIENT -I bcbio_20180410/work_novoalign/align/Tumor_Sample/Tumor-sort.bam
--tumor-sample TUMOR_SAMPLE -L
bcbio_20180410/work_novoalign/mutect2/13/Tumor-regions.bed
--interval-set-rule INTERSECTION -O
bcbio_20180410/work_novoalign/mutect2/13/Tumor-raw.vcf.gz && sed -i
's/callable\t0.0/callable\t1.0/'
bcbio_20180410/work_novoalign/mutect2/13/Tumor-raw.vcf.gz.
stats && unset JAVA_HOME && export PATH=bcbio_1.1.5/anaconda/bin:"$PATH"
&& gatk --java-options '-Xms681m -Xmx3181m -XX:+UseSerialGC
-Djava.io.tmpdir=bcbio_20180410/work_novoalign/bcbiotx/tmp3gkzrbvm'
FilterMutectCalls --reference
bcbio_1.1.5/genomes/Hsapiens/hs37d5/seq/hs37d5.fa --variant
bcbio_20180410/work_novoalign/mutect2/13/Tumor-raw.vcf.gz --output
bcbio_20180410/work_novoalign/bcbiotx/tmp3gkzrbvm/Tumor-raw-filt.vcf.gz
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub
<#2829?email_source=notifications&email_token=AADFG6X7K3UOU64ETECICZTP2AIX5A5CNFSM4HOTY5K2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODXOQZBQ#issuecomment-501025926>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AADFG6UQ2ZBJPXZV6JY4RD3P2AIX5ANCNFSM4HOTY5KQ>
.
|
Hi Rory, Is this what you were looking for - [June 5, 2019 2:50:08 PM PDT] org.broadinstitute.hellbender.tools.walkers.mutect.filtering.FilterMutectCalls done. Elapsed time: 0.09 minutes. |
@roryk any updates on this issue? Kindly let me know if you need any more details. |
Moiz; |
Hi Brad, I tested this and confirm that it works fine now. I will let you know if this issue crops up again. Thank you so much for fixing this! |
Thank you for following up Moiz! |
Previous default of 4 did not work for all cases, so continue this hack-y workaround. Fixes #2829
Moiz; |
Hello,
I'm running into an error with mutect2 (
java.lang.IllegalArgumentException: log10 p: Values must be non-infinite and non-NAN
) when using a pair of FASTQ files with yeast sequence reads. Other callers (vardict, freebayes, strelka2) seem to work fine with these files. Haplotypecaller also works fine in germline VC mode.I have GATK 4.1.1.0. I'm guessing these GATK issues is related: broadinstitute/gatk#5821 and broadinstitute/gatk#5880, but I'm not sure how I can get mutect2 to work with these files. Any idea what could be causing this?
Note: I'm using single-end reads (this came up in a separate issue: broadinstitute/gatk#5553 (comment)) and I've set up two batches, one treating a FASTQ file as normal and the other as tumor, and the second one with the samples reversed.
This is the stack trace:
And this is the YAML template that I use:
The text was updated successfully, but these errors were encountered: