Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ValidateVariants gVCF mode error "covers a position previously traversed" #6023

Closed
bhanugandham opened this issue Jul 1, 2019 · 1 comment · Fixed by #6028
Closed

ValidateVariants gVCF mode error "covers a position previously traversed" #6023

bhanugandham opened this issue Jul 1, 2019 · 1 comment · Fixed by #6028

Comments

@bhanugandham
Copy link
Contributor

bhanugandham commented Jul 1, 2019

User report:

ValidateVariants causes the error:

java -Xms32G -Xmx32G -jar /data/biosoftware/GATK/gatk-4.1.2.0/gatk-package-4.1.2.0-local.jar ValidateVariants -R ~/reference/reference.fasta -V $i -gvcf

And it causes the following error for all my files:

***********************************************************************

A USER ERROR has occurred: In a GVCF all records must ordered. Record: [VC Unknown @ Super-Scaffold_2:1-4 Q. of type=SYMBOLIC alleles=[G*, <NON_REF>] attr={END=4} filters= covers a position previously traversed.

***********************************************************************

This doesn't cause the error:

java -Xms32G -Xmx32G -jar /data/biosoftware/GATK/gatk-4.1.2.0/gatk-package-4.1.2.0-local.jar ValidateVariants -R ~/reference/reference.fasta -V $i

This Issue was generated from your [forums]
[forums]: https://gatkforums.broadinstitute.org/gatk/discussion/comment/59104#Comment_59104

@bhanugandham
Copy link
Contributor Author

@ldgauthier and @jonn-smith

As discussed during the gatk office hours, this error traces back to ValidateVariants in GVCF mode being unable to handle variants with a lower start position than the previous contig.
Example:
Super-Scaffold_1 9238114 . T <NON_REF> . . END=9238123 GT:DP:GQ:MIN_DP:PL 0/0:12:0:11:0,0,0
Super-Scaffold_2 1 . G <NON_REF> . . END=4 GT:DP:GQ:MIN_DP:PL 0/0:31:93:31:0,93,1141

@droazen droazen added this to the Engine-Q2-2019 milestone Jul 1, 2019
michaelgatzen added a commit that referenced this issue Jul 3, 2019
- Extracted the order validation for GVCF files into a separate method and included
a check to reset the counter when a new contig is found. Contigs have to
occur in continuous blocks; validation for files in which contigs occur
alternatingly is not supported.
- Added a set of integration tests for GVCF files with two and three contigs.

Fixes #6023
michaelgatzen added a commit that referenced this issue Jul 3, 2019
- Extracted the order validation for GVCF files into a separate method and included
a check to reset the counter when a new contig is found. Contigs have to
occur in continuous blocks; validation for files in which contigs occur
alternatingly is not supported.
- Added a set of integration tests for GVCF files with two and three contigs.

Fixes #6023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants