Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pipeline does not complete variant calling when staring from the '--step prepare_recalibration' #763

Closed
senzhaocode opened this issue Sep 28, 2022 · 10 comments
Labels
bug Something isn't working

Comments

@senzhaocode
Copy link

Description of the bug

Hello Sarek developer,

I am using Sarek pipeline (v3.0.1) to do somatic variant calling, and I started an analysis from '--step prepare_recalibration' using bam file after mark duplicates (e.g. *.md.bam); - see the command line in attached figure.

Pipeline was running normally and no errors were reported when it done, but it did not finish all steps (including variant calling) and there were no recalibrated bam files generated in output (e.g. *.recal.bam) as well. It looks the pipeline was terminated automatically after creating *.recal.table; - see attached log file.

I could not figure it out, and could you help me a troubleshooting.

Screenshot 2022-09-28 at 11 21 35

Screenshot 2022-09-28 at 11 32 58

Best

Sen

Command used and terminal output

No response

Relevant files

nextflow.log

System information

Nextflow version - 21.11.0
Hardware - HPC
Executor - slurm jobscript
Container engine - Singularity
OS - CentOS Linux
Version of nf-core/sarek - v3.0.1

@senzhaocode senzhaocode added the bug Something isn't working label Sep 28, 2022
@FriederikeHanssen
Copy link
Contributor

can you provide the config + input file you used?

@FriederikeHanssen
Copy link
Contributor

also did the pipeline terminate? what was the exit message?

@senzhaocode
Copy link
Author

config file:
markduplicate.txt

input file:
markduplicates_no_table.csv
As the bam files (after MarkDuplicates) in the input sheet are private, I could not share with you now. Could you replace them with any testing data on your side. Thanks! I just would like to confirm that whether all steps in the pipeline will go through properly.

The terminate log message (it reported the pipeline completed successfully):

Screenshot 2022-09-28 at 13 14 49

@senzhaocode
Copy link
Author

BTW, in the output directory, no 'preprocessing' folder (which saves *.recal.bam) is available. I also screen the intermediate directory 'work' - there was *.recal.bam file of each genomic interval (recalibration processing should be work well), but they did not merge to a final *.recal.bam.

@FriederikeHanssen
Copy link
Contributor

Hi! Just to double check: The log you posted says in the command it started from --step recalibrate and that it is version 3.0 . Is this the proper log for the problem above?

@FriederikeHanssen
Copy link
Contributor

could finally reproduce this 🎉 one step closer to finding the issue

@varun171
Copy link

Hi Friederike,

I am having the same issue with sarek v3.1.2. Do you remember what was the issue and how was it resolved.

Thanks

Varun

nf-evObbXhThPAF9.log

@FriederikeHanssen
Copy link
Contributor

Yes we fixed it in the code :D I can see in the log file, that Mutect2 is not spelled correctly, can you please try with --tools mutect2 instead?

@varun171
Copy link

varun171 commented Mar 28, 2023 via email

@juliawiggeshoff
Copy link

juliawiggeshoff commented Jul 23, 2024

config file: markduplicate.txt

input file: markduplicates_no_table.csv As the bam files (after MarkDuplicates) in the input sheet are private, I could not share with you now. Could you replace them with any testing data on your side. Thanks! I just would like to confirm that whether all steps in the pipeline will go through properly.

The terminate log message (it reported the pipeline completed successfully):

Screenshot 2022-09-28 at 13 14 49

was the issue here that the index file for the bams are saved as .bam.bam.bai? because sarek returns the error that this file does not exist, rightfully so. I have to remove the duplicated .bam and then it works. Using release 3.2.2.

Is this known and has it been fixed in recent releases?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants