The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
Initial release of nf-core/bactmap, created with the nf-core template.
The pipeline is composed of the following steps:
- Index reference fasta file (
BWA index
) - Trim reads for quality and adapter sequence (Optional) (
fastp
) - Estimate genome size (
mash sketch
) - Downsample fastq files (Optional) (
Rasusa
) - Variant calling
- Read mapping (
BWA mem
) - Sort and index alignments (
SAMtools
) - Call and filter variants (
BCFtools
) - Convert filtered bcf to pseudogenome fasta (
vcf2pseudogenome.py
)
- Read mapping (
- Create alignment from pseudogenome by concatenating fasta files having first checked that the sample sequences are high quality(
calculate_fraction_of_non_GATC_bases.py
) - Remove recombination (Optional) (
Gubbins
) - Extract variant sites from alignment (
SNP-sites
) - Construct phylogenetic tree (Optional)