The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
Initial release of nf-core/bactmap, created with the nf-core template.
The pipeline is composed of the following steps:
- Index reference fasta file (
BWA index) - Trim reads for quality and adapter sequence (Optional) (
fastp) - Estimate genome size (
mash sketch) - Downsample fastq files (Optional) (
Rasusa) - Variant calling
- Read mapping (
BWA mem) - Sort and index alignments (
SAMtools) - Call and filter variants (
BCFtools) - Convert filtered bcf to pseudogenome fasta (
vcf2pseudogenome.py)
- Read mapping (
- Create alignment from pseudogenome by concatenating fasta files having first checked that the sample sequences are high quality(
calculate_fraction_of_non_GATC_bases.py) - Remove recombination (Optional) (
Gubbins) - Extract variant sites from alignment (
SNP-sites) - Construct phylogenetic tree (Optional)