You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: config/README.md
+2-31Lines changed: 2 additions & 31 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,6 +9,7 @@ A Snakemake workflow for the post-processing of microbial genome assemblies.
9
9
3.[bakta](https://github.com/oschwengers/bakta), a fast, alignment-free annotation tool. Note: Bakta will automatically download its companion database from zenodo (light: 1.5 GB, full: 40 GB)
10
10
3. Create a QC report for the assemblies using [Quast](https://github.com/ablab/quast)
11
11
4. Create a pangenome analysis (orthologs/homologs) using [Panaroo](https://gthlab.au/panaroo/)
12
+
5. Compute pairwise average nucleotide identity (ANI) between the assemblies using [FastANI](https://github.com/ParBLiSS/FastANI) and plot a phylogenetic tree based on the ANI distances.
12
13
13
14
## Running the workflow
14
15
@@ -22,34 +23,4 @@ The samplesheet table has the following layout:
**Note:** Pangenome analysis with `Panaroo` requires at least two samples.
26
-
27
-
### Parameters
28
-
29
-
This table lists all parameters that can be used to run the workflow.
30
-
31
-
| Parameter | Type | Details | Default |
32
-
|:---|:---|:---|:---|
33
-
|**samplesheet**| string | Path to the sample sheet file in csv format ||
34
-
|**tool**| array[string]| Annotation tool to use (one of `prokka`, `pgap`, `bakta`) ||
35
-
|**pgap**|| PGAP configuration object ||
36
-
| bin | string | Path to the PGAP script ||
37
-
| use_yaml_config | boolean | Whether to use YAML configuration for PGAP |`False`|
38
-
|_prepare_yaml_files_|| Paths to YAML templates for PGAP ||
39
-
| generic | string | Path to the generic YAML configuration file ||
40
-
| submol | string | Path to the submol YAML configuration file ||
41
-
|**prokka**|| Prokka configuration object ||
42
-
| center | string | Center name for Prokka annotation (used in sequence IDs) ||
43
-
| extra | string | Extra command-line arguments for Prokka |`--addgenes`|
44
-
|**bakta**|| Bakta configuration object ||
45
-
| download_db | string | Bakta database type (`full`, `light`, or `none`) |`light`|
46
-
| existing_db | string | Path to an existing Bakta database (optional). Needs to be combined with `download_db='none'`|`--keep-contig-headers --compliant`|
47
-
| extra | string | Extra command-line arguments for Bakta ||
48
-
|**quast**|| QUAST configuration object ||
49
-
| reference_fasta | string | Path to the reference genome for QUAST ||
50
-
| reference_gff | string | Path to the reference annotation for QUAST |
51
-
| extra | string | Extra command-line arguments for QUAST ||
52
-
|**panaroo**|| Panaroo configuration object ||
53
-
| remove_source | string | Source types to remove in Panaroo (regex supported) |`cmsearch`|
54
-
| remove_feature | string | Feature types to remove in Panaroo (regex supported) |`tRNA\|rRNA\|ncRNA\|exon\|sequence_feature`|
55
-
| extra | string | Extra command-line arguments for Panaroo |`--clean-mode strict --remove-invalid-genes`|
26
+
**Note:** Pangenome analysis with `Panaroo` and pairwise similarity analysis with `FastANI` requires at least two samples.
0 commit comments