Skip to content

Commit 20a6fc9

Browse files
committed
fix: schema and README update
1 parent fdf4adc commit 20a6fc9

3 files changed

Lines changed: 28 additions & 33 deletions

File tree

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
# snakemake-assembly-postprocessing
22

33
[![Snakemake](https://img.shields.io/badge/snakemake-≥8.24.1-brightgreen.svg)](https://snakemake.github.io)
4-
[![GitHub actions status](https://github.com/MPUSP/snakemake-assembly-postprocessing/actions/workflows/main.yml/badge.svg)](https://github.com/MPUSP/snakemake-assembly-postprocessing/actions/workflows/main.yml)
4+
[![GitHub Actions](https://github.com/MPUSP/snakemake-assembly-postprocessing/actions/workflows/snakemake-tests.yml/badge.svg)](https://github.com/MPUSP/snakemake-assembly-postprocessing/actions/workflows/snakemake-tests.yml)
55
[![run with conda](http://img.shields.io/badge/run%20with-conda-3EB049?labelColor=000000&logo=anaconda)](https://docs.conda.io/en/latest/)
66
[![run with apptainer](https://img.shields.io/badge/run%20with-apptainer-1D355C.svg?labelColor=000000)](https://apptainer.org/)
77
[![workflow catalog](https://img.shields.io/badge/Snakemake%20workflow%20catalog-darkgreen)](https://snakemake.github.io/snakemake-workflow-catalog/docs/workflows/MPUSP/snakemake-assembly-postprocessing)

config/README.md

Lines changed: 0 additions & 30 deletions
Original file line numberDiff line numberDiff line change
@@ -23,33 +23,3 @@ The samplesheet table has the following layout:
2323
| ... | ... | ... | ... | ... |
2424

2525
**Note:** Pangenome analysis with `Panaroo` requires at least two samples.
26-
27-
### Parameters
28-
29-
This table lists all parameters that can be used to run the workflow.
30-
31-
| Parameter | Type | Details | Default |
32-
|:---|:---|:---|:---|
33-
| **samplesheet** | string | Path to the sample sheet file in csv format | |
34-
| **tool** | array[string] | Annotation tool to use (one of `prokka`, `pgap`, `bakta`) | |
35-
| **pgap** | | PGAP configuration object | |
36-
| bin | string | Path to the PGAP script | |
37-
| use_yaml_config | boolean | Whether to use YAML configuration for PGAP | `False` |
38-
| _prepare_yaml_files_ | | Paths to YAML templates for PGAP | |
39-
| generic | string | Path to the generic YAML configuration file | |
40-
| submol | string | Path to the submol YAML configuration file | |
41-
| **prokka** | | Prokka configuration object | |
42-
| center | string | Center name for Prokka annotation (used in sequence IDs) | |
43-
| extra | string | Extra command-line arguments for Prokka | `--addgenes` |
44-
| **bakta** | | Bakta configuration object | |
45-
| download_db | string | Bakta database type (`full`, `light`, or `none`) | `light` |
46-
| existing_db | string | Path to an existing Bakta database (optional). Needs to be combined with `download_db='none'` | `--keep-contig-headers --compliant` |
47-
| extra | string | Extra command-line arguments for Bakta | |
48-
| **quast** | | QUAST configuration object | |
49-
| reference_fasta | string | Path to the reference genome for QUAST | |
50-
| reference_gff | string | Path to the reference annotation for QUAST |
51-
| extra | string | Extra command-line arguments for QUAST | |
52-
| **panaroo** | | Panaroo configuration object | |
53-
| remove_source | string | Source types to remove in Panaroo (regex supported) | `cmsearch` |
54-
| remove_feature | string | Feature types to remove in Panaroo (regex supported) | `tRNA\|rRNA\|ncRNA\|exon\|sequence_feature` |
55-
| extra | string | Extra command-line arguments for Panaroo | `--clean-mode strict --remove-invalid-genes` |

config/schemas/config.schema.yml

Lines changed: 27 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,12 +5,16 @@ type: object
55
properties:
66
samplesheet:
77
type: string
8-
description: Path to the sample sheet file
8+
description: Path to the sample sheet in CSV format
9+
default: "config/samples.csv"
910
tool:
1011
type: array
11-
description: Annotation tool to use
12+
description: Annotation tool(s) to use
13+
default: ["prokka"]
1214
items:
1315
type: string
16+
description: Name of the annotation tool
17+
default: "prokka"
1418
enum:
1519
- prokka
1620
- pgap
@@ -21,18 +25,22 @@ properties:
2125
bin:
2226
type: string
2327
description: Path to the PGAP script
28+
default: "path/to/pgap.py"
2429
use_yaml_config:
2530
type: boolean
2631
description: Whether to use YAML configuration for PGAP
32+
default: true
2733
prepare_yaml_files:
2834
type: object
2935
properties:
3036
generic:
3137
type: string
3238
description: Path to the generic YAML configuration file
39+
default: "config/generic.yaml"
3340
submol:
3441
type: string
3542
description: Path to the submol YAML configuration file
43+
default: "config/submol.yaml"
3644
required:
3745
- generic
3846
- submol
@@ -46,9 +54,11 @@ properties:
4654
center:
4755
type: string
4856
description: Center name for Prokka annotation (used in sequence IDs)
57+
default: ""
4958
extra:
5059
type: string
5160
description: Extra command-line arguments for Prokka
61+
default: "--addgenes"
5262
required:
5363
- center
5464
- extra
@@ -58,12 +68,15 @@ properties:
5868
download_db:
5969
type: string
6070
description: Bakta database type, one of 'full', 'light', or 'none' if existing is used
71+
default: "light"
6172
existing_db:
6273
type: string
6374
description: Path to an existing Bakta database (optional)
75+
default: ""
6476
extra:
6577
type: string
6678
description: Extra command-line arguments for Bakta
79+
default: "--keep-contig-headers --compliant"
6780
required:
6881
- download_db
6982
- existing_db
@@ -74,27 +87,38 @@ properties:
7487
reference_fasta:
7588
type: string
7689
description: Path to the reference genome for QUAST
90+
default: ""
7791
reference_gff:
7892
type: string
7993
description: Path to the reference annotation for QUAST
94+
default: ""
8095
extra:
8196
type: string
8297
description: Extra command-line arguments for QUAST
98+
default: ""
99+
required:
100+
- reference_fasta
101+
- reference_gff
102+
- extra
83103
panaroo:
84104
type: object
85105
properties:
86106
skip:
87107
type: boolean
88108
description: Whether to skip Panaroo analysis
109+
default: false
89110
remove_source:
90111
type: string
91112
description: Source types to remove in Panaroo (regex supported)
113+
default: "cmsearch"
92114
remove_feature:
93115
type: string
94116
description: Feature types to remove in Panaroo (regex supported)
117+
default: "tRNA|rRNA|ncRNA|exon|sequence_feature"
95118
extra:
96119
type: string
97120
description: Extra command-line arguments for Panaroo
121+
default: "--clean-mode strict --remove-invalid-genes"
98122

99123
required:
100124
- samplesheet
@@ -103,3 +127,4 @@ required:
103127
- prokka
104128
- bakta
105129
- quast
130+
- panaroo

0 commit comments

Comments
 (0)