Skip to content

Commit e6abdf8

Browse files
committed
Write docs for new version
1 parent a0028b4 commit e6abdf8

20 files changed

Lines changed: 875 additions & 427 deletions

docs/cli-reference/extractnc.md

Lines changed: 11 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,14 +1,17 @@
11
# ExtractNC
22

33
```
4-
extarctnc [-h] genome gff_file
4+
usage: extract_nc [-h] [-o OUTPUT] genome gff_file
55
66
positional arguments:
7-
genome FASTA containing the genome to extract from
8-
gff_file GFF file containing annotations of CDS and mRNA regions
9-
10-
optional arguments:
11-
-h, --help show this help message and exit
7+
genome FASTA containing the genome to extract from
8+
gff_file GFF file containing annotations of CDS and mRNA
9+
regions
10+
11+
options:
12+
-h, --help show this help message and exit
13+
-o OUTPUT, --output OUTPUT
14+
FASTA file to write output to
1215
```
1316

1417
Input files:
@@ -21,4 +24,5 @@ Output files:
2124

2225
- `noncoding.fasta` - FASTA file containing only the transcribed noncoding regions of the DNA
2326

24-
*Note the method used here will not work if there a coding region is labelled outside a single mRNA region. The program does a check before running to make sure this is true and throws an exception before trying to run if so, to prevent nonsense results from being produced.*
27+
!!! warning
28+
The method used here will not work if there a coding region is labelled outside a single mRNA region. The program does a check before running to make sure this is true and throws an exception before trying to run if so, to prevent nonsense results from being produced.

docs/cli-reference/index.md

Lines changed: 17 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -4,26 +4,24 @@ References of how to use the command line interface of the program and notes on
44
General help message is as follows:
55

66
```
7-
usage: hlsmallrna [-h] [-q] [-C PATH_TO_CONFIG]
8-
{process,sort,extractnc,unitas,targetid,all} ...
7+
usage: hlsmallrna [-h] [-o OUTPUT] [-t THREADS] [-k] [-v] config_file
98
10-
Pipeline to process small RNAs
9+
Pipeline to process small RNAs - version 2
1110
1211
positional arguments:
13-
{process,sort,extractnc,unitas,targetid,all}
14-
process Preprocessing for the RNA
15-
sort Find RNAs that align to a genome and sort them by
16-
length
17-
extractnc Extract the noncoding region from a fasta with a GFF
18-
file
19-
unitas Run unitas on split files and merge results
20-
targetid Align small RNA to a number of genome features to find
21-
out what is targeted
22-
all Run process, sort and unitas one after the other
12+
config_file Path to YAML config file
2313
24-
optional arguments:
25-
-h, --help show this help message and exit
26-
-q, --quiet Suppress output from intermediate commands
27-
-C PATH_TO_CONFIG, --path-to-config PATH_TO_CONFIG
28-
Path to the TOML format config file to use
29-
```
14+
options:
15+
-h, --help show this help message and exit
16+
-o OUTPUT, --output OUTPUT
17+
Directory to write output to (will be created if it
18+
doesn't exist, default: hlsmallrna_output)
19+
-t THREADS, --threads THREADS
20+
Number of threads to run the pipeline on (default: 4)
21+
-k, --keep-files If set, keep the intermediate files, to help debug the
22+
application
23+
-v, --verbose If set, print the output for the intermediate commands
24+
```
25+
26+
!!! note "The Config File"
27+
If you are looking for instructions on writing the config file, use [the relevent page of the walkthrough](../walkthrough/config.md) to help.

docs/cli-reference/process.md

Lines changed: 0 additions & 34 deletions
This file was deleted.

docs/cli-reference/sort.md

Lines changed: 0 additions & 46 deletions
This file was deleted.

docs/cli-reference/targetid.md

Lines changed: 0 additions & 37 deletions
This file was deleted.

docs/cli-reference/unitas.md

Lines changed: 0 additions & 40 deletions
This file was deleted.

docs/index.md

Lines changed: 43 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,46 @@
1+
---
2+
hide:
3+
- navigation
4+
- toc
5+
---
16
# Welcome to the Hunt Lab Small RNA Pipeline Documentation
27

3-
This tool is designed to automate a lot of the common bioinformatics performed on small RNA by the [Hunt Lab](https://www.vickyhuntlab.org/). It is deliberately designed to be highly configurable so it can be easily applied to a wide range of scenarios, though most of our work is on nematode and nematomorph worms.
8+
This documentation is for version 2 of the pipeline, select 1.1.0 from the version dropdown for the version 1 documentation.
49

5-
Use the tabs above or buttons below to navigate this site.
10+
This is a pipeline developed in [Vicky Hunt's Lab](https://www.vickyhuntlab.org/) at the [University of Bath](https://www.bath.ac.uk), UK to analyse small RNA data in a quick and consistent way between projects. It is deliberately designed to be highly configurable so it can be easily applied to a wide range of scenarios, though most of our work is on nematode and nematomorph worms.
11+
12+
# Background
13+
14+
Most of the original methodology was devised by Mona Suleiman, Dominika Lastik and Vicky Hunt for the papers below. This was then developed into a usable pipeline by Kieran Reynolds in 2022 and updated to version 2 in 2025.
15+
16+
> Suleiman, M., Kounosu, A., Murcott, B. et al. piRNA-like small RNAs target transposable elements in a Clade IV parasitic nematode. Sci Rep 12, 10156 (2022). [https://doi.org/10.1038/s41598-022-14247-1](https://doi.org/10.1038/s41598-022-14247-1)
17+
18+
> Lastik, D., Kounosu, A., Dayi, M. et al. Small non-coding RNAs have predicted roles in reproductive biology and transposable element regulation in the parasitic worm Strongyloides venezuelensis. Sci Rep 15, 20608 (2025). [https://doi.org/10.1038/s41598-025-01968-2](https://doi.org/10.1038/s41598-025-01968-2)
19+
20+
# Citation
21+
22+
This pipeline is mainly a combination of existing tools so, doesn't have a paper attached to it but can be referenced with the repository URL and version number. For example:
23+
24+
> This analysis was performed using HuntLab-smallRNA v2.0.0 ([https://github.com/Vicky-Hunt-Lab/HuntLab-smallRNA](https://github.com/Vicky-Hunt-Lab/HuntLab-smallRNA)).
25+
26+
For the internal steps please cite the papers of the tools used:
27+
28+
**Trim**:
29+
30+
> Martin, Marcel. (2011). CUTADAPT removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 17. 10.14806/ej.17.1.200.
31+
32+
**Sort**:
33+
34+
> Langmead, B., Salzberg, S. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357-359 (2012). https://doi.org/10.1038/nmeth.1923
35+
36+
**Unitas**:
37+
38+
> Gebert, D., Hewel, C. & Rosenkranz, D. unitas: the universal tool for annotation of small RNAs. BMC Genomics 18, 644 (2017). https://doi.org/10.1186/s12864-017-4031-9
39+
40+
**Targetid**:
41+
42+
> Langmead, B., Salzberg, S. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357-359 (2012). https://doi.org/10.1038/nmeth.1923
43+
44+
**Enrich**:
45+
46+
> Carlos P Cantalapiedra, Ana Hernández-Plaza, Ivica Letunic, Peer Bork, Jaime Huerta-Cepas, eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale, Molecular Biology and Evolution, Volume 38, Issue 12, December 2021, Pages 5825-5829, https://doi.org/10.1093/molbev/msab293

0 commit comments

Comments
 (0)