@@ -162,14 +162,29 @@ in the bin/ directory.
162162
163163 -i the path to your input fasta alignment.
164164 -o the path to your working/output directory.
165- -r the common name of your reference species (more below).
165+ -r the build or common name of your reference species (more below).
166166
167167# Optional Arguments:
168168
169169 --ucsc This will invoke 00_convertHeader.py, which will convert the
170170 headers from UCSC fasta files so they only contain build
171171 names and gene IDs. This does not need to be run on Stich Gene
172172 Blocks output.
173+
174+ --retainStops This will tell AlignmentProcessor to retain sequences
175+ that contain internal stop codons. By default,
176+ sequences with internal stop codons will be removed
177+ from the analysis as they may bias the results.
178+
179+ --changeNames Tells the program to change genome build names to
180+ commom names (more below).
181+
182+ -% a decimal value specifying the minimum percentage of reads
183+ that must remain after replacing unknown codons with gaps
184+ (Default = 0.5). You may wish to use a lower threshold
185+ for highly diverged species or for low quality genomes.
186+
187+ # Ka/Ks_Calculator/CodeML related arguments
173188
174189 --axt/phylip Specifies which format to convert the files into
175190 (axt format for KaKs_Calculator; phylip for CodeML
@@ -186,15 +201,10 @@ in the bin/ directory.
186201 This file must be titled "codeml.ctl" (the default
187202 name given by PAML).
188203
189- --retainStops This will tell AlignmentProcessor to retain sequences
190- that contain internal stop codons. By default,
191- sequences with internal stop codons will be removed
192- from the analysis as they may bias the results.
193-
194- -% a decimal value specifying the minimum percentage of reads
195- that must remain after replacing unknown codons with gaps
196- (Default = 0.5). You may wish to use a lower threshold
197- for highly diverged species or for low quality genomes.
204+ -n if "--codeml" is selected, you may specify the number of CPUs
205+ to run CodeML. CodeML itself cannot be parallelized, but
206+ AlignmentProcessor can call multiple instances of CodeML to
207+ shorten overall run time. (Default = 1)
198208
199209# Additional Commands
200210
@@ -431,7 +441,7 @@ memory.
431441Change directory into the AlignmentProcessor folder. Paste the followig into
432442a terminal:
433443
434- python AlignmentProcessor.py --axt --kaks --ucsc -r Green_anole \
444+ python AlignmentProcessor.py --axt --kaks --ucsc -r anoCar2 \
435445-i kaksTest.fa -o test/
436446
437447This will return a text file with 11 lines.
@@ -441,7 +451,7 @@ The test directory already contains sample CodeML control and tree files, so
441451all you need to do is change into the AlignmentProcessor direcotry and paste
442452the following:
443453
444- python AlignmentProcessor.py --phylip --codeml --ucsc -r Green_anole \
454+ python AlignmentProcessor.py --phylip --codeml --ucsc -n 2 -r anoCar2 \
445455-i codemlTest.fa -o test/
446456
447457There should be 8 .mlc files in the 07_codeml directory.
0 commit comments