You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
update three different README files:
- paper/ - capitalize sentence descriptions and add some detail
- paper/SE/ - add instructions for how to execute all scripts within SE and remove simulations in progress table
- paper/SE/results/ - embed results images in README file with descriptions of summary output files
Copy file name to clipboardExpand all lines: paper/README.md
+20-16Lines changed: 20 additions & 16 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -21,44 +21,48 @@ paper
21
21
|--CENPK-chipseq
22
22
```
23
23
24
+
These scripts were built to run on a linux server with a PBS job scheduler set up and some of the dependencies installed using some environmental modules and a conda environment for remaining dependencies. You may need to modify these scripts to account for different server setup and configurations.
25
+
26
+
See the [GenoPipe documentation](https://pughlab.mbg.cornell.edu/GenoPipe-docs/) for a list of dependencies needed to run these publication-associated scripts. In addition to these dependencies, you will also need to instal [seqtk](https://github.com/lh3/seqtk).
27
+
24
28
## setup.sh
25
-
runs the scripts to download and format the yeast and human genomes and other reference files for aligning the data
26
-
also indexes the genomes for BWA
29
+
Runs the scripts to download and format the yeast and human genomes and other reference files for aligning the data
30
+
also indexes the genomes for BWA. This must be run before all other scripts to reproduce the publication figures.
27
31
28
32
## scripts
29
-
contains the general scripts for setting up the simulations like downloading and parsing the reference genomes
30
-
also contains the general scripts that several of the higher directory scripts call
33
+
Contains the general scripts for setting up the simulations like downloading and parsing the reference genomes
34
+
also contains the general scripts that several of the higher directory scripts call.
31
35
32
36
## input
33
-
where setup.sh puts the reference genome and the aligner indexes
34
-
also where other reference FASTA files are stored (i.e. R500.fa, 3xFLAG.fa, and the HIV genome sequence)
37
+
Where `setup.sh` puts the reference genome and the aligner indexes. Also where other reference FASTA files are stored
38
+
(i.e. R500.fa, 3xFLAG.fa, and the HIV genome sequence).
35
39
36
40
## db
37
-
where the input database directories are built by setup.sh with variation as appropriate for GenoPipe module, species, and epitope set
41
+
Where the input database directories are built by `setup.sh` with variation as appropriate for GenoPipe module, species, and epitope set.
38
42
39
43
## SyntheticEpitope
40
-
contains the scripts and houses the results of simulations testing EpitopeID
44
+
Contains the scripts and the results of simulations testing EpitopeID. This also includes the mixed contamination simulation tests.
41
45
42
46
## SyntheticStrain
43
-
contains the scripts and houses the results of simulations testing StrainID
47
+
Contains the scripts and the results of simulations testing StrainID.
44
48
45
49
## SyntheticDeletion
46
-
contains the scripts and houses the results of simulations testing DeletionID
50
+
Contains the scripts and the results of simulations testing DeletionID.
47
51
48
52
## ENCODEdata-eGFP
49
-
contains the scripts and information for downloading ENCODE eGFP data to test EpitopeID
53
+
Contains the scripts and information for downloading ENCODE eGFP data to test EpitopeID.
50
54
51
55
## ENCODEdata-CellLines
52
-
contains the scripts and information for downloading ENCODE transcription factor ChIP-seq data to test StrainID
56
+
Contains the scripts and information for downloading ENCODE transcription factor ChIP-seq data to test StrainID.
53
57
54
58
## HIV_samples
55
-
contains the scripts and information for downloading, processing, and running EpitopeID on the Bosque et al, 2017 dataset for localizing HIV genome insertions
59
+
Contains the scripts and information for downloading, processing, and running EpitopeID on the Bosque et al, 2017 dataset for localizing HIV genome insertions.
56
60
57
61
## YKOC-wgs
58
-
contains the scripts and information for downloading, processing, and running DeletionID on the Puddu et al, 2019 dataset for identifying deletions
62
+
Contains the scripts and information for downloading, processing, and running DeletionID on the Puddu et al, 2019 dataset for identifying deletions.
59
63
60
64
## BY4742-chipseq
61
-
contains the scripts and information for downloading, processing, and running StrainID on the BAM files
65
+
Contains the scripts and information for downloading, processing, and running StrainID on the BAM files.
62
66
63
67
## CENPK-chipseq
64
-
contains the scripts and information for downloading, processing, and running StrainID on the BAM files
68
+
cContains the scripts and information for downloading, processing, and running StrainID on the BAM files.
0 commit comments