Skip to content

Commit 95dc2e6

Browse files
committed
update StrainID with hg38 db and scripts
Add support for hg38 and expanded cell lines on StrainID -add hg38_VCF db with similar cell lines to hg19_VCF -add hg38_ENCODE db with ENCODE cell lines at least 250 variants big -add scripts and refs for building full hg38_DepMap db -update README to describe how Model.csv was downloaded. -minor fix to generate_sacCer3_VariantDB.sh usage typo -.gitignore updated appropriately
1 parent 4fd50d0 commit 95dc2e6

7 files changed

Lines changed: 2382 additions & 1 deletion

File tree

.gitignore

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,3 +2,10 @@
22
EpitopeID/ecoli_EpiID/FASTA_genome/genome.fa.*
33
EpitopeID/sacCer3_EpiID/FASTA_genome/genome.fa*
44
EpitopeID/hg19_EpiID/FASTA_genome/genome.fa*
5+
StrainID/hg38_VCF
6+
StrainID/hg38_ENCODE
7+
StrainID/hg38_DepMap
8+
StrainID/utility_scripts/header.vcf
9+
StrainID/utility_scripts/CL_names.txt
10+
StrainID/utility_scripts/CSV
11+
StrainID/utility_scripts/hg38_DepMap

StrainID/hg38_ENCODE.tar.gz

546 KB
Binary file not shown.

StrainID/hg38_VCF.tar.gz

101 KB
Binary file not shown.

StrainID/utility_scripts/Model.csv

Lines changed: 1865 additions & 0 deletions
Large diffs are not rendered by default.

StrainID/utility_scripts/README.md

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
2+
### Model.csv
3+
Downloaded from https://depmap.org/portal/download/all/?releasename=DepMap+Public+23Q2&filename=Model.csv on May 14, 2023
4+
5+
6+
### generate_hg38_VariantDB.sh
7+
Download all DepMap (Broad) variants based on `Model.csv` accessions and cell line names.
8+
9+
### generate_sacCer3_VariantDB.sh
10+
Download all Song et al (2015) variants and perform BedTools intersect for unique substitutions (better uniquely characterize strain) and format into VCF files for StrainID.

StrainID/utility_scripts/generate_hg38_VariantDB.sh

Lines changed: 499 additions & 0 deletions
Large diffs are not rendered by default.

StrainID/utility_scripts/generate_sacCer3_VariantDB.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@
1818
usage()
1919
{
2020
echo 'generate_sacCer3_VariantDB.sh'
21-
echo 'eg: bash generate_sacCer3_GenomeAnnotation.sh'
21+
echo 'eg: bash generate_sacCer3_VariantDB.sh'
2222
exit
2323
}
2424

0 commit comments

Comments
 (0)