Commit 032a449

Add ecoli reference files for EpitopeID
This commit adds the reference files for running EpitopeID on E. coli data, using the W3110 strain reference assembly. The utility scripts are also updated so that users can rebuild the annotation files with different bin sizes. The .gitignore file now excludes the E. coli BWA index files, but the E. coli genome FASTA is included in this commit because it is small.
1 parent 0595656 commit 032a449

27 files changed

Lines changed: 71016 additions & 0 deletions

.gitignore

Lines changed: 5 additions & 0 deletions
@@ -5,6 +5,11 @@ EpitopeID/sacCer3_EpiID/FASTA_genome/genome.fa.ann
 EpitopeID/sacCer3_EpiID/FASTA_genome/genome.fa.bwt
 EpitopeID/sacCer3_EpiID/FASTA_genome/genome.fa.pac
 EpitopeID/sacCer3_EpiID/FASTA_genome/genome.fa.sa
+EpitopeID/ecoli_EpiID/FASTA_genome/genome.fa.amb
+EpitopeID/ecoli_EpiID/FASTA_genome/genome.fa.ann
+EpitopeID/ecoli_EpiID/FASTA_genome/genome.fa.bwt
+EpitopeID/ecoli_EpiID/FASTA_genome/genome.fa.pac
+EpitopeID/ecoli_EpiID/FASTA_genome/genome.fa.sa
 EpitopeID/hg19_EpiID/FASTA_genome/genome.fa
 EpitopeID/hg19_EpiID/FASTA_genome/genome.fa.amb
 EpitopeID/hg19_EpiID/FASTA_genome/genome.fa.ann

EpitopeID/ecoli_EpiID/FASTA_genome/genome.fa

Lines changed: 66379 additions & 0 deletions
Large diffs are not rendered by default.
Lines changed: 1 addition & 0 deletions
@@ -0,0 +1 @@
W3110 4646332 7 70 71
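The single added line has the five-column layout of a FASTA index as written by `samtools faidx`: sequence name, length, byte offset of the first base, bases per line, and bytes per line. The filename is not shown on this page, so that reading is an assumption. Under it, the byte position of any base in the genome FASTA can be computed directly. The sketch below is illustrative only; `FaiRecord`, `parse_fai_line`, and `byte_offset` are hypothetical names, not part of EpitopeID.

```python
from typing import NamedTuple


class FaiRecord(NamedTuple):
    name: str
    length: int       # total number of bases in the sequence
    offset: int       # byte offset of the first base in the FASTA file
    line_bases: int   # bases per full sequence line
    line_width: int   # bytes per full sequence line, including the newline


def parse_fai_line(line: str) -> FaiRecord:
    """Parse one whitespace-separated index line (assumed faidx layout)."""
    name, length, offset, line_bases, line_width = line.split()
    return FaiRecord(name, int(length), int(offset), int(line_bases), int(line_width))


def byte_offset(rec: FaiRecord, pos: int) -> int:
    """Return the file byte offset of the 0-based base position `pos`."""
    if not 0 <= pos < rec.length:
        raise ValueError(f"position {pos} is outside 0..{rec.length - 1}")
    full_lines, within_line = divmod(pos, rec.line_bases)
    return rec.offset + full_lines * rec.line_width + within_line


rec = parse_fai_line("W3110 4646332 7 70 71")
print(byte_offset(rec, 0))    # first base of the sequence
print(byte_offset(rec, 70))   # first base of the second sequence line
```

An offset of 7 is consistent with a six-character header line (`>W3110`) followed by a newline, and a line width of 71 with 70 bases plus a single newline byte per line.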
Lines changed: 26 additions & 0 deletions
@@ -0,0 +1,26 @@
>AID
CCAAAAGATCCGGCGAAGCCACCCGCGAAGGCGCAGGTTGTTGGTTGGCCACCTGTAAGGTCATATCGTAAGAATGTTATGGTTAGCTGTCAGAAGAGCAGTGGCGGTCCAGAGGCCGCTGCATTTGTGAAA
>CBP
AAGAGAAGGTGGAAGAAGAACTTCATCGCCGTGAGCGCGGCAAACCGTTTCAAGAAGATAAGTAGCAGTGGAGCATTA
>Extended-Tap
GGTCGACGGATCCCCGGGTTAATTAATCCATGGAAGAGAAGATGGAAAAAGAATTTCATAGCCGTCTCAGCAGCCAACCGCTTTAAGAAAATCTCATCCTCCGGGGCACTTGATTATGATATTCCAACTACTGCTAGCGAGAATTTGTATTTTCAGGGAGAATTCGGCCTTGCGCAACACGATGAAGCCGTGGACAACAAATTCAACAAAGAACAACAAAACGCGTTCTATGAGATCTTACATTTACCTAACTTAAACGAAGAACAACGAAACGCCTTCATCCAAAGTTTAAAAGATGACCCAAGCCAAAGCGCTAACCTTTTAGCAGAAGCTAAAAAGCTAAATGATGCTCAGGCGCCGAAAGTAGACAACAAATTCAACAAAGAACAACAAAACGCGTTCTATGAGATCTTACATTTACCTAACTTAAACGAAGAACAACGAAACGCCTTCATCCAAAGTTTAAAAGATGACCCAAGCCAAAGCGCTAACCTTTTAGCAGAAGCTAAAAAGCTAAATGATGCTCAGGCGCCGAAAGTAGACGCGAATCATCAGTGAGGCGCGCCACTTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTATAAAAAAAATAAGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCTTATTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCTTTCTCAGGTATAGTATGAGGTCGCTCTTATTGACCACACCTCTACCGGCAGATCCGCTAGGGATAACAGGGTAATATAGATCTGTTTAGCTTGCCTCGTCCCCGCCGGGTCACCCGGCCAGCGACATGGAGGCCCAGAATACCCTCCTTGACAGTCTTGACGTGCGCAGCTCAGGGGCATGATGTGACTGTCGCCCGTACATTTAGCCCATACATCCCCATGTATAATCATTTGCATCCATACATTTTGATGGCCGCACGGCGCGAAGCAAAAATTACGGCTCCTCGCTGCAGACCTGCGAGCAGGGAAACGCTCCCCTCACAGACGCGTTGAATTGTCCCCACGCCGCGCCCCTGTAGAGAAATATAAAAGGTTAGGATTTGCCACTGAGGTTCTTCTTTCATATACTTCCTTTTAAAATCTTGCTAGGATACAGTTCTCACATCACATCCGAACATAAACAACCATGGGTAGGAGGGCTTTTGTAGAAAGAAATACGAACGAAACGAAAATCAGCGTTGCCATCGCTTTGGACAAAGCTCCCTTACCTGAAGAGTCGAATTTTATTGATGAACTTATAACTTCCAAGCATGCAAACCAAAAGGGAGAACAAGTAATCCAAGTAGACACGGGAATTGGATTCTTGGATCACATGTATCATGCACTGGCTAAACATGCAGGCTGGAGCTTACGACTTTACTCAAGAGGTGATTTAATCATCGATGATCATCACACTGCAGAAGATACTGCTATTGCACTTGGTATTGCATTCAAGCAGGCTATGGGTAACTTTGCCGGCGTTAAAAGATTTGGACATGCTTATTGTCCACTTGACGAAGCTCTTTCTAGAAGCGTAGTTGACTTGTCGGGACGGCCCTATGCTGTTATCGATTTGGGATTAAAGCGTGAAAAGGTTGGGGAATTGTCCTGTGAAATGATCCCTCACTTACTATATTCCTTTTCGGTAGCAGCTGGAATTACTTTGCATGTTACCTGCTTATATGGTAGTAATGACCATCATCGTGCTGAAAGCGCTTTTAAATCTCTGGCTGTTGCCATGCGCGCGGCTACTAGTCTTACTGGAAGTTCTGAAGTCCCAAGCACGAAGGGAGTGTTGTAAAGAGTACTGACAATAAAAAGATTCTTGTTTTCAAGAACTTGTCATTTGTATAGTTTTTTTATATTGTAGTTGTTCTATTTTAATCAAATGTTAGCGTGATTTATATTTTTTTTCGCCTCGACATCATCTGCCCAGATGCGAAGTTAAGTGCGCAGAAAGTAA
TATCATGCGTCAATCGTATGTGAATGCTGGTCGCTATACTGCTGTCGATTCGATACTAACGCCGCCATCCAGTTTAAACGAGCTCGAATTCATCGA
>FLAG-3x
AGATTACAAGGATCACGATGGCGATTACAAGGATCACGATATCGATTACAAGGATGATGATGATAAG
>FRB
ATATTGTGGCACGAGATGTGGCACGAAGGGCTGGAAGAGGCCTCTAGGCTATACTTTGGCGAGCGTAATGTCAAAGGGATGTTCGAGGTGCTAGAGCCCCTTCACGCGATGATGGAGAGAGGTCCACAGACTCTGAAAGAAACGAGCTTCAATCAAGCCTACGGTAGGGACTTGATGGAAGCTCAAGAGTGGTGTCGTAAGTATATGAAATCCGGGAACGTCAAAGACTTGTTACAAGCTTGGGATCTGTATTACCATGTCTTCCGTAGAATCAGCAAG
>GFP
GTAAAGGTGAGGAATTATTTACAGGCGTAGTACCAATCCTAGTAGAGTTAGACGGCGACGTCAATGGCCACAAATTTAGTGTCTCTGGCGAGGGTGAAGGTGATGCTACCTACGGTAAGTTAACGCTAAAATTTATATGCACTACGGGTAAATTACCAGTACCGTGGCCCACATTAGTGACAACTTTTACTTATGGAGTGCAGTGTTTTTCCCGTTACCCGGATCACATGAAAAGGCACGACTTCTTCAAATCTGCTATGCCGGAAGGGTACGTTCAAGAAAGAACTATATTTTTTAAGGACGACGGGAATTACAAGACCAGAGCTGAAGTAAAATTTGAGGGAGACACCTTGGTTAACAGAATTGAGCTAAAGGGAATAGATTTCAAAGAAGATGGGAACATCCTAGGCCATAAATTAGAGTACAACTATAATTCACACAATGTGTATATAATGGCAGATAAGCAGAAAAATGGAATTAAAGTTAACTTCAAGATTAGACACAATATCGAAGATGGGAGTGTCCAGCTAGCCGATCACTATCAACAGAATACGCCGATTGGCGATGGGCCTGTCCTTTTGCCAGATAATCACTATCTTTCTACACAATCTGCTCTGAGTAAGGACCCGAATGAGAAACGTGACCACATGGTCCTGTTGGAATTTGTAACAGCAGCGGGCATAACACACGGCATGGATGAGCTATATAAA
>HA_v1
TACCCATACGATGTTCCAGATTACGCTTACCCATACGATGTTCCAGATTACGCTTACCCATACGATGTTCCAGATTACGCT
>HA_v2
TATCCATATGATGTTCCAGATTATGCTTATCCATATGATGTTCCAGATTATGCTTATCCATATGATGTTCCAGATTATGCT
>HA_v3
GCGGCCGTTTACCCATACGATGTTCCTGACTATGCGGGCTATCCCTATGACGTCCCGGACTATGCAGGATCCTATCCATATGACGTTCCAGATTACGCTCCGGCCGCC
>HaloTag
AATGCAGAAATAGGGACAGGCTTTCCTTTTGATCCCCATTATGTAGAGGTACTAGGAGAACGTATGCATTACGTGGACGTGGGACCAAGAGATGGCACCCCAGTTCTTTTTTTACACGGAAACCCAACCAGTTCTTATGTCTGGAGGAACATCATCCCTCACGTTGCTCCAACTCACAGATGTATAGCGCCTGATCTGATTGGGATGGGTAAATCAGACAAACCCGACTTGGGCTACTTCTTCGATGACCACGTCAGATTTATGGACGCGTTCATTGAGGCACTTGGGTTGGAAGAAGTCGTCCTGGTAATACATGACTGGGGATCAGCCCTGGGATTTCATTGGGCTAAAAGGAATCCAGAGAGAGTAAAAGGGATTGCCTTCATGGAGTTTATCAGGCCCATACCCACCTGGGATGAGTGGCCCGAGTTTGCTAGAGAGACCTTCCAGGCTTTCAGGACTACCGATGTCGGCCGTAAATTGATTATTGACCAAAATGTTTTCATCGAGGGAACGTTGCCTATGGGCGTAGTCCGTCCCCTAACGGAGGTCGAAATGGATCATTACAGAGAACCGTTTTTGAACCCTGTAGACCGTGAGCCACTGTGGCGTTTTCCAAACGAATTACCGATAGCTGGCGAACCCGCTAATATCGTCGCCCTGGTAGAGGAATACATGGACTGGCTTCATCAAAGTCCTGTTCCCAAATTGTTATTTTGGGGTACACCAGGTGTATTAATTCCACCTGCCGAAGCCGCCAGGTTGGCAAAATCATTACCGAACTGCAAAGCTGTAGACATCGGCCCCGGTTTGAATCTGCTGCAGGAGGATAACCCGGATTTGATAGGGTCTGAAATCGCGCGTTGGTTGTCAACCCTGGAAATATCAGGC
>MNase_v2
CCGCCACCAGTACCAAGAAGCTGCACAAGGAGCCCGCCACCCTGATCAAGGCCATAGATGGCGATACCGTGAAGCTGATGTATAAGGGCCAGCCCATGACCTTCCGCCTGCTGCTGGTGGATACCCCCGAGACCAAGCACCCGAAAAAGGGCGTGGAAAAGTACGGACCCGAGGCCAGCGCCTTCACAAAAAAGATGGTGGAGAACGCCAAGAAAATCGAGGTGGAGTTCGATAAAGGCCAACGCACCGATAAGTATGGACGCGGTCTGGCCTACATCTACGCCGACGGCAAAATGGTGAACGAGGCCCTGGTGCGGCAGGGACTGGCCAAGGTGGCGTACGTGTACAAGCCCAACAACACCCACGAGCAGCACCTGCGCAAGAGCGAGGCTCAGGCAAAGAAGGAGAAACTGAACATCTGGAGTGAGGATAACGCCGATAGCGGCCAGT
>Myc-3x
GGCGAACAGAAGCTAATCAGTGAGGAGGACCTTAATGGTGAGCAGAAACTAATCTCTGAAGAGGATTTGAATGGAGAACAAAAGCTGATTTCTGAGGAGGACTTAAAC
>ProteinA
AAAACGGCAGCTTTAGCGCAGCACGACGAGGCTGTGGACAATAAGTTTAACAAGGAGCAGCAGAACGCATTCTACGAGATCCTTCATCTGCCTAACCTGAATGAGGAACAAAGGAACGCGTTCATTCAAAGTCTGAAGGACGACCCAAGCCAGTCAGCGAATTTGCTTGCCGAAGCAAAAAAACTGAACGACGCCCAAGCGCCCAAAGTCGATAATAAATTCAATAAGGAACAGCAGAATGCATTCTATGAGATTCTGCATCTACCGAATCTTAACGAAGAACAGAGGAACGCCTTTATCCAGTCTCTAAAGGATGATCCAAGCCAATCCGCTAACCTATTAGCTGAAGCAAAGAAGCTTAATGGAGCCCAAGCACCGAAAGTTGATGCCAACAGTGCCGGTAAATCAACT
Lines changed: 1 addition & 0 deletions
@@ -0,0 +1 @@
5492 13 0
Lines changed: 27 additions & 0 deletions
@@ -0,0 +1,27 @@
5492 13 11
0 AID (null)
0 132 0
0 CBP (null)
132 78 0
0 Extended-Tap (null)
210 2096 0
0 FLAG-3x (null)
2306 67 0
0 FRB (null)
2373 279 0
0 GFP (null)
2652 710 0
0 HA_v1 (null)
3362 81 0
0 HA_v2 (null)
3443 81 0
0 HA_v3 (null)
3524 108 0
0 HaloTag (null)
3632 891 0
0 MNase_v2 (null)
4523 450 0
0 Myc-3x (null)
4973 108 0
0 ProteinA (null)
5081 411 0
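The 27 lines above match the text layout BWA uses for its `.ann` index: a summary line (total length, sequence count, seed) followed by two lines per sequence, the second giving offset, length, and ambiguous-base count. The filename is not shown on this page, so that interpretation is an assumption. A short sketch that checks the offsets are contiguous and add up to the stated total; `check_ann` is a hypothetical helper, not part of EpitopeID or BWA.

```python
ANN_TEXT = """\
5492 13 11
0 AID (null)
0 132 0
0 CBP (null)
132 78 0
0 Extended-Tap (null)
210 2096 0
0 FLAG-3x (null)
2306 67 0
0 FRB (null)
2373 279 0
0 GFP (null)
2652 710 0
0 HA_v1 (null)
3362 81 0
0 HA_v2 (null)
3443 81 0
0 HA_v3 (null)
3524 108 0
0 HaloTag (null)
3632 891 0
0 MNase_v2 (null)
4523 450 0
0 Myc-3x (null)
4973 108 0
0 ProteinA (null)
5081 411 0
"""


def check_ann(text: str) -> dict[str, int]:
    """Validate an .ann-style listing and return {sequence name: length}."""
    lines = text.strip().splitlines()
    total_len, n_seqs, _seed = (int(x) for x in lines[0].split())
    records = lines[1:]
    if len(records) != 2 * n_seqs:
        raise ValueError("expected two lines per sequence")

    lengths: dict[str, int] = {}
    expected_offset = 0
    # Each sequence contributes a name line and a coordinate line.
    for name_line, coord_line in zip(records[0::2], records[1::2]):
        name = name_line.split()[1]
        offset, length, _n_ambs = (int(x) for x in coord_line.split())
        if offset != expected_offset:
            raise ValueError(f"{name}: offset {offset}, expected {expected_offset}")
        lengths[name] = length
        expected_offset += length

    if expected_offset != total_len:
        raise ValueError(f"lengths sum to {expected_offset}, header says {total_len}")
    return lengths


lengths = check_ann(ANN_TEXT)
print(len(lengths), sum(lengths.values()))
```

Each offset equals the sum of the preceding sequence lengths, so the tag sequences are laid end to end in the order they appear in the FASTA file.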
5.44 KB
Binary file not shown.
1.34 KB
Binary file not shown.
2.73 KB
Binary file not shown.
Lines changed: 2 additions & 0 deletions
@@ -0,0 +1,2 @@
>AID
CCAAAAGATCCGGCGAAGCCACCCGCGAAGGCGCAGGTTGTTGGTTGGCCACCTGTAAGGTCATATCGTAAGAATGTTATGGTTAGCTGTCAGAAGAGCAGTGGCGGTCCAGAGGCCGCTGCATTTGTGAAA
