Skip to content

Commit 81ef5f8

Browse files
committed
add scripts to make sacCer3_VCF db
This commit updates the sacCer3_VCF variant database such that the VCF files for each strain only contain unique, unambiguous variants. The full, original VCF files are moved into a subdirectory `sacCer3_VCF/full_VCF`. Also included in this commit are scripts to regenerate these databases from scratch. sacCer3_VCF/*.gatk.vcf -These are the unambiguous variants for each sacCer3 strain sacCer3_VCF/full_VCF/*.gatk.vcf -These are the full VCF files saved for reference utility_scripts/generate_sacCer3_VariantDB.sh -This bash script downloads VCF files from SGD, parses them to convert chromosome naming from roman to arabic numerals, and strips out ambiguous variants with bedtools intersect utility_scripts/parsers/parse_sacCer3_VCF.pl -This is the perl script for converting the chromosome naming system of the VCF files
1 parent 9fc4413 commit 81ef5f8

26 files changed

Lines changed: 258914 additions & 200693 deletions

StrainID/sacCer3_VCF/BY4741.gatk.vcf

Lines changed: 2 additions & 490 deletions
Large diffs are not rendered by default.

StrainID/sacCer3_VCF/BY4742.gatk.vcf

Lines changed: 2 additions & 456 deletions
Large diffs are not rendered by default.

StrainID/sacCer3_VCF/CEN.PK2-1Ca.gatk.vcf

Lines changed: 2 additions & 20457 deletions
Large diffs are not rendered by default.

StrainID/sacCer3_VCF/D273-10B.gatk.vcf

Lines changed: 2 additions & 24029 deletions
Large diffs are not rendered by default.

StrainID/sacCer3_VCF/FL100.gatk.vcf

Lines changed: 2 additions & 18808 deletions
Large diffs are not rendered by default.

StrainID/sacCer3_VCF/JK9-3d.gatk.vcf

Lines changed: 2 additions & 11341 deletions
Large diffs are not rendered by default.

StrainID/sacCer3_VCF/README.txt

Lines changed: 0 additions & 3 deletions
This file was deleted.

StrainID/sacCer3_VCF/RM11-1A.gatk.vcf

Lines changed: 2 additions & 36666 deletions
Large diffs are not rendered by default.

StrainID/sacCer3_VCF/SEY6210.gatk.vcf

Lines changed: 2 additions & 6102 deletions
Large diffs are not rendered by default.

StrainID/sacCer3_VCF/Sigma1278b-10560-6B.gatk.vcf

Lines changed: 2 additions & 30424 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)