Skip to content

Commit ccc43a6

Browse files
committed
add scripts download BY4742 fastq
This commit includes the PBS script for downloading the FASTQ files from the Sra metadata associated with de Jonge 2017. The `.gitignore` file is updated appropriately to ignore log and FASTQ files.
1 parent 4067d6e commit ccc43a6

2 files changed

Lines changed: 36 additions & 0 deletions

File tree

paper/.gitignore

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -35,6 +35,9 @@ SyntheticStrain/results/hg19*
3535
ENCODE_CellLines/results/BAM
3636
ENCODE_CellLines/results/BAM-nospike
3737
ENCODE_CellLines/results/ID
38+
BY4742-chipseq/logs/*.out
39+
BY4742-chipseq/logs/*.err
40+
BY4742-chipseq/results/FASTQ
3841
CENPK-chipseq/logs/*.out
3942
CENPK-chipseq/logs/*.err
4043
CENPK-chipseq/results/FASTQ
Lines changed: 33 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,33 @@
1+
#!/bin/bash
2+
#PBS -l nodes=1:ppn=6
3+
#PBS -l pmem=24gb
4+
#PBS -l walltime=03:00:00
5+
#PBS -A open
6+
#PBS -o logs/download.data.log.out
7+
#PBS -e logs/download.data.log.err
8+
#PBS -t 1-10
9+
10+
# Requires
11+
# parallel fastq dump v2.8.0
12+
13+
# FIRST CHANGE PATH TO EXECUTE
14+
WRK=/path/to/GenoPipe/paper/BY4742-chipseq
15+
cd $WRK
16+
17+
module load anaconda3
18+
source activate ~/work/myconda/genopipe/
19+
20+
[ -d logs ] || mkdir logs
21+
[ -d results/FASTQ ] || mkdir -p results/FASTQ
22+
23+
INDEX=$(($PBS_ARRAYID+1))
24+
25+
METADATA=SraRunInfo.csv
26+
INFO=`sed "${INDEX}q;d" $METADATA`
27+
SRR=`echo $INFO | cut -d"," -f1`
28+
#echo $INFO
29+
30+
# FASTQ-DUMP
31+
echo "($INDEX) Begin downloading $SRR FASTQ..."
32+
parallel-fastq-dump --gzip --split-files -t 8 -O results/FASTQ -s $SRR
33+
echo "Complete"

0 commit comments

Comments
 (0)