Skip to content

Commit 17ea970

Browse files
authored
Merge pull request #109 from RBL-NCI/activeDev
complete v2.0 merge
2 parents e717c01 + a8a9d57 commit 17ea970

57 files changed

Lines changed: 19862 additions & 2680 deletions

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,48 @@
1+
---
2+
title: "QC Troubleshooting"
3+
output: html_document
4+
params:
5+
a_txt:
6+
value: "~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/align2e_aligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/align2h_aligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/align2i_aligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/align2j_aligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/alignClipSeqTools_v1_aligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/alignClipSeqTools_v2_aligned.txt"
7+
u_txt:
8+
value: "~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/align2e_unaligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/align2h_unaligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/align2i_unaligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/align2j_unaligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/alignClipSeqTools_v1_unaligned.txt ~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/alignClipSeqTools_v2_unaligned.txt"
9+
b_txt:
10+
value: x
11+
---
12+
13+
```{r Params, echo=FALSE}
14+
align_txt = as.list(strsplit(params$a_txt, '\\s+')[[1]])
15+
unalign_txt = as.list(strsplit(params$u_txt, '\\s+')[[1]])
16+
```
17+
18+
## Alignment
19+
### Aligned
20+
**Statistics for aligned samples:**
21+
22+
```{r, echo=FALSE, comment='', results="asis"}
23+
setwd("~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/")
24+
for (a_file in align_txt){
25+
#text files
26+
cat(readLines(a_file), sep = '\n\n')
27+
28+
#png files
29+
png_file = gsub(".txt",".png",a_file)
30+
cat("\t\t", paste0("![](", png_file,")"), "\n")
31+
}
32+
```
33+
34+
### Unaligned
35+
**Statistics for unaligned samples:**
36+
37+
```{r, echo=FALSE, comment='', results="asis"}
38+
setwd("~/../../Volumes/RBL_NCI-1/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/output/complete_sample/align/")
39+
40+
for (a_file in unalign_txt){
41+
#text files
42+
cat(readLines(a_file), sep = '\n')
43+
44+
#png files
45+
png_file = gsub(".txt",".png",a_file)
46+
cat("\t\t", paste0("![](", png_file,")"), "\n")
47+
}
48+
```

build/STAR_testing/README

Lines changed: 80 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,80 @@
1+
## Download reference files
2+
Download all reference files from source
3+
run_star.sh with flag flag_download="Y"
4+
5+
## Index reference files
6+
Create indexed reference files to feed during alignment
7+
run_star.sh with flag flag_index="Y"
8+
9+
## Use file to create STAR params listing
10+
Update /data/RBL_NCI/Wolin/mES_fclip_1_YL_012122/alignment_analysis/star/docs/Star_variations.txt
11+
run_star.sh with flag flag_variables="Y" to create variable_set.txt file which has all of the STAR variables needed
12+
13+
## Run STAR alignment with partial sample
14+
Using input FLAG_Ro_fclip.dedup.si.Sptan1.unique.fastq run star variables
15+
1A 1c 2b 2d 2f 2h ClipSeqTools_v2 1b 2a 2c 2e 2g 2i ClipSeqTools_v1 clipv1_double_raiseq Encode1
16+
run_star.sh with flag flag_align_partial="Y" - done on an interactive node
17+
18+
Run analysis of gaps in these samples
19+
run_star.sh with flag flag_gap_partial="Y"
20+
cigar_plotting_partial.Rmd
21+
22+
## Run STAR alignment with complete sample
23+
Using input /data/RBL_NCI/Wolin/mES_fclip_1_YL_012122/01_preprocess/FLAG_Ro_fclip_filtered.fastq run star variables
24+
2e 2i 2j ClipSeqTools_v1 2h ClipSeqTools_v2
25+
run_star.sh with flag flag_align_complete="Y" - sends swarm to cluster
26+
27+
Run analysis of gaps in these samples
28+
run_star.sh with flag flag_gap_complete="N"
29+
30+
Run analysis of alignment stats in these samples
31+
run_star.sh with flag flag_align_stats="N"
32+
cigar_plotting_complete_v1.Rmd
33+
34+
Run iCLIP pipeline in these samples
35+
save to complete_sample/pipeline dir
36+
37+
## Run STAR alignment with rnu6 sample
38+
Using input /data/RBL_NCI/Wolin/mES_fclip_1_YL_012122/01_preprocess/FLAG_Ro_fclip_filtered.fastq subset sample for rnu6 reads
39+
Used IGV session to randomly select geneIDs and saved to text file rnu6_readids.txt
40+
Remove @ onto readids for subsetting of novo bam file and save to text file rnu6_readids_piccard.txt
41+
run_star.sh with flag flag_subset_rnu="Y"
42+
43+
Run alignment of rnu6
44+
run_star.sh with flag flag_align_rnu="Y"
45+
46+
Run analysis of alignment stats in these samples
47+
run_star.sh with flag flag_align_stats_rnu="N"
48+
49+
## Run STAR alignment with complete sample
50+
Expand complete sample testing with additional variables
51+
original: 2e 2i 2j ClipSeqTools_v1 2h ClipSeqTools_v2
52+
added: clipv1_double 2e_double clipv1_triple
53+
add new def "expanded" to include additional variables not prev tested
54+
run_star.sh with flag flag_align_complete="Y" edit for only necessary variables - sends swarm to cluster
55+
56+
Run analysis of gaps in these samples
57+
run_star.sh with flag flag_gap_complete="Y"
58+
cigar_plotting_complete_v2.Rmd
59+
60+
Run iCLIP pipeline in these samples
61+
save to complete_sample/pipeline_v2 dir
62+
63+
Due to cluster being down create subset BAM to transfer files to desktop
64+
Subsetting for Rnu6, Sptan1, sympk, Gm24204
65+
run_star.sh with flag flag_subset_bam_multiple_genes="Y"
66+
67+
## Run STAR alignment with partial sample
68+
Subsetting for Rnu6, Sptan1, sympk, Gm24204, GAPDH, ACTB
69+
70+
Used IGV session to randomly select geneIDs for original subset (Rnu6, Sptan1, sympk, Gm24204) and then gene ranges to select all genes for GAPDH,ACTB as requested by Marco
71+
run_star.sh with flag flag_subset_fq_multiple_genes="Y" to submit job (930) to cluster to create one FQ with all genes
72+
73+
Run alignment with variables (expanded_gene_list):
74+
clipfinal_10 clipfinal_12 clipfinal_6 clipfinal_8
75+
clipfinal_11 clipfinal_5 clipfinal_7 clipfinal_9
76+
run_star.sh with flag flag_align_overhang="Y" will submit to cluster
77+
78+
Run analysis of gaps in these samples
79+
run_star.sh with flag flag_gap_overhang="Y"
80+
cigar_plotting_partial_v2.Rmd

build/STAR_testing/cigar_plotting.html

Lines changed: 1682 additions & 0 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)