Skip to content

Commit c73b779

Browse files
authored
update the readme
few changes in the text.
1 parent dfb19f8 commit c73b779

1 file changed

Lines changed: 15 additions & 15 deletions

File tree

README.md

Lines changed: 15 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -5,25 +5,25 @@
55

66
<div style="text-align: justify">
77

8-
Bulk mRNA-seq experiments are essential for exploring a wide range of biological questions. To bring closer the data analysis to its interpretation and facilitate both interactive, exploratory tasks and the sharing of (easily accessible) information, we present *bulkAnalyseR* an R package that offers a seamless, customisable solution for most bulk RNAseq datasets. By integrating state-of-the-art approaches without relying on extensive computational support, and replacing static images with interactive panels, our aim is to further support and strengthen the reusability of data. *bulkAnalyseR* enables standard analyses of bulk data, using an expression matrix as starting point. It presents the outputs of various steps in an interactive web-based interface, making it easy to generate, explore and verify hypotheses. Moreover, the app can be easily shared and published, incentivising research reproducibility and allowing others to explore the same processed data and enhance the biological conclusions.
8+
Bulk sequencing experiments (e.g. mRNAseq, sRNAseq etc) are essential for exploring a wide range of biological questions. To bring closer the data analysis to its interpretation and facilitate both interactive, exploratory tasks and the sharing of (easily accessible) information, we present *bulkAnalyseR*, an R package that offers a seamless, customisable solution for most bulk sequencing datasets. By integrating state-of-the-art approaches without relying on extensive computational support, and replacing static images with interactive panels, our aim is to further support and strengthen the reusability of data. *bulkAnalyseR* enables standard analyses of bulk data, using an expression matrix as starting point. It presents the outputs of various steps in an interactive web-based interface, making it easy to generate, explore and verify hypotheses. Moreover, the app can be easily shared and published, incentivising research reproducibility and allowing others to explore the same processed data.
99

1010
[comment]: <Preprint: AAA>
1111

1212
<img src="vignettes/figures/workflow.png" alt="drawing" width="500"/>
1313

14-
*Workflow diagram of the **bulkAnalyseR** pipeline*
14+
*Workflow diagram of the **bulkAnalyseR** pipeline. The input comprises a processed (i.e. normalised, noise corrected) expression matrix. Using **bulkAnalyseR** all standard steps related to differential expression analyses are handled seamlesley. The pairwaise comparison of differential expression outputs is also possible (using cross-plots and upset plots). Lastly, localised Gene Regulatory Networks can be created.*
1515

1616
## Preprocessing step
1717

18-
### Defining expression matrix and metadata
18+
### Defining the input expression matrix and corresponding metadata
1919

20-
To create a shiny app using *bulkAnalyseR*, you need an expression matrix and a metadata table loaded.
20+
To create a shiny app using *bulkAnalyseR*, you need a processed (e.g. normalised, noise corrected) expression matrix and a corresponding metadata table loaded in your workspace.
2121

22-
The expression matrix should have genes on the rows (with the Ensembl ID as the row name) and samples on the columns.
22+
The expression matrix is expected to have genes on the rows (with the Ensembl ID as the row name) and samples on the columns.
2323

24-
The first column of the metadata table must match the column names of the expression matrix. The other columns of the metadata table contain other pieces of information about the samples e.g. timepoint, treatment groups you want to compare between.
24+
The first column of the metadata table must match the column names of the expression matrix. The other columns of the metadata table contain other information about the samples e.g. time points, treatment groups, demographic information etc; the differential expression analysis may take into account a subset of these columns.
2525

26-
For example, if you had this expression matrix (from [2019 paper by Yang et al](https://www.sciencedirect.com/science/article/pii/S2405471219301152)):
26+
We illustrate this on a case study from a 2019 paper by Yang et al (https://www.sciencedirect.com/science/article/pii/S2405471219301152). Using three time points, each with two replicates, for some bulk mRNAseq samples, we obtain the following matrices:
2727

2828
| |SRR7624365 | SRR7624366 | SRR7624371 | SRR7624372 | SRR7624375 | SRR7624376 |
2929
|---|---|---|---|---|---|--- |
@@ -32,7 +32,7 @@ For example, if you had this expression matrix (from [2019 paper by Yang et al](
3232
|ENSMUSG00000051951| 6 | 4 | 2 | 0 | 47 | 37 |
3333
|| ... | ... | ... | ... | ... | ... |
3434

35-
Then you could have this metadata table:
35+
Below is an example of a metadata table:
3636

3737
| srr | timepoint |
3838
| --- | --- |
@@ -45,27 +45,27 @@ Then you could have this metadata table:
4545

4646
### Denoising and normalisation
4747

48-
Before using the expression matrix to create our shiny app, some preprocessing should be performed. *bulkAnalyseR* contains the function **preprocessExpressionMatrix** which takes the expression matrix as input then denoises the data using [*noisyR*](https://github.com/Core-Bioinformatics/noisyR) and normalises using either quantile (by default) or RPM normalisation (specified using *normalisation.method* parameter).
48+
Before using the expression matrix to create a shiny app, some preprocessing should be performed. *bulkAnalyseR* contains the function **preprocessExpressionMatrix** which takes the raw expression matrix as input, then denoises the data using [*noisyR*](https://github.com/Core-Bioinformatics/noisyR) and normalises the expression levels using either quantile (by default) or RPM normalisation (specified using *normalisation.method* parameter).
4949

50-
It is not recommended to use data which has not been denoised and normalised as input to *generateShinyApp*. You can also perform your own preprocessing outside *preprocessExpressionMatrix*.
50+
It is not recommended to use data which has not been denoised and normalised as input to *generateShinyApp*; noisy data is prone to spurious, un-reproducible patterns; analyses performed on unnormalised data are unlikely to be robust. You can also perform your own preprocessing outside *preprocessExpressionMatrix* function.
5151

5252
Main function: *preprocessExpressionMatrix()*
5353

5454
Supporting function: *noisyr_counts_with_plot()*
5555

5656
## Generating shiny app
5757

58-
The central function in *bulkAnalyseR* is **generateShinyApp**. This function creates an app.R file and all required objects to run the app in .rda format in the target directory. The key inputs to **generateShinyApp** are *expression.matrix* (after being processed using *preprocessExpressionMatrix*) and *meta*. You can also specify the title, name for directory where the app should be saved, shiny theme as well as specifying the organism on which your data was generated.
58+
The central function in *bulkAnalyseR* is **generateShinyApp**. This function creates an app.R file and all required objects to run the app in .rda format in the target directory. The key inputs to **generateShinyApp** are *expression.matrix* (after being processed using *preprocessExpressionMatrix*) and *meta*. You can also specify the title, a folder name where the app will be saved, a shiny theme, as well as specifying the organism on which your data was generated.
5959

60-
Calling *generateShinyApp* with these parameters will create a folder with your chosen name in which there will be 2 data files *expression_matrix.rda* and *metadata.rda* and *app.R* which defines the app. To see the app, you can call *shiny::runApp()* with the name of the folder and the app will start. The app generated is standalone and can be shared with collaborators or published online through a platform like [shinyapps.io](https://www.shinyapps.io/). This provides an easy way for anyone to explore the data and verify the conclusions, increasing access and promoting reproducibility of the bioinformatics analysis.
60+
Calling *generateShinyApp* with these parameters will create a folder with your chosen name in which there will be 2 files *expression_matrix.rda* and *metadata.rda* and *app.R* which defines the app. To see the app, you can call *shiny::runApp()* with the name of the folder as parameter. The app generated is standalone and can be shared with collaborators or published online through a platform like [shinyapps.io](https://www.shinyapps.io/). This provides an easy way for anyone to explore the data and verify the conclusions, increasing access and promoting reproducibility of the bioinformatics analysis.
6161

6262
By default, the app will have 9 panels: Sample select, Quality checks, Differential expression, Volcano and MA plots, DE summary, Enrichment, Expression patterns, Cross plots, GRN inference. You can choose to remove one or more panels using the *default.panels* parameter.
6363

6464
By default, the app will look like this:
6565

6666
<img src="vignettes/figures/ScreenShot.png" alt="drawing" width="800"/>
6767

68-
*Screenshot from sample bulkAnalyseR app*
68+
*Screenshot from Yang case study processed with the bulkAnalyseR app*
6969

7070
[comment]: <See [vignette](www.vignettelink.com) for more details on the individual panels.>
7171

@@ -114,9 +114,9 @@ shiny::runApp(shiny.dir)
114114

115115
## Installation guide
116116

117-
*bulkAnalyseR* can be installed from CRAN using *install.packages("bulkAnalyseR")*. You also need to make sure all bioconductor dependencies are also installed.
117+
*bulkAnalyseR* can be installed from CRAN using *install.packages("bulkAnalyseR")*. Please make sure all bioconductor dependencies are also installed.
118118

119-
To install the latest stable development version from GitHub, first install CRAN dependencies as well as *devtools* then use *devtools::install_github("Core-Bioinformatics/bulkAnalyseR")*.
119+
To install the latest stable development version from GitHub, first install the CRAN dependencies as well as *devtools* then use *devtools::install_github("Core-Bioinformatics/bulkAnalyseR")*.
120120

121121
### Required CRAN packages (use *install.packages()*) ###
122122

0 commit comments

Comments
 (0)