Commit 20df013 ("readme improved")
1 parent 466f106

27 files changed

Lines changed: 288 additions & 762 deletions

README.md

Lines changed: 24 additions & 23 deletions
@@ -11,7 +11,7 @@ Benchmarking GRN inference methods
 <!-- Leaderboard:
 [Performance comparision](https://add-grn--openproblems.netlify.app/results/grn_inference/) -->
 
-[Performance comparision](https://github.com/janursa/grn_benchmark/blob/main/notebooks/process_results.ipynb) -- we are currently under revision and the official leaderboard will be released soon.
+Check for [performance comparision](https://github.com/janursa/grn_benchmark/blob/main/notebooks/process_results.ipynb) of integrated GRN inference methods.
 
 Article: [geneRNIB: a living benchmark for gene regulatory network inference](https://www.biorxiv.org/content/10.1101/2025.02.25.640181v1)
 
@@ -22,7 +22,7 @@ Documentation:
 Repository:
 [openproblems-bio/task_grn_inference](https://github.com/openproblems-bio/task_grn_inference)
 
-If you use this framework, please cite it as
+If you use this framework, please cite
 
 ```
 @article{nourisa2025genernib,
@@ -49,58 +49,58 @@ are re-assessed, and the leaderboard is updated accordingly. The aim is
 to evaluate both the accuracy and completeness of inferred GRNs. It is
 designed for both single-modality and multi-omics GRN inference.
 
-In the current version, geneRNIB contains 10 inference methods including
-both single and multi-omics, 8 evalation metrics, and five datasets.
-
-See our publication for the details of methods.
-
 ## Installation
 
-You need to have Docker, Java, and Viash installed. Follow
-[these instructions](https://openproblems.bio/documentation/fundamentals/requirements)
-to install the required dependencies.
+Install Docker, Java, and Viash using
+[these instructions](https://openproblems.bio/documentation/fundamentals/requirements).
 
 ## Download resources
 ```bash
 git clone --recursive git@github.com:openproblems-bio/task_grn_inference.git
 
 cd task_grn_inference
 ```
-To interact with the framework, you should download the resources containing necessary inferene and evaluation datasets to get started.
-Here, we download the **test resources** which are solely used for testing if the framework is installed successfully.
+To interact with the framework,download the resources containing necessary inferene and evaluation datasets.
 
 ```bash
-scripts/download_resources.sh
+pip install awscli
+aws s3 sync s3://openproblems-data/resources/grn/grn_benchmark resources/grn_benchmark --no-sign-request
+
 ```
 
-Refer to the [Documentation](https://genernib-documentation.readthedocs.io/en/latest/) for downloading the actual datasets. To reproduce the results, run `scripts/run_benchmark_all.sh`, which is a very resource intensive run.
 
 ## Run a GRN inference method
 
 To infer a GRN for a given dataset (e.g. `op`) using simple Pearson correlation:
 
 ```bash
 viash run src/methods/pearson_corr/config.vsh.yaml -- \
-  --rna resources_test/grn_benchmark/inference_data/op_rna.h5ad \
-  --prediction output/net.h5ad \
-  --tf_all resources_test/grn_benchmark/prior/tf_all.csv
+  --rna resources/grn_benchmark/inference_data/op_rna.h5ad \
+  --tf_all resources/grn_benchmark/prior/tf_all.csv \
+  --prediction output/net.h5ad
 ```
-Of note, we are using the `resources_test` datasets, which are small versions of the actual datasets for computational speed. Thus, the obtained predictions are not realistic. To obtain a realistic prediction, download the actual data and set the folder to `resources`.
 
-## Evaluate a GRN prediction
-Once got the prediction for a given dataset (e.g. op), use the following code to obtain evaluation scores.
+## Evaluate a GRN model
 
 ```bash
-bash scripts/run_grn_evaluation.sh --prediction=output/net.h5ad --save_dir=output/ --dataset=op --build_images=true --test_run=true
+bash scripts/run_grn_evaluation.sh \
+  --prediction=output/net.h5ad \
+  --dataset=op \
+  --build_images=true \
+  --save_dir=output
 ```
+`build_images` only needed for the first run.
 
-**This** outputs the scores into `output/score_uns.yaml`. Of note, by passing `--test_run`, the evaluations are done on the test data. To use the actual data (`resources` folder), omit this flag.
+This outputs the scores into `output/score_uns.yaml`.
 
 
 ## Add a GRN inference method, evaluation metric, or dataset
 
 To add a new component to the repository, follow the [Documentation](https://genernib-documentation.readthedocs.io/en/latest/).
 
+## Run the entire pipline
+
+Run `scripts/run_all.sh` for the entire pipeline. Due to resource intensive nature of the task, we have splitted the pipeline into two steps of GRN inference and evaluation.
 
 ## Authors & contributors
 
@@ -109,9 +109,10 @@ To add a new component to the repository, follow the [Documentation](https://gen
 | Jalil Nourisa | author |
 | Robrecht Cannoodt | author |
 | Antoine Passimier | contributor |
+| Jérémie Kalfon | contributor |
 | Marco Stock | contributor |
 | Christian Arnold | contributor |
-| Jérémie Kalfon | contributor |
+
 
 ## API
 
docs/source/images/grn_models.png

-132 KB

miniconda.sh

Whitespace-only changes.

scripts/download_resources.sh

Lines changed: 2 additions & 7 deletions
@@ -2,11 +2,6 @@
 
 set -e
 
-common/scripts/sync_resources
+# common/scripts/sync_resources
 
-# aws s3 sync s3://openproblems-data/resources/grn/grn_benchmark resources/grn_benchmark --no-sign-request
-
-
-# aws s3 sync s3://openproblems-data/resources/grn/grn_models resources/grn_models --delete
-# aws s3 sync resources_test/ s3://openproblems-data/resources_test/grn/ --delete
-# aws s3 sync resources/grn_benchmark/ s3://openproblems-data/resources/grn/grn_benchmark --delete
+aws s3 sync s3://openproblems-data/resources_test/grn/grn_benchmark resources_test/grn_benchmark --no-sign-request
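With this commit the helper script syncs only the small test resources, while the README now shows the full-resource sync. A hedged sketch of a wrapper that prints the right command for either flavor (the `sync_cmd` function is hypothetical, not part of the repo; the S3 URIs are the ones shown in the diffs):

```bash
# Hypothetical helper: print the sync command for the small test resources
# (the script's new default) or for the full benchmark resources.
sync_cmd() {
  case "${1:-test}" in
    test) echo "aws s3 sync s3://openproblems-data/resources_test/grn/grn_benchmark resources_test/grn_benchmark --no-sign-request" ;;
    full) echo "aws s3 sync s3://openproblems-data/resources/grn/grn_benchmark resources/grn_benchmark --no-sign-request" ;;
    *)    echo "usage: sync_cmd [test|full]" >&2; return 1 ;;
  esac
}

# Dry run: inspect the command before executing it.
sync_cmd test
```

Printing instead of executing keeps the sketch side-effect free; paste the printed command (or pipe it to `bash`) once the AWS CLI is installed.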

scripts/run_grn_evaluation.sh

Lines changed: 2 additions & 5 deletions
@@ -53,7 +53,6 @@ done
 echo "$@"
 echo "DATASET: $DATASET"
 echo "PREDICTION: $PREDICTION"
-echo "SAVE_DIR: $SAVE_DIR"
 echo "RUN_TEST: $RUN_TEST"
 echo "BUILD_IMAGES: $BUILD_IMAGES"
 echo "RUN_LOCAL: $RUN_LOCAL"
@@ -64,11 +63,11 @@ if [ -z "${DATASET:-}" ]; then
   exit 1
 fi
 
+
 num_workers=10
-metric_ids="[regression_2, ws_distance, sem, tf_recovery, tf_binding, replica_consistency]" #regression_1, regression_2, ws_distance
+metric_ids="[all_metrics]" #regression_2, ws_distance, sem, tf_recovery, tf_binding, replica_consistency
 RUN_ID="${DATASET}_evaluation"
 models_folder="${DATASET}/"
-apply_skeleton=false
 apply_tf=true
 layer='lognorm'
 if [ "$RUN_TEST" = "false" ]; then
@@ -125,8 +124,6 @@ append_entry() {
       tf_all: ${resources_dir}/grn_benchmark/prior/tf_all.csv
       regulators_consensus: ${resources_dir}/grn_benchmark/prior/regulators_consensus_${dataset}.json
       prediction: ${prediction}
-      skeleton: ${resources_dir}/grn_benchmark/prior/skeleton.csv
-      apply_skeleton: ${apply_skeleton}
       apply_tf: ${apply_tf}
       reg_type: ${reg_type}
       layer: $layer_
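The script consumes `--key=value` flags (echoed as DATASET, PREDICTION, and so on in the diff above). That style of parsing is typically a loop like the following sketch; the function name and defaults here are assumptions for illustration, not code from the repository:

```bash
# Illustrative `--key=value` argument parsing, in the style wrapper scripts
# such as run_grn_evaluation.sh use. Defaults are assumptions.
parse_args() {
  DATASET=""; PREDICTION=""; BUILD_IMAGES="false"; SAVE_DIR="output"
  local arg
  for arg in "$@"; do
    case "$arg" in
      --dataset=*)      DATASET="${arg#*=}" ;;      # strip up to first '='
      --prediction=*)   PREDICTION="${arg#*=}" ;;
      --build_images=*) BUILD_IMAGES="${arg#*=}" ;;
      --save_dir=*)     SAVE_DIR="${arg#*=}" ;;
      *) echo "unknown option: $arg" >&2; return 1 ;;
    esac
  done
  # Mirror the mandatory-dataset guard visible in the diff above.
  [ -n "$DATASET" ] || { echo "Error: --dataset is required" >&2; return 1; }
}

parse_args --prediction=output/net.h5ad --dataset=op --build_images=true
echo "DATASET: $DATASET"
echo "PREDICTION: $PREDICTION"
```

Centralizing the parsing in one function keeps the mandatory-argument check in a single place.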

scripts/single_grn_evaluation.sh

Lines changed: 0 additions & 139 deletions
This file was deleted.

src/api/comp_metric.yaml

Lines changed: 20 additions & 13 deletions
@@ -13,7 +13,15 @@ arguments:
     direction: input
   - name: --evaluation_data
     __merge__: file_evaluation_bulk_h5ad.yaml
-    required: true
+    required: false
+    direction: input
+  - name: --evaluation_data_sc
+    __merge__: file_evaluation_sc_h5ad.yaml
+    required: false
+    direction: input
+  - name: --evaluation_data_de
+    __merge__: file_evaluation_de_h5ad.yaml
+    required: false
     direction: input
   - name: --score
     __merge__: file_score_h5ad.yaml
@@ -27,14 +35,10 @@ arguments:
   - name: --max_n_links
     type: integer
     default: 50000
-  - name: --verbose
-    type: integer
-    default: 2
-    direction: input
   - name: --tf_all
     type: file
     direction: input
-    required: true
+    required: false
     example: resources_test/grn_benchmark/prior/tf_all.csv
   - name: --num_workers
     type: integer
@@ -44,14 +48,17 @@ arguments:
     type: boolean
     required: false
     default: true
-  - name: --apply_skeleton
-    type: boolean
-    required: false
-    default: false
-  - name: --skeleton
-    type: file
+  - name: --regulators_consensus
+    type: file
+    direction: input
+    must_exist: false
     required: false
-    example: resources_test/grn_benchmark/prior/skeleton.csv
+    example: resources_test/grn_benchmark/prior/regulators_consensus_op.json
+  - name: --reg_type
+    type: string
+    direction: input
+    default: ridge
+    description: name of regression to use
 
 
 test_resources:
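Taken together, the updated argument list could be satisfied by a Viash param file along these lines. This is a hypothetical sketch, not a file from the repo: the `evaluation_data` path is invented for illustration, while the `regulators_consensus` and `tf_all` examples come from the config above.

```yaml
# params.yaml (hypothetical) — inputs for one metric-component run
prediction: output/net.h5ad
evaluation_data: resources_test/grn_benchmark/evaluation_data/op_bulk.h5ad  # illustrative path; now optional
regulators_consensus: resources_test/grn_benchmark/prior/regulators_consensus_op.json
tf_all: resources_test/grn_benchmark/prior/tf_all.csv  # now optional
reg_type: ridge        # new argument, defaults to ridge
max_n_links: 50000
score: output/score.h5ad
```

Making the evaluation inputs optional lets a single metric API serve bulk, single-cell, and differential-expression evaluation data without requiring all three.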

src/api/comp_metric_regression.yaml

Lines changed: 0 additions & 17 deletions
This file was deleted.
