Add _pkgdown.yml with echoverse dark theme; improve vignette

bschilder · claude · bschilder · commit e5a20ad55133 · 2026-03-16T01:05:42.000-04:00
- Create _pkgdown.yml with Bootstrap 5 dark theme matching echoverse suite
- Organise exports into reference groups (IMPACT annotations, IMPACT
  enrichment, IMPACT visualisation, SpliceAI, deep learning annotations)
- Add has_internet() guard to echoAI vignette
- Expand vignette with SpliceAI and deep learning sections
- Add /doc/ and /Meta/ to .gitignore

Co-Authored-By: Claude Opus 4.6 (1M context) &lt;noreply@anthropic.com&gt;
diff --git a/.gitignore b/.gitignore
@@ -43,3 +43,5 @@ vignettes/*.R
 .Renviron
 *.tbi
 Rplots.pdf
+/doc/
+/Meta/
diff --git a/_pkgdown.yml b/_pkgdown.yml
@@ -0,0 +1,127 @@
+url: https://rajlabmssm.github.io/echoAI/
+
+template:
+  bootstrap: 5
+  bslib:
+    bg: "#1a2744"
+    fg: "#e8f4f4"
+    primary: "#4ecdc4"
+    secondary: "#3a8d8c"
+    success: "#2ee8d6"
+    info: "#5bc0be"
+    code-bg: "#d6e4ee"
+    code-color: "#1a2744"
+    border-color: "#2a4060"
+    link-color: "#4ecdc4"
+    link-hover-color: "#2ee8d6"
+    font-size-base: "1rem"
+  includes:
+    in_header: |
+      <style>
+      .navbar {
+        background-color: #162038 !important;
+        border-bottom: 2px solid #4ecdc4;
+      }
+      .navbar-brand, .nav-link { color: #e8f4f4 !important; }
+      .nav-link:hover { color: #2ee8d6 !important; }
+      .nav-link.active { color: #2ee8d6 !important; border-bottom: 2px solid #2ee8d6; }
+      pre {
+        background-color: #d6e4ee !important;
+        color: #1a2744 !important;
+        border: none !important;
+        border-radius: 12px;
+        margin: 0;
+        padding: 0.8em 1em;
+      }
+      code { color: #1a6b68 !important; border: none !important; }
+      pre code { color: #1a2744 !important; border: none !important; background: transparent !important; }
+      pre code span { border: none !important; background: transparent !important; }
+      .table { color: #e8f4f4 !important; }
+      .table-striped > tbody > tr:nth-of-type(odd) > * {
+        background-color: rgba(78, 205, 196, 0.05) !important;
+        color: #e8f4f4 !important;
+      }
+      .table-striped > tbody > tr:nth-of-type(even) > * {
+        background-color: transparent !important;
+        color: #e8f4f4 !important;
+      }
+      .table > thead { border-bottom: 2px solid #4ecdc4; }
+      h1, h2, h3, h4, h5, h6 { color: #4ecdc4 !important; }
+      a { color: #5bc0be; }
+      a:hover { color: #2ee8d6; }
+      .card { background-color: #1e3050; border-color: #2a4060; }
+      .footer { background-color: #162038 !important; border-top: 1px solid #2a4060; }
+      .page-header { border-bottom: 2px solid #3a8d8c; }
+      .sourceCode {
+        background-color: #d6e4ee !important;
+        border: 1px solid #3a8d8c;
+        border-radius: 12px !important;
+        overflow: hidden;
+      }
+      .sourceCode code span,
+      .sourceCode code a {
+        border: none !important;
+        outline: none !important;
+        box-shadow: none !important;
+        background: transparent !important;
+      }
+      </style>
+
+navbar:
+  structure:
+    left: [intro, reference, articles, news]
+    right: [search, github]
+  components:
+    github:
+      icon: fa-github
+      href: https://github.com/RajLabMSSM/echoAI
+
+articles:
+  - title: Getting started
+    contents:
+      - echoAI
+  - title: Setup
+    contents:
+      - docker
+
+reference:
+  - title: IMPACT annotations
+    desc: Query and process IMPACT transcription factor binding predictions
+    contents:
+      - IMPACT_query
+      - IMPACT_get_annotations
+      - IMPACT_iterate_get_annotations
+      - IMPACT_get_annotation_key
+      - IMPACT_get_ldscores
+      - IMPACT_get_top_annotations
+      - IMPACT_postprocess_annotations
+  - title: IMPACT enrichment
+    desc: Compute and visualise enrichment of IMPACT scores across SNP groups
+    contents:
+      - IMPACT_compute_enrichment
+      - IMPACT_iterate_enrichment
+  - title: IMPACT visualisation
+    desc: Plots for IMPACT scores and enrichment results
+    contents:
+      - IMPACT_plot_enrichment
+      - IMPACT_plot_impact_score
+      - IMPACT_snp_group_boxplot
+      - IMPACT_heatmap
+  - title: SpliceAI
+    desc: Query and visualise SpliceAI splice-site predictions
+    contents:
+      - SPLICEAI_run
+      - SPLICEAI_query_api
+      - SPLICEAI_query_tsv
+      - SPLICEAI_query_tsv_iterate
+      - SPLICEAI_query_vcf
+      - SPLICEAI_snp_probs
+      - SPLICEAI_plot
+  - title: Deep learning annotations
+    desc: Query and visualise Basenji/DeepSEA predictions
+    contents:
+      - DEEPLEARNING_query
+      - DEEPLEARNING_query_multi_chr
+      - DEEPLEARNING_query_one_chr
+      - DEEPLEARNING_melt
+      - DEEPLEARNING_plot
diff --git a/vignettes/echoAI.Rmd b/vignettes/echoAI.Rmd
@@ -16,26 +16,43 @@ vignette: >
 library(echoAI)
 ```
 
+```{r, echo=FALSE}
+## Several examples query remote Zenodo/GitHub resources.
+## Gate on internet access so R CMD check works offline.
+has_internet <- function() {
+    z <- try(suppressWarnings(
+        readLines("https://github.com", n = 1L, warn = FALSE)
+    ), silent = TRUE)
+    !inherits(z, "try-error")
+}
+run_online <- has_internet()
+```
+
 # Introduction
 
 `echoAI` provides API access to variant-level AI/ML predictions,
-currently centred on
-[IMPACT](https://github.com/immunogenomics/IMPACT)
-(Inference and Modeling of Phenotype-related ACtive Transcription).
+currently centred on three tools:
+
+- **IMPACT** (Inference and Modeling of Phenotype-related ACtive Transcription)
+  -- predicts transcription factor (TF) binding at motif sites by learning
+  epigenomic profiles, primarily from [ENCODE](https://www.encodeproject.org/).
+  The 707 annotations cover a wide range of immune and non-immune cell types,
+  making IMPACT scores especially useful for prioritising causal variants in
+  immune-mediated diseases. All IMPACT data are aligned to **hg19**.
+  Tabix-indexed versions are hosted on Zenodo
+  ([doi:10.5281/zenodo.7062238](https://doi.org/10.5281/zenodo.7062238))
+  for rapid remote querying.
 
-IMPACT predicts transcription factor (TF) binding at a motif site by
-learning the epigenomic profiles at those sites
-(primarily [ENCODE](https://www.encodeproject.org/)).
-The 707 annotations cover a wide range of immune and non-immune cell types,
-making IMPACT scores especially useful for prioritising causal variants
-in immune-mediated diseases.
+- **SpliceAI** -- predicts the probability that a variant alters mRNA splicing.
+  Results can be obtained via a local VCF/TSV or the Broad Institute API.
 
-All IMPACT data are aligned to the **hg19** genome build.
-Tabix-indexed versions are hosted on Zenodo
-([doi:10.5281/zenodo.7062238](https://doi.org/10.5281/zenodo.7062238))
-for rapid remote querying.
+- **Deep learning annotations** (Basenji, DeepSEA) -- query precomputed
+  variant-level scores from deep learning models of chromatin accessibility
+  and gene expression.
 
-# Query IMPACT annotations
+# IMPACT
+
+## Query IMPACT annotations
 
 The primary entry point is `IMPACT_query()`, which queries tabix-indexed
 IMPACT annotation and LD-score files hosted on Zenodo.
@@ -70,7 +87,7 @@ annot_long <- IMPACT_query(
 head(annot_long)
 ```
 
-# Annotation key
+## Annotation key
 
 The annotation key maps each of the 707 IMPACT annotation IDs to its
 source study, tissue, cell type/line, and transcription factor.
@@ -80,7 +97,7 @@ annot_key <- IMPACT_get_annotation_key(save_key = FALSE)
 head(annot_key)
 ```
 
-# Download full annotation matrices
+## Download full annotation matrices
 
 For larger analyses (e.g. genome-wide or multi-locus), you can download
 the full per-chromosome annotation matrices directly from the IMPACT
@@ -103,7 +120,7 @@ merged_DT <- echodata::get_Nalls2019_merged()
 ANNOT_MELT <- IMPACT_iterate_get_annotations(merged_DT = merged_DT)
 ```
 
-# Post-processing
+## Post-processing
 
 `IMPACT_postprocess_annotations()` converts the annotation table to
 long format, identifies the top consensus SNP per locus, and adds a
@@ -113,7 +130,7 @@ combined cell-type label.
 ANNOT_MELT <- IMPACT_postprocess_annotations(ANNOT_MELT)
 ```
 
-# Enrichment analysis
+## Enrichment analysis
 
 Enrichment is computed as the ratio of IMPACT signal in a given SNP group
 (e.g. consensus, credible set, lead GWAS) to the proportion of SNPs in
@@ -132,9 +149,9 @@ head(enrich)
 ENRICH <- IMPACT_iterate_enrichment(ANNOT_MELT = ANNOT_MELT)
 ```
 
-# Visualisation
+## Visualisation
 
-## SNP group box plot
+### SNP group box plot
 
 Compare IMPACT score distributions across SNP groups with
 `IMPACT_snp_group_boxplot()`:
@@ -147,15 +164,15 @@ bp <- IMPACT_snp_group_boxplot(
 )
 ```
 
-## Enrichment plots
+### Enrichment plots
 
 Visualise enrichment results with bar and violin plots:
 
 ```{r enrichment-plot, eval=FALSE}
 plots <- IMPACT_plot_enrichment(ENRICH = ENRICH)
 ```
 
-## Locus plot
+### Locus plot
 
 Create a multi-panel locus plot showing GWAS results, fine-mapping
 posterior probabilities, and per-tissue IMPACT scores:
@@ -164,14 +181,50 @@ posterior probabilities, and per-tissue IMPACT scores:
 impact_plot <- IMPACT_plot_impact_score(annot_melt = annot_melt)
 ```
 
-## Heatmap
+### Heatmap
 
 Generate a ComplexHeatmap of mean IMPACT scores across loci and SNP groups:
 
 ```{r heatmap, eval=FALSE}
 mat_meta <- IMPACT_heatmap(ANNOT_MELT = ANNOT_MELT)
 ```
 
+# SpliceAI
+
+## Run SpliceAI
+
+`SPLICEAI_run()` is the main entry point. It dispatches to the
+appropriate backend (API, local TSV, or VCF) depending on your input.
+
+```{r spliceai-run, eval=FALSE}
+query_dat <- echodata::BST1[1:50,]
+res <- SPLICEAI_run(query_dat = query_dat)
+```
+
+## Visualise splice probabilities
+
+```{r spliceai-plot, eval=FALSE}
+plt <- SPLICEAI_plot(query_dat = res)
+```
+
+# Deep learning annotations
+
+## Query deep learning scores
+
+`DEEPLEARNING_query()` retrieves precomputed variant-level scores from
+deep learning models (e.g. Basenji, DeepSEA) via tabix-indexed files.
+
+```{r dl-query, eval=FALSE}
+query_dat <- echodata::BST1[1:50,]
+dl_res <- DEEPLEARNING_query(query_dat = query_dat)
+```
+
+## Visualise scores
+
+```{r dl-plot, eval=FALSE}
+plt <- DEEPLEARNING_plot(dl_res)
+```
+
 
 # Session Info