Reproducible analysis companion for the published QuEStVar manuscript
Code, notebooks, and reproducible workflows for the QuEStVar manuscript and published analysis archive.
Overview • Manuscript • Standalone package • Repository structure • Getting started • Citation
This repository contains the code and data supporting the published manuscript Statistical Testing for Protein Equivalence Identifies Core Functional Modules Conserved across 360 Cancer Cell Lines and Presents a General Approach to Investigating Biological Systems. It preserves the analysis notebooks, manuscript-era source code, and reproducibility scaffolding used for the study.
Use this repository when you want to inspect, rerun, or audit the scientific analysis behind the paper. Use the standalone QuEStVar package when you want the maintained installable software for new analyses.
The repository covers three main analysis streams:
- Spike-in data analysis. Preprocessing and applying QuEStVar to a benchmark dataset to demonstrate the method in a controlled setting.
- Simulation studies. Evaluating QuEStVar's sample equivalence index metric against correlation under multiple simulated scenarios.
- Cancer cell line analysis. Using QuEStVar to explore quantitative protein stability and variability and identify conserved functional modules across 360 cancer cell lines.
- Title: Statistical Testing for Protein Equivalence Identifies Core Functional Modules Conserved across 360 Cancer Cell Lines and Presents a General Approach to Investigating Biological Systems
- Authors: Enes K. Ergin, Junia J. K. Myung, and Philipp F. Lange
- Journal: Journal of Proteome Research 2024, 23 (6), 2169-2185
- Resources:
This repository is the manuscript companion and analysis archive. The standalone packaged version of QuEStVar is available separately as the first stable release, v0.1.0, with its own repository, documentation, and PyPI distribution.
- GitHub repository: https://github.com/eneskemalergin/QuEStVar
- GitHub release: https://github.com/eneskemalergin/QuEStVar/releases/tag/v0.1.0
- PyPI package: https://pypi.org/project/questvar/
- Install command:
pip install questvar
The split matters because the environments are different. This manuscript repository is pinned to a reproducibility-oriented Python 3.9 stack in requirements.txt, while the standalone package targets modern packaged use and is maintained independently on PyPI and GitHub.
2022_Frohlich/- Contains the spike-in benchmark analysis notebooks. - IncludesNotebook_S1.ipynb,Notebook_S2.ipynb, andNotebook_S3.ipynb. - The working tree also expects analysis subfolders for data (raw,processed,results,supplementary) and figures.2022_Goncalves/- Contains the cancer cell line analysis notebooks. - IncludesNotebook_S4.ipynbthroughNotebook_S8.ipynbtogether with the corresponding analysis outputs used in the manuscript workflow.Misc/- Contains supporting notebooks such asNotebook_S9.ipynbandNotebook_S10.ipynb. - These document libraries, helper functions, and software-version context used during the study.questvar/- Contains the manuscript-era QuEStVar source code used directly by the notebooks in this repository.supp_notebooks/- Contains HTML-rendered notebook outputs generated from the Jupyter notebooks.nb_to_html.sh- Helper script for converting notebooks into HTML outputs.requirements.txt- Pinned Python dependencies for reproducing the manuscript analyses..gitignore,CITATION.cff,LICENSE,README.md- Repository metadata, citation metadata, ignore rules, licensing, and top-level documentation.
Note
The data and figures folders are intentionally ignored by git to avoid committing large artifacts. Raw input files that belong in the data/raw folders should be obtained from the Zenodo archive linked above.
-
Clone this repository.
-
Create a dedicated Python environment.
conda
conda create --name my_env python=3.9 conda activate my_env
venv
python3 -m venv my_env source my_env/bin/activate # On Windows use: my_env\Scripts\activate
-
Install the pinned manuscript dependencies.
pip install -r requirements.txt
If you prefer manual installation, install the packages listed in
requirements.txtindividually with eitherconda install <package_name>orpip install <package_name>. -
Explore the Jupyter notebooks in
2022_Frohlich/,2022_Goncalves/, andMisc/to follow the manuscript analyses end to end. -
Refer to
questvar/for the core statistical testing implementation used by the notebooks. -
Optionally generate HTML notebook exports with
nb_to_html.shwhen you want portable rendered outputs.
- The manuscript environment is pinned to Python 3.9.18 and the dependency versions listed in
requirements.txt. - The repository is designed around notebook-driven analysis. Large intermediate files and figures are expected to live outside version control.
- The
questvar/code in this repository reflects the manuscript analysis snapshot rather than the separately maintained installable package. - Notebook outputs and large derived artifacts are expected to be regenerated locally from the archived workflow and downloaded raw inputs.
If you use the manuscript analyses, data workflow, or the scientific framing of QuEStVar, cite the paper:
The repository also includes a root CITATION.cff file so GitHub and citation tools can surface the preferred manuscript citation directly.
@article{ergin2024questvar,
author = {Ergin, Enes K. and Myung, Junia J. K. and Lange, Philipp F.},
title = {Statistical Testing for Protein Equivalence Identifies Core Functional Modules Conserved across 360 Cancer Cell Lines and Presents a General Approach to Investigating Biological Systems},
journal = {Journal of Proteome Research},
year = {2024},
volume = {23},
number = {6},
pages = {2169--2185},
doi = {10.1021/acs.jproteome.4c00131}
}