This document is a quick navigation guide to the repository. It is intentionally shorter than IMPLEMENTATION_AUDIT.md: the goal here is to help readers find the right entrypoint quickly.
c3/: project-native codeopenrlhf/: vendored upstream training stack with C3 integrationsconfigs/: task, role, registry, analysis, and data-manifest configscripts/: reproducibility, data preparation, audit, and helper scriptsdocs/: release policy, code audit, provenance, and user-facing documentation
- configs/tasks/math.yaml
- configs/tasks/code.yaml
- c3/integration/marl_specs.py
- c3/integration/task_datasets.py
- c3/mas/rollout_generator.py
- openrlhf/trainer/ppo_utils/experience_maker.py
- c3/credit/c3/
- fast smoke: scripts/reproduce/smoke.sh
- data prep: scripts/data/prepare_all.sh
- training matrix: scripts/reproduce/paper_train.sh
- main-results sweep: scripts/reproduce/paper_main_results.sh
- analysis figures: scripts/reproduce/paper_analysis_figs.sh
- release audit: scripts/audit/pre_release.sh
- math reward entry: c3/envs/math/reward.py
- code reward entry: c3/envs/code/reward.py
- code executor: c3/envs/code/executor.py
- environment dispatch: c3/envs/registry.py
- MAPPO baseline: c3/algorithms/mappo.py
- MAGRPO baseline: c3/algorithms/magrpo.py
- C3 fallback: c3/algorithms/c3.py
- algorithm naming and normalization: c3/algorithms/registry.py
- main-results aggregation: c3/tools/main_results.py
- analysis aggregation: c3/tools/analysis_results.py
- plotting: c3/tools/plot_paper_figures.py
- analysis CLI: c3/analysis/c3_analysis.py
flowchart TD
Tasks["configs/tasks/*.yaml"] --> Loader["c3/integration/marl_specs.py"]
Loader --> Data["c3/integration/task_datasets.py"]
Loader --> MAS["c3/mas/rollout_generator.py"]
MAS --> Experience["openrlhf/trainer/ppo_utils/experience_maker.py"]
Experience --> Credit["c3/credit/c3/*"]
Credit --> PPO["openrlhf/trainer/ppo_trainer.py"]
The paper-facing C3 implementation is not centered on c3/algorithms/c3.py. That file exists for compatibility and fallback behavior. The primary node-level credit path is:
- dataset provenance and SHA pins: configs/data_manifest.yaml
- main-results registry: configs/main_results_registry.yaml
- analysis defaults: configs/analysis.yaml
- task configs: configs/tasks/
- role configs: configs/roles/
- paper-to-code mapping: IMPLEMENTATION_AUDIT.md
- release surface rules: RELEASE_POLICY.md
- data provenance and strict verification: DATA_SOURCES.md
- upstream provenance: UPSTREAM.md