Skip to content

Latest commit

 

History

History
90 lines (66 loc) · 4.16 KB

File metadata and controls

90 lines (66 loc) · 4.16 KB

Code Map

This document is a quick navigation guide to the repository. It is intentionally shorter than IMPLEMENTATION_AUDIT.md: the goal here is to help readers find the right entrypoint quickly.

Top-level layout

  • c3/: project-native code
  • openrlhf/: vendored upstream training stack with C3 integrations
  • configs/: task, role, registry, analysis, and data-manifest config
  • scripts/: reproducibility, data preparation, audit, and helper scripts
  • docs/: release policy, code audit, provenance, and user-facing documentation

Where to look first

I want to understand the paper path

  1. configs/tasks/math.yaml
  2. configs/tasks/code.yaml
  3. c3/integration/marl_specs.py
  4. c3/integration/task_datasets.py
  5. c3/mas/rollout_generator.py
  6. openrlhf/trainer/ppo_utils/experience_maker.py
  7. c3/credit/c3/

I want to run the repository

I want to understand the environments

I want to understand the baselines

I want to understand evaluation and paper tables

Core-path vs fallback-path

Core C3 path

flowchart TD
Tasks["configs/tasks/*.yaml"] --> Loader["c3/integration/marl_specs.py"]
Loader --> Data["c3/integration/task_datasets.py"]
Loader --> MAS["c3/mas/rollout_generator.py"]
MAS --> Experience["openrlhf/trainer/ppo_utils/experience_maker.py"]
Experience --> Credit["c3/credit/c3/*"]
Credit --> PPO["openrlhf/trainer/ppo_trainer.py"]
Loading

Important note

The paper-facing C3 implementation is not centered on c3/algorithms/c3.py. That file exists for compatibility and fallback behavior. The primary node-level credit path is:

Configuration single sources of truth

Related docs