feat: add scaffolding rollout workflow by WeiHaocheng · Pull Request #1064 · inclusionAI/AReaL

WeiHaocheng · 2026-03-19T14:03:18Z

Description

Related Issue

Fixes #(issue)

Type of Change

Checklist

I have read the Contributing Guide
Pre-commit hooks pass (pre-commit run --all-files)
Relevant tests pass; new tests added for new functionality
Documentation updated (if applicable; built with ./docs/build_all.sh)
Branch is up to date with main
Self-reviewed via /review-pr command
This PR was created by a coding agent via /create-pr
This PR is a breaking change

Breaking Change Details (if applicable):

Additional Context


gemini-code-assist · 2026-03-19T14:07:30Z

Warning

Gemini encountered an error creating the summary. You can try again by commenting /gemini summary.

…fixes

- Fix data race in ScaffoldingLlm by cloning controller synchronously before async handoff
- Fix sampling_params propagation in GSM8KScaffoldingWorkflow (delegate to parent build_scaffolding_llm)
- Simplify controllers.py by removing unused code paths
- Add chat_scaffolding example with YAML config
- Add 2-node GSM8K RLVR scaffolding config
- Simplify MathVerifyWorker (remove signal-based timeout)
- Increase Ray scheduler startup_timeout to 600s for large clusters

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
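The "clone synchronously before async handoff" fix can be illustrated with a toy sketch. It assumes the race came from several in-flight requests sharing one mutable controller; `ToyController`, `run_request`, and `submit_all` are hypothetical names for illustration and are not taken from the PR's code.

```python
import asyncio
import copy


class ToyController:
    """Hypothetical stand-in for a controller holding mutable per-request state."""

    def __init__(self):
        self.task_data = None

    def clone(self):
        return copy.deepcopy(self)


async def run_request(controller):
    # Yield to the event loop first, as a real generation call would.
    await asyncio.sleep(0)
    return controller.task_data


async def submit_all(prototype, items, clone_first):
    tasks = []
    for item in items:
        prototype.task_data = item
        # Cloning here, before the task is scheduled, snapshots this request's state.
        controller = prototype.clone() if clone_first else prototype
        tasks.append(asyncio.create_task(run_request(controller)))
    return await asyncio.gather(*tasks)


shared = asyncio.run(submit_all(ToyController(), ["q1", "q2", "q3"], clone_first=False))
cloned = asyncio.run(submit_all(ToyController(), ["q1", "q2", "q3"], clone_first=True))
print(shared)  # every request reads whatever was written last
print(cloned)  # each request reads its own snapshot
```

Without the clone, all three coroutines read the prototype only after the loop has finished overwriting it; with the clone, each one carries its own copy.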

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Revert controllers.py to scaffolding_pr state (no changes needed)
- Revert workflow.py arun_episode to original approach (set task_data on trajectory_maker before generate_async)
- Keep synchronous clone in scaffolding_llm.py but remove unused **kwargs

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…rams

Without this, NativeGenerationController() is created with empty sampling_params, so max_tokens/temperature/stop are never set on tasks. SGLang defaults to ~16 tokens, producing near-zero rewards.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Keep explicit controller/trajectory_maker construction but pass max_tokens and temperature from gconfig to NativeGenerationController. Without sampling_params, SGLang defaults to ~16 tokens output.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
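A rough sketch of the propagation problem these two commits describe, using stand-in classes only. `ToyTask`, `ToyGenerationController`, `ToyGConfig`, and the values 1024 and 1.0 are assumptions made for illustration; the real NativeGenerationController and gconfig may differ in fields and behavior.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToyTask:
    """Hypothetical generation task; None means 'fall back to the backend default'."""
    max_tokens: Optional[int] = None
    temperature: Optional[float] = None


class ToyGenerationController:
    """Hypothetical stand-in, not the real NativeGenerationController."""

    def __init__(self, sampling_params: Optional[dict] = None):
        self.sampling_params = dict(sampling_params or {})

    def make_task(self) -> ToyTask:
        task = ToyTask()
        # Copy only the keys the task actually has as fields.
        for key, value in self.sampling_params.items():
            if hasattr(task, key):
                setattr(task, key, value)
        return task


@dataclass
class ToyGConfig:
    """Hypothetical generation config with illustrative values."""
    max_new_tokens: int = 1024
    temperature: float = 1.0


gconfig = ToyGConfig()
unset = ToyGenerationController().make_task()
configured = ToyGenerationController(
    {"max_tokens": gconfig.max_new_tokens, "temperature": gconfig.temperature}
).make_task()
print(unset)       # fields stay None, so the backend default would apply
print(configured)  # fields carry the configured values
```

In this sketch an empty controller leaves every field unset, which matches the failure mode the commit message reports: the serving backend then applies its own short default output length.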

feat: improve scaffolding workflow with multi-worker support and bug fixes

garrett4wade

LGTM in general. Since the scaffolding module is relatively mature and not necessarily related to AReaL's core modules, I recommend moving all the new files into the examples directory. It will then be a complete, standalone example.

garrett4wade · 2026-04-07T06:03:38Z

+        if self.worker is None:
+            self._lazy_init_scaffolding(engine)


A new rollout workflow object is created for each trajectory. It will also create a ScaffoldingLlm object. Will this be very expensive?
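The cost question can be made concrete with a small sketch. It assumes the lazy-init guard lives on the workflow instance, as in the diff lines quoted above; `ToyWorkflow`, `ToyScaffoldingLlm`, and `run_episode` are hypothetical stand-ins, and the sketch does not claim which of the two cases applies to AReaL.

```python
class ToyScaffoldingLlm:
    """Hypothetical stand-in for an object that is expensive to build."""
    instances = 0

    def __init__(self, engine):
        ToyScaffoldingLlm.instances += 1
        self.engine = engine


class ToyWorkflow:
    def __init__(self):
        self.worker = None
        self.llm = None

    def _lazy_init_scaffolding(self, engine):
        self.worker = object()  # placeholder for the real worker
        self.llm = ToyScaffoldingLlm(engine)

    def run_episode(self, engine, data):
        # Same guard as in the reviewed diff: build only on first use.
        if self.worker is None:
            self._lazy_init_scaffolding(engine)
        return data


engine = object()

# Case 1: one workflow object reused across episodes.
reused = ToyWorkflow()
for item in range(3):
    reused.run_episode(engine, item)
built_when_reused = ToyScaffoldingLlm.instances

# Case 2: a fresh workflow object per episode.
ToyScaffoldingLlm.instances = 0
for item in range(3):
    ToyWorkflow().run_episode(engine, item)
built_when_recreated = ToyScaffoldingLlm.instances

print(built_when_reused, built_when_recreated)
```

The guard only saves work when the same workflow object serves many episodes; if a new object is created per trajectory, the expensive object is rebuilt each time unless it is cached somewhere that outlives the workflow.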

WeiHaocheng · 2026-04-11T15:19:04Z

LGTM in general. Since the scaffolding module is relatively mature and not necessarily related to AReaL's core modules, I recommend moving all the new files into the examples directory. It will then be a complete, standalone example.

Moved the code to the examples directory.

feat: add scaffolding rollout workflow

9356abf

luhongyu.4869 and others added 16 commits March 22, 2026 13:41

style: fix ruff lint and format issues

ce4e9d7

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: restore LLMJudgeController in scaffolding controllers

bdc452a

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chore: remove unrelated skill files from PR

d17d6fa

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: revert unintended changes to gsm8k.py and cache.py

36d8cc2

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: keep worker_logger.error instead of print in worker.py

3fcba6c

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: revert unintended change to ray scheduler timeout

8eb99f1

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: revert unintended changes to reward/__init__.py

612e4f7

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chore: remove chat_scaffolding example files from PR

74767b6

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: revert test and gsm8k example to scaffolding_pr state

ec564a8

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: revert README.md to scaffolding_pr state

1940ad2

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: pass empty kwargs to ScaffoldingRequest

f6c74cd

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Merge pull request #1 from WeiHaocheng/scaffolding-updates

e03a423

feat: improve scaffolding workflow with multi-worker support and bug fixes

garrett4wade reviewed Apr 7, 2026


garrett4wade added the reviewed label Apr 7, 2026

WeiHaocheng added 2 commits April 10, 2026 06:24

Move all scaffolding code into examples

2dde797

Move test code to example

249c9c3

small fix

18fb8cf

garrett4wade added the safe-to-test label (Ready to run unit-tests in a PR.) Apr 13, 2026


WeiHaocheng wants to merge 20 commits into inclusionAI:main from WeiHaocheng:scaffolding_pr


3 participants
