[follow up on #1259] gpu witnessgen flow by hero78119 · Pull Request #1303 · scroll-tech/ceno

hero78119 · 2026-04-13T07:20:06Z

summary of data life cycle during entire proving

During opcode assignment
- shard raw GPU state is uploaded / kept resident:
  - StepRecord
  - shard metadata
  - shared shard buffers as needed
- GPU still runs assignment-time kernels for side effects:
  - LK multiplicity
  - shardram / shared-circuit accumulation
- witness trace is not kept as an eager RMM anymore
- per-chip replay plan is recorded
During commit_traces
- no full witness set is resident up front
- for each trace, deferred commit does:
  - regenerate that chip’s witness/device backing from resident raw shard GPU state
  - commit that one trace
  - drop that transient witness before moving to the next trace
- after commit finishes, only raw shard GPU state remains resident
During per-chip proof
- before a chip task proves, replay regenerates that chip’s witness/device backing from raw shard GPU state
- chip proof uses it
- task-local witness is dropped after that chip finishes
- raw shard GPU state stays resident across all chip proofs
During PCS opening
- replay regenerates the needed witness/device backing again from raw shard GPU state
- opening uses it
- transient witness is dropped afterward
At shard end
- shard raw GPU state is released
- replay/session metadata is invalidated

So the intended steady-state invariant is:

persistent across shard proof:
- raw shard GPU state only
transient on demand:
- witness/device backing per trace / per chip / per opening step

Two nuances:

Initial assign still runs GPU kernels because side effects are needed then, but it no longer keeps eager witness RMMs in cache-none mode.
The remaining OOM is now later in chip proving, not in commit, which is consistent with this lifecycle shift.

…nvariants

hero78119 added 4 commits April 13, 2026 14:22

zkvm gpu: keep witness device-resident and enforce col-major commit i…

c8a5a69

…nvariants

update gkr-backend

e1c1082

align structural witness with normal witness

f026b64

fix build error

c46a574

hero78119 force-pushed the feat/gpu-witnessgen_flow branch from 7eaf64c to c46a574 Compare April 13, 2026 07:59

assert gpu vram release at the end of create_proof

aebf3b2

hero78119 force-pushed the feat/gpu-witnessgen_flow branch from 19a50f5 to aebf3b2 Compare April 13, 2026 09:41

set gpu device_backing even rmm generate in cpu

9c98a33

hero78119 force-pushed the feat/gpu-witnessgen_flow branch from ecc9007 to 0e99ebb Compare April 14, 2026 12:25

transient gpu polygroup when cache level is none

8ae8010

hero78119 force-pushed the feat/gpu-witnessgen_flow branch from 0e99ebb to 8ae8010 Compare April 14, 2026 13:10

hero78119 added 5 commits April 14, 2026 22:13

truncate to make rmm size match

490993a

update comment

e9a98c9

Refactor GPU witness replay and cache-none proving flow

0db05c3

Defer cache-none GPU witness materialization

33006fd

gpu skip assign shard meta info on non initial pass

e166b25

hero78119 force-pushed the feat/gpu-witnessgen_flow branch from 79b60ad to e166b25 Compare April 16, 2026 12:40

hero78119 added 4 commits April 16, 2026 21:00

update estimated memory formula

854027d

WIP fix estimate memory and cuda slice blowup problem

890500d

e2e pass local without OOM

3693654

more debug log

fdb277e

hero78119 force-pushed the feat/gpu-witnessgen_flow branch from d57f469 to fdb277e Compare April 17, 2026 02:14

wip: replayable shard_ram chip

fcd98f2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[follow up on #1259] gpu witnessgen flow#1303

[follow up on #1259] gpu witnessgen flow#1303
hero78119 wants to merge 17 commits intofeat/gpu-witnessgenfrom
feat/gpu-witnessgen_flow

hero78119 commented Apr 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

hero78119 commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

summary of data life cycle during entire proving

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

hero78119 commented Apr 13, 2026 •

edited

Loading