Add per-run profiling config for fine-grained Run() profiling by xiaofeihan1 · Pull Request #2152 · microsoft/onnxruntime-genai

xiaofeihan1 · 2026-05-09T08:22:42Z

Adds model.decoder.run_profiling to genai_config.json so users can profile selected session.Run() calls (e.g. only prefill, or only the N-th decode) without modifying Python code. The existing set_runtime_option('enable_profiling', ...) path already supports per- run profiling but requires API calls per experiment; this surfaces the same capability via config.

Schema:
"run_profiling": {
"enabled": true,
"output_prefix": "onnxruntime_run_profile",
"runs": "0"
}

runs DSL: comma-separated tokens, each one of N, A-B, A-, or .
"0" -> prefill only (default)
"1-" -> all decode steps
"0,5" -> prefill + 5th decode
"" -> every run

Filename: <output_prefix>_<ort_timestamp>.json. Run index is a per-Generator counter (0 = prefill, N = N-th decode). Coexists with session-level enable_profiling; both produce independent files.

Adds model.decoder.run_profiling to genai_config.json so users can profile selected session.Run() calls (e.g. only prefill, or only the N-th decode) without modifying Python code. The existing set_runtime_option('enable_profiling', ...) path already supports per- run profiling but requires API calls per experiment; this surfaces the same capability via config. Schema: "run_profiling": { "enabled": true, "output_prefix": "onnxruntime_run_profile", "runs": "0" } runs DSL: comma-separated tokens, each one of N, A-B, A-, or *. "0" -> prefill only (default) "1-" -> all decode steps "0,5" -> prefill + 5th decode "*" -> every run Filename: <output_prefix><idx>_<ort_timestamp>.json. Run index is a per-Generator counter (0 = prefill, N = N-th decode). Coexists with session-level enable_profiling; both produce independent files. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add per-run profiling config for fine-grained Run() profiling#2152

Add per-run profiling config for fine-grained Run() profiling#2152
xiaofeihan1 wants to merge 1 commit into
mainfrom
xiaofeihan/per-run-profiling-config

xiaofeihan1 commented May 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

xiaofeihan1 commented May 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant