feat(sdk): optimize tool definitions and prompts for efficient MCP workflows by tupizz · Pull Request #2722 · superdoc-dev/superdoc

tupizz · 2026-04-06T16:06:23Z

Summary

Optimizes SuperDoc MCP tool definitions, system prompts, and benchmark infrastructure for efficient agent workflows. Adds three new features to the format.apply mutation step and fixes markdown insert validation to support positioned inserts with BlockNodeAddress targets.

Results

NDA From-Scratch Creation (Claude Code + SuperDoc MCP)

Metric	Before	After	Improvement
Steps	58	13	-77%
Cost	$0.97	$0.24	-75%
Duration	217s	90s	-58%
MCP calls	56	9	-84%
Correctness	14/14	14/14	Same

Full Benchmark (96 runs, 8 providers x 12 tasks)

Condition	Pass Rate (before)	Pass Rate (after)
Codex-superdoc-mcp	67%	100%
CC-superdoc-mcp	42%	83%
Baseline/vendor	92%	92% (unchanged)

What changed

1. Tool definitions (`operation-definitions.ts`)

Updated tool descriptions in INTENT_GROUP_META to guide agents toward efficient patterns:

superdoc_edit: Now described as "the primary tool for inserting content." Agents are told to ALWAYS use markdown insert for headings/paragraphs, with context-driven formatting guidance. Description explains target + placement for positioned inserts.
superdoc_create: Redirects agents to markdown insert. Starts with "IMPORTANT: For headings and paragraphs, use superdoc_edit with type markdown instead."
superdoc_mutations: Documented create.heading, create.paragraph, create.table as supported step types. Added format.apply batch guidance.
superdoc_format: Directed agents to mutations format.apply for multi-item formatting.
superdoc_search: Clarified ref lifecycle (expires between tool calls, resolves automatically within mutations batch).

2. New features: `format.apply` extensions

Three new optional fields added to the format.apply mutation step:

alignment — Set paragraph alignment in the same step as inline formatting:

{
  "op": "format.apply",
  "args": {
    "inline": {"fontFamily": "Times New Roman", "fontSize": 12, "underline": true},
    "alignment": "center"
  }
}

Values: left, center, right, justify. Maps to OOXML justification on the parent paragraph node(s). Previously, alignment required a separate superdoc_format call since paragraphs.setAlignment is not a valid mutations step op.

scope: "block" — Expand formatting to cover entire paragraph(s), not just matched text:

{
  "op": "format.apply",
  "where": {"by": "select", "select": {"type": "text", "pattern": "short prefix"}, "require": "first"},
  "args": {"inline": {"fontSize": 12}, "scope": "block"}
}

Solves the problem where text pattern selectors only match a substring, leaving the rest of the paragraph with default markdown formatting. With scope: "block", a short identifying prefix is enough to format the whole block.

minProperties: 1 on args schema — Prevents agents from sending empty args: {}.

Files changed:

packages/document-api/src/types/mutation-plan.types.ts — StyleApplyStep.args type
packages/document-api/src/contract/schemas.ts — JSON schema for format.apply step
packages/super-editor/src/editors/v1/document-api-adapters/plan-engine/executor.ts — applyAlignmentToRange() helper, expandToBlockBoundaries() helper, updated executeStyleApply and executeSpanStyleApply
packages/super-editor/src/editors/v1/document-api-adapters/plan-engine/register-executors.ts — Guard for optional inline
packages/super-editor/src/editors/v1/document-api-adapters/plan-engine/paragraphs-wrappers.ts — Exported ALIGNMENT_TO_JUSTIFICATION (shared constant)

3. Markdown insert: `placement` + `BlockNodeAddress` target

Fixed validation to allow markdown/html inserts with placement and BlockNodeAddress targets:

{
  "action": "insert",
  "type": "markdown",
  "target": {"kind": "block", "nodeType": "paragraph", "nodeId": "54A21B3C"},
  "placement": "before",
  "value": "# Executive Summary\n\nThis agreement..."
}

Previously, placement was rejected for any input with value, and BlockNodeAddress targets were only accepted for structural (content) inserts. Now markdown/html inserts route through the structural insert path which supports both.

Files changed:

packages/document-api/src/insert/insert.ts — Validation accepts placement + BlockNodeAddress for markdown/html. Added RichContentInsertInput type export.
packages/document-api/src/index.test.ts — Tests for new validation behavior
packages/super-editor/src/editors/v1/document-api-adapters/plan-engine/plan-wrappers.ts — insertStructuredInner handles BlockNodeAddress targets via resolveStructuralInsertTarget + resolvePlacement

4. System prompts

Updated all prompt surfaces with context-driven formatting guidance:

system-prompt-mcp-header.md: Efficient patterns section with markdown insert, scope: "block", and alignment examples. "When to use which tool" guide.
system-prompt-core.md: 3-step insert workflow (understand context → insert markdown → format in one batch). Context-driven reasoning: "What kind of document is this? How are titles styled?"
claude-code-agent.mjs / codex-agent.mjs: Same patterns for benchmark providers.

5. Benchmark and eval infrastructure

ENABLE_TOOL_SEARCH: 'auto:5' — Tool schemas loaded on-demand (saves ~57KB per turn)
maxTurns: 35 for Claude Code (up from 20)
New NDA fixture documents and interactive DOCX output reviewer script
Removed debug console.log from claude-code-agent provider

6. Other changes

examples/collaboration/ai-node-sdk/Makefile — Always rebuilds CLI binary on make dev-local (prevents stale binary)
apps/docs/ai/ — Updated MCP how-to-use guide and agent best practices with efficient patterns

How the optimizations work

Before (old pattern: 45+ calls for NDA)

For each heading:
  1. superdoc_create(heading)           ← create one block
  2. superdoc_get_content(blocks)       ← re-fetch to get ref
  3. superdoc_search("heading text")    ← search to get text ref
  4. superdoc_format(inline, color)     ← format color
  5. superdoc_format(set_alignment)     ← format alignment
= 5 calls per heading × 8 headings = 40+ calls

After (new pattern: 3 calls)

1. superdoc_get_content(blocks)                      ← read + understand context
2. superdoc_edit(insert, type: "markdown", target, placement)  ← ALL structure in one call
3. superdoc_mutations(format.apply steps with alignment + scope: "block")  ← ALL formatting in one batch

Why it works

All claims verified against engine source:

mdastToProseMirror.ts:170-184: Markdown # creates proper Heading1 styleId
register-executors.ts:410-416: create.heading/paragraph wired in plan engine
compiler.ts:505-595: Selectors resolve at compile time
executor.ts:807-870: applyAlignmentToRange + expandToBlockBoundaries for alignment and scope
plan-wrappers.ts:968-988: BlockNodeAddress targets resolved via resolveStructuralInsertTarget + resolvePlacement

Test plan

1354 document-api tests pass
11,123 super-editor tests pass
super-editor builds clean
generate:all completes
SDK and MCP server rebuilt
New tests for placement validation (markdown + BlockNodeAddress)
NDA creation: 13 steps, $0.24, 14/14 checks (down from 58 steps, $0.97)
Full benchmark: Codex-superdoc-mcp 100%, CC-superdoc-mcp 83%
Executive summary insert verified with 3-call pattern (read + insert + format)

- superdoc_edit: emphasize markdown insert for multi-section creation - superdoc_create: direct to markdown/mutations for multiple items - superdoc_mutations: document create steps and batch format pattern - superdoc_format: direct to mutations for multi-item formatting - superdoc_search: clarify ref lifecycle within vs across batches - system-prompt: add efficient document creation workflow

…ints - Update provider SUPERDOC_SYSTEM_PROMPT with markdown insert and mutations batch examples (what CC actually reads as system prompt) - Update Codex AGENTS.md with same efficient patterns - Update MCP header prompt with "when to use which tool" guide - Increase CC maxTurns from 20 to 35 (both CC failures were at 21) - Regenerate SDK artifacts and rebuild MCP server

…plyInput types

…ional inline in step executors

…pply step

…ph formatting

…arkdown inserts

…put reviewer

…deduplicate alignment constant

…t type, deduplicate alignment constant" This reverts commit 4c04ebd.

…onstant

…sponse

…content

…cks.list

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: c5a322fce2

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-04-06T23:31:12Z

evals/providers/claude-code-agent.mjs

+
+Use superdoc_edit with type "markdown" to create ALL structure in one call:
+
+superdoc_edit({action: "insert", type: "markdown", placement: "end", value: "# Heading 1\\n\\nParagraph text...\\n\\n# Heading 2\\n\\nMore text..."})


Use a valid placement literal in benchmark agent guidance

The prompt example hard-codes placement: "end", but insert validation only accepts before, after, insideStart, or insideEnd; "end" is rejected as INVALID_INPUT. When the benchmark agent follows this example (and the same one in evals/providers/codex-agent.mjs), its first markdown insert fails and burns turns/cost for recoverable errors.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-06T23:31:12Z

packages/document-api/src/contract/operation-definitions.ts

-      'Do NOT mix text mutations and formatting in the same call.',
+      'Execute multiple operations atomically in one batch. Use this for any workflow needing 2+ changes. ' +
+      'Supported step types: text (text.rewrite, text.insert, text.delete), format (format.apply), create (create.heading, create.paragraph, create.table), assert. ' +
+      'Each step has an id, an op, a "where" clause for targeting ({by:"select", select:{...}, require:"first"|"exactlyOne"|"all"|"last"} or {by:"ref", ref:"..."}), and "args" with operation-specific parameters. ' +


Remove unsupported require:"last" from mutations tool description

This description advertises require:"last" as a valid where cardinality, but the actual mutation schema/types only allow first | exactlyOne | all. Agents relying on the tool description can emit plans that fail schema validation before execution, which is especially costly in MCP loops where retries consume extra calls.

Useful? React with 👍 / 👎.

tupizz added 2 commits April 6, 2026 12:55

superdoc-bot bot added the review: quick label Apr 6, 2026

tupizz added 3 commits April 6, 2026 13:17

feat(evals): enable tool search to reduce token overhead

dc427eb

docs(ai): add markdown insert pattern and formatting guidance

091da29

docs(ai): add efficient patterns to MCP how-to-use guide

2c94377

tupizz changed the title ~~Tadeu/sdk tool definitions update~~ feat(sdk): optimize tool definitions and prompts for efficient MCP workflows Apr 6, 2026

tupizz added 21 commits April 6, 2026 14:00

fix(evals): remove debug console.log that dumped every SDK message

9f65e2c

feat(document-api): add alignment field to StyleApplyStep and StyleAp…

13ba3a6

…plyInput types

fix(document-api): keep inline required on StyleApplyInput, guard opt…

b67b909

…ional inline in step executors

feat(document-api): add alignment to format.apply step JSON schema

98a1609

feat(super-editor): support alignment in format.apply mutation step

69f45cf

docs(sdk): update tool descriptions to show alignment inside format.a…

9aa655b

…pply step

feat(document-api): add scope: block to format.apply for full-paragra…

145693d

…ph formatting

feat(document-api): allow placement and BlockNodeAddress target for m…

64cab87

…arkdown inserts

chore: regenerate SDK artifacts and docs from updated contract

c3395d6

feat(evals): add new NDA documents and implement interactive DOCX out…

1e4e294

…put reviewer

fix: address PR review — minProperties, RichContentInsertInput type, …

4c04ebd

…deduplicate alignment constant

Revert "fix: address PR review — minProperties, RichContentInsertInpu…

ee17a46

…t type, deduplicate alignment constant" This reverts commit 4c04ebd.

fix(document-api): add minProperties, type export, shared alignment c…

e95f430

…onstant

docs(sdk): require fontSize on headings after markdown insert

16d6a52

docs(sdk): context-driven formatting guidance for markdown inserts

8da40b0

docs(sdk): only set properties explicitly present in document blocks

e61a515

feat(super-editor): resolve default fontSize in get_content blocks re…

f39f127

…sponse

fix(super-editor): fallback to 10pt default when styles omit fontSize

739a18f

fix(super-editor): resolve fontSize per-block via style chain in get_…

20a1341

…content

test(super-editor): add fontSize style chain resolution tests for blo…

e0b7e23

…cks.list

docs(sdk): guide agents to match uppercase title conventions

c5a322f

tupizz marked this pull request as ready for review April 6, 2026 23:24

tupizz merged commit 92c0299 into andrii/sd-2451-refactor-mcp-set-up Apr 6, 2026
41 checks passed

tupizz deleted the tadeu/sdk-tool-definitions-update branch April 6, 2026 23:24

chatgpt-codex-connector bot reviewed Apr 6, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sdk): optimize tool definitions and prompts for efficient MCP workflows#2722

feat(sdk): optimize tool definitions and prompts for efficient MCP workflows#2722
tupizz merged 26 commits intoandrii/sd-2451-refactor-mcp-set-upfrom
tadeu/sdk-tool-definitions-update

tupizz commented Apr 6, 2026 •

edited

Loading

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Apr 6, 2026

Uh oh!

chatgpt-codex-connector bot Apr 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant


		Use superdoc_edit with type "markdown" to create ALL structure in one call:

		superdoc_edit({action: "insert", type: "markdown", placement: "end", value: "# Heading 1\\n\\nParagraph text...\\n\\n# Heading 2\\n\\nMore text..."})

Conversation

tupizz commented Apr 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Results

NDA From-Scratch Creation (Claude Code + SuperDoc MCP)

Full Benchmark (96 runs, 8 providers x 12 tasks)

What changed

1. Tool definitions (operation-definitions.ts)

2. New features: format.apply extensions

3. Markdown insert: placement + BlockNodeAddress target

4. System prompts

5. Benchmark and eval infrastructure

6. Other changes

How the optimizations work

Before (old pattern: 45+ calls for NDA)

After (new pattern: 3 calls)

Why it works

Test plan

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

tupizz commented Apr 6, 2026 •

edited

Loading

1. Tool definitions (`operation-definitions.ts`)

2. New features: `format.apply` extensions

3. Markdown insert: `placement` + `BlockNodeAddress` target