Skip to content

feat(run): propagate control failureClass onto canonical RunRecord (agent-eval 0.54.0)#71

Merged
tangletools merged 2 commits into
mainfrom
feat/canonical-failureclass-propagation
May 28, 2026
Merged

feat(run): propagate control failureClass onto canonical RunRecord (agent-eval 0.54.0)#71
tangletools merged 2 commits into
mainfrom
feat/canonical-failureclass-propagation

Conversation

@tangletools
Copy link
Copy Markdown
Contributor

Summary

  • Bumps @tangle-network/agent-eval ^0.52.0^0.54.0 and consumes the new canonical cross-agent RunRecord.failureClass.
  • The control layer already classifies failures (ControlRunResult.failureClass) but it never reached the canonical record — so substrate failure aggregation was starved on the runtime side. This promotes it at the projection boundary.
  • applyRunRecordDefaults() stamps scenarioId + failureClass without overriding anything the adapter set, and promotes only real FAILURE_CLASSES taxonomy tags (a non-taxonomy string like forge_build_unsatisfied stays unclassified rather than polluting the canonical key).

Why

Closes the runtime half of the unified failure-classification architecture (agent-eval 0.54.0): every agent's failures now aggregate in ONE vocabulary, making "which failure dominates across the whole fleet" answerable, with per-agent detail preserved in failureMode.

Test plan

  • pnpm typecheck clean
  • pnpm test — 374/374 (4 new applyRunRecordDefaults unit tests: promote canonical / don't-override-adapter / reject-non-taxonomy / no-failure)
  • pnpm build clean

drewstone added 2 commits May 28, 2026 07:30
…gent-eval 0.54.0)

agent-runtime built RunRecords but never set the structured failure tag, starving
the substrate's cross-agent failure aggregation. The control layer already
classifies failures (ControlRunResult.failureClass) — this promotes that onto the
canonical RunRecord.failureClass key (agent-eval 0.54.0) at the projection
boundary, so failures aggregate across every agent in ONE taxonomy vocabulary.

- bump @tangle-network/agent-eval ^0.52.0 -> ^0.54.0
- applyRunRecordDefaults(): stamps scenarioId + failureClass without overriding
  anything the adapter set; promotes ONLY real FAILURE_CLASSES taxonomy tags
  (a non-taxonomy string like "forge_build_unsatisfied" stays unclassified, not
  polluting the canonical key)
- extracted as a pure exported helper + 4 unit tests (promote / don't-override /
  reject-non-taxonomy / no-failure)

typecheck clean, 374/374.
@tangletools tangletools merged commit 2863d9b into main May 28, 2026
1 check passed
@tangletools tangletools deleted the feat/canonical-failureclass-propagation branch May 28, 2026 13:52
tangletools pushed a commit that referenced this pull request May 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants