Commit eafd27b

christso and claude committed
debug(ci): remove tee pipe and limit to 2 eval sets for debugging
The tee pipe was truncating output, so the summary never appeared. Temporarily limit to 2 eval sets to verify that the summary prints.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent d8c9f8d commit eafd27b

1 file changed

Lines changed: 5 additions & 5 deletions

.github/workflows/evals.yml

@@ -6,7 +6,7 @@ on:
       suite_filter:
         description: "Comma-separated glob patterns for eval files to run"
         required: false
-        default: "evals/**/*.eval.yaml,examples/**/*.eval.yaml,examples/**/*.EVAL.yaml,examples/**/EVAL.yaml"
+        default: ""
       target:
         description: "Optional target override (leave empty to use each eval's own target)"
         required: false
@@ -60,7 +60,7 @@ jobs:
       - name: Resolve inputs
         id: filter
         env:
-          DEFAULT_PATTERNS: "evals/**/*.eval.yaml,examples/**/*.eval.yaml,examples/**/*.EVAL.yaml,examples/**/EVAL.yaml"
+          DEFAULT_PATTERNS: "examples/features/assert/evals/*.eval.yaml,examples/features/deterministic-evaluators/evals/*.eval.yaml,examples/features/agent-skills-evals/*.EVAL.yaml"
           # Exclude evals that need local scripts or multiple agent targets.
           # Negation patterns (!glob) are supported by the CLI.
           # multi-model-benchmark: needs multiple agents
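
Note on the two hunks above: with the `suite_filter` default now empty, the "Resolve inputs" step presumably falls back to `DEFAULT_PATTERNS` when no filter is supplied. The step's script is not shown in this diff, so the following is only a minimal sketch of that fallback, assuming a plain empty-string check. The function names `resolve_patterns` and `split_patterns` are hypothetical and do not come from the workflow.

```shell
# Hypothetical helper: use the caller-supplied filter, or fall back to the defaults.
# ${1:-$2} expands to $2 when $1 is unset or empty.
resolve_patterns() {
  printf '%s\n' "${1:-$2}"
}

# Hypothetical helper: split a comma-separated list into one pattern per line.
# The patterns are only printed, never expanded against the filesystem.
split_patterns() {
  printf '%s\n' "$1" | tr ',' '\n'
}

DEFAULT_PATTERNS="examples/features/assert/evals/*.eval.yaml,examples/features/deterministic-evaluators/evals/*.eval.yaml,examples/features/agent-skills-evals/*.EVAL.yaml"

# An empty suite_filter (the new input default) resolves to DEFAULT_PATTERNS.
resolved="$(resolve_patterns "" "$DEFAULT_PATTERNS")"
split_patterns "$resolved"
```

Keeping the fallback in one place means the temporary two-eval-set limit can be reverted by editing `DEFAULT_PATTERNS` alone.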
@@ -98,10 +98,10 @@ jobs:
             --threshold ${{ steps.filter.outputs.threshold }} \
             --output .agentv/ci-results/junit.xml \
             --benchmark-json .agentv/ci-results/benchmark.json \
-            --artifacts .agentv/ci-results/artifacts \
-            2>&1 | tee .agentv/ci-results/eval-output.log
+            --artifacts .agentv/ci-results/artifacts
+          EXIT_CODE=$?
 
-          echo "exit_code=${PIPESTATUS[0]}" >> "$GITHUB_OUTPUT"
+          echo "exit_code=$EXIT_CODE" >> "$GITHUB_OUTPUT"
 
       - name: Post eval summary
         if: always()
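
Note on the last hunk: GitHub Actions runs `run:` steps on Linux runners with `bash -e` by default, so once the `| tee` pipe is gone, a non-zero exit from the eval command would end the script before `EXIT_CODE=$?` executes, unless errexit is disabled earlier in the step (the hunk does not show whether it is). A minimal sketch of an errexit-safe capture follows. `run_evals` is a hypothetical stand-in for the real CLI invocation, and the `mktemp` fallback exists only so the sketch runs outside a runner.

```shell
set -e  # mirror the errexit behaviour of the runner's default shell

# Hypothetical stand-in for the eval CLI; it fails with status 3.
run_evals() {
  echo "running evals"
  return 3
}

# Outside GitHub Actions, GITHUB_OUTPUT is unset, so fall back to a temp file.
GITHUB_OUTPUT="${GITHUB_OUTPUT:-$(mktemp)}"

# A command on the left of || is exempt from errexit, so the script
# continues and the failing status is still recorded.
EXIT_CODE=0
run_evals || EXIT_CODE=$?

echo "exit_code=$EXIT_CODE" >> "$GITHUB_OUTPUT"
```

With this form the `exit_code` output is written whether the command passes or fails, which is what the later "Post eval summary" step (guarded by `if: always()`) appears to rely on.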
