Improve evaluate-pr-tests workflow: slash_command + workflow_dispatch, security hardening by PureWeen · Pull Request #34678 · dotnet/maui

PureWeen · 2026-03-26T16:46:38Z

Description

Overhauls the copilot-evaluate-tests gh-aw workflow — switches to on-demand triggers only (/evaluate-tests slash command + manual workflow_dispatch), adds security hardening, and improves error handling. No auto-runs on PR create/update.

What Changed

Triggers (on-demand only — no auto-runs)

Add slash_command: evaluate-tests — comment /evaluate-tests on a PR to trigger
Keep workflow_dispatch — manual trigger from Actions tab with PR number input
Disable pull_request_target — no auto-evaluation on PR create/update
Add bots: ["copilot-swe-agent[bot]"] — Copilot-authored PRs can be evaluated
Add labels: ["pr-review", "testing"] — workflow runs are labeled

Gate step (fast-fail for invalid requests)

Check PR is OPEN before evaluating (rejects closed/merged PRs with clear message)
Check for test source files in diff before spinning up agent
Fall back to REST API for PRs with 300+ files (where gh pr diff returns HTTP 406)
All API errors surfaced with clear messages — no silent masking
exit 1 stops the workflow immediately — no wasted agent compute

Access gating (Checkout-GhAwPr.ps1)

Reject fork PRs (isCrossRepository check)
Verify PR author has write access (admin/write/maintain roles)
Fix ConvertFrom-Json ordering — check exit code before JSON parsing
Make infrastructure restore fatal on failure (was soft warning)
Remove pre-delete pattern — git checkout overwrites in-place

Workflow improvements

hide-older-comments: true — previous evaluations auto-collapse
report-as-issue: false for noop — no issue created when nothing to evaluate
Timeout bumped 15 → 20 minutes
Dry-run mode via suppress_output input (workflow_dispatch only)
Gather-TestContext.ps1 now receives -PrNumber parameter

Security documentation (gh-aw-workflows.instructions.md)

Add "Before You Build" anti-patterns table — prefer built-in gh-aw features
Add Security Boundaries section with defense layers table
Add Rules for gh-aw Workflow Authors (DO/DON'T list)
Document COPILOT_TOKEN exposure and mitigations
Add slash_command to fork behavior table
Update Checkout-GhAwPr.ps1 description to match current behavior

Trigger Behavior

Trigger	When it fires	Who can trigger
`/evaluate-tests` comment	Comment on a PR	Write-access collaborators + copilot-swe-agent[bot]
`workflow_dispatch`	Actions tab → "Run workflow" → enter PR number	Write-access collaborators
~~`pull_request_target`~~	~~Auto on PR create/update~~	~~Disabled~~

Security Model

Based on GitHub Security Lab guidance:

PR contents treated as passive data (read/analyze, never built or executed)
Agent runs in sandboxed container with GITHUB_TOKEN and gh CLI scrubbed
Write operations in separate safe_outputs job (not the agent)
Agent output limited to max: 1 comment via safe-outputs
Checkout-GhAwPr.ps1 rejects fork PRs and verifies write access before checkout
Infrastructure restore is fatal on failure — prevents running with untrusted infra

Validation

Test	PR	Result
Open PR with tests	#34983	✅ Full success (gate → checkout → agent → comment)
No-test PR	#34876	✅ Gate fast-fail ("no test source files")
Merged PR	#34932	✅ Gate fast-fail ("MERGED — skipping")

Known Limitations

Fork PRs via /evaluate-tests can supply modified .github/skills/ — accepted residual risk (agent sandboxed, output bounded). Tracked as gh-aw#18481
exit 1 in gate step shows ❌ in GitHub checks for no-test/closed PRs — intentional (no built-in "skip" mechanism in gh-aw steps)
pull_request_target commented out — can be re-enabled later for auto-evaluation

- Change trigger from pull_request to pull_request_target so fork PRs have access to secrets (COPILOT_GITHUB_TOKEN) - Add roles: all to allow fork contributors (who have read permission) to trigger the workflow - Remove forks: ["*"] (not needed with pull_request_target) - Remove ready_for_review type (not supported by gh-aw for pull_request_target) - Update if condition and gate step to reference pull_request_target Validated on PureWeen/maui: - Same-repo PR: all green (run 23603776593) - Fork PR via workflow_dispatch: all green (run 23605610535) - Fork PR via pull_request_target: all green (run 23606033617) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

github-actions · 2026-03-26T16:46:49Z

🚀 Dogfood this PR with:

⚠️ WARNING: Do not do this without first carefully reviewing the code of this PR to satisfy yourself it is safe.

curl -fsSL https://raw.githubusercontent.com/dotnet/maui/main/eng/scripts/get-maui-pr.sh | bash -s -- 34678

Or

Run remotely in PowerShell:

iex "& { $(irm https://raw.githubusercontent.com/dotnet/maui/main/eng/scripts/get-maui-pr.ps1) } 34678"

Copilot

Pull request overview

Updates the gh-aw Evaluate PR Tests workflow triggers so fork PRs can be evaluated (by switching from pull_request to pull_request_target) while keeping the workflow’s gating/conditions aligned with the new event.

Changes:

Switched workflow trigger from pull_request to pull_request_target and updated related if: conditions.
Updated the gate step condition to run under pull_request_target.
Updated the compiled .lock.yml to reflect the new trigger configuration.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
`.github/workflows/copilot-evaluate-tests.md`	Moves evaluation to `pull_request_target` and updates workflow conditions/gate accordingly.
`.github/workflows/copilot-evaluate-tests.lock.yml`	Regenerated compiled workflow reflecting the trigger/condition changes.

Copilot · 2026-03-26T16:50:17Z

+  pull_request_target:
    paths:
    - src/**/tests/**
    - src/**/test/**
    types:
    - opened
    - synchronize
    - reopened
-    - ready_for_review
+  # roles: all # Roles processed as role check in pre-activation job
  workflow_dispatch:


The compiled lock file shows roles: all commented out under the on: block, which indicates the roles setting was not applied. This also coincides with the removal of the pre_activation/membership check job, so the workflow no longer gates who can run it. Fix by moving roles: to the correct top-level location in the .md frontmatter and re-compiling so the lock file includes the intended role check behavior.

Copilot · 2026-03-26T16:50:17Z

-    needs: pre_activation
    if: >
-      (needs.pre_activation.outputs.activated == 'true') && ((github.event_name == 'pull_request' && github.event.pull_request.draft == false) || github.event_name == 'workflow_dispatch' || (github.event_name == 'issue_comment' &&
+      (github.event_name == 'pull_request_target' && github.event.pull_request.draft == false) || github.event_name == 'workflow_dispatch' || (github.event_name == 'issue_comment' &&


jobs.activation no longer depends on a role/membership gate (needs: pre_activation and the needs.pre_activation.outputs.activated check are gone). With pull_request_target, this means the workflow can run with secrets for any matching PR/comment, which is a significant security/cost exposure. After fixing the roles placement in the .md, ensure the compiled lock restores the gating (or add an explicit guard) before allowing activation to proceed.

Suggested change

(github.event_name == 'pull_request_target' && github.event.pull_request.draft == false) || github.event_name == 'workflow_dispatch' || (github.event_name == 'issue_comment' &&

(github.event_name == 'pull_request_target' && github.event.pull_request.draft == false && github.event.pull_request.head.repo.fork == false) || github.event_name == 'workflow_dispatch' || (github.event_name == 'issue_comment' &&

The workflow_dispatch step runs with GITHUB_TOKEN and checks out PR code. Restrict it to only process PRs from authors with write/maintain/admin access, preventing checkout of untrusted fork code in a privileged context. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Move the PR author permission check from inline workflow bash into the shared Checkout-GhAwPr.ps1 script. Any gh-aw workflow using this script now automatically gates on the PR author having write/maintain/admin access before checking out code. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Fork PRs are handled by pull_request_target (platform checkout in sandboxed container). The workflow_dispatch path should only process same-repo PRs from authors with write access. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Restoring only skills/, instructions/, and copilot-instructions.md left other .github/ subdirs (pr-review/, scripts/, workflows/) from the PR branch. Restore the entire .github/ directory for complete coverage. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Instead of deleting .github/ and restoring from main, merge the base branch into the PR branch after checkout. This produces the same state as a pull_request merge commit: PR changes + latest main. If the PR modifies a skill, the PR version wins; otherwise main's version is used. This lets contributors iterate on skills via workflow_dispatch while keeping everything else current. On merge conflict, falls back to the PR branch as-is with a warning. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- pull_request_target: only auto-runs for OWNER/MEMBER/COLLABORATOR - issue_comment: /evaluate-tests only accepted from OWNER/MEMBER/COLLABORATOR - workflow_dispatch: unchanged - External PRs require maintainer /evaluate-tests comment to trigger Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Revert merge strategy to targeted git checkout (works in shallow clones) - Remove roles:all, restore gh-aw pre_activation with write-level checks - Remove author_association from if: (gh-aw handles access gating) - Update fork fallback message to remove stale workflow_dispatch advice Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Add suppress_comment input for workflow_dispatch dry-run (evaluate without posting comment) - Add explicit noop guidance so the agent uses it instead of silently exiting - Update posting results section to respect dry-run mode Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Prevents silent fork check bypass when gh returns empty/malformed JSON — $null.isFork evaluates to $false in PowerShell, which would let the fork check pass incorrectly. Note: ready_for_review cannot be added to pull_request_target types yet — gh-aw compiler doesn't include it in the allowed type list. Filed as a known gap. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Fix inaccurate claim that agent has 'no ability to access secrets' — COPILOT_TOKEN is present via --env-all, defended by firewall + redaction + threat detection - Add Security Boundaries section with principles from GitHub Security Lab's pwn-request guidance - Add defense layers table documenting what each layer does/doesn't do - Add explicit rules for workflow authors (DO/DON'T) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot-authored PRs are created by copilot[bot] which doesn't have write collaborator access. The bots: allowlist lets the pre_activation membership check pass for this known bot actor. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Move permission check before fork check — fork PRs from collaborators with write access should be checked out and evaluated. Only block PRs from authors without write access (exit 0, not exit 1 — it's a skip, not an error). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

jeffhandley · 2026-03-30T23:48:51Z

+If there is nothing to evaluate (PR has no test files, PR is a docs-only change, etc.), you **must** call the `noop` tool with a message explaining why:
+
+```json
+{"noop": {"message": "No action needed: [brief explanation, e.g. 'PR contains no test files']"}}


You might want to configure to not generate a no-op run report issue (within the frontmatter).

https://github.github.com/gh-aw/patterns/monitoring/#no-op-run-reports

jeffhandley · 2026-04-03T06:24:38Z

+### Key Principles (from [GitHub Security Lab](https://securitylab.github.com/resources/github-actions-preventing-pwn-requests/))
+
+1. **Never execute untrusted PR code with elevated credentials.** The classic "pwn-request" attack is `pull_request_target` + checkout PR + run build scripts with `GITHUB_TOKEN`. The attack surface includes build scripts (`make`, `build.ps1`), package manager hooks (`npm postinstall`, MSBuild targets), and test runners.
+
+2. **Treating PR contents as passive data is safe.** Reading, analyzing, or diffing PR code is fine — the danger is *executing* it. Our gh-aw workflows read code for evaluation; they never build or run it.
+
+3. **`pull_request_target` grants write permissions and secrets access.** This is by design — the workflow YAML comes from the base branch (trusted). But any step that checks out and runs fork code in this context creates a vulnerability.
+
+4. **`pull_request` from forks has no secrets access.** GitHub withholds secrets because the workflow YAML comes from the fork (untrusted). This is the safe default for CI builds on fork PRs.
+
+5. **The `workflow_run` pattern separates privilege from code execution.** Build in an unprivileged `pull_request` job → pass artifacts → process in a privileged `workflow_run` job. This is architecturally what gh-aw does: agent runs read-only, `safe_outputs` job has write permissions.


jeffhandley · 2026-04-03T06:29:15Z

+  pull_request_target:
+    types: [opened, synchronize, reopened]


I do think this will still lead to the 'Approve and run workflows' button showing up for PRs from untrusted forks. We need to solidify the guidance we give for when to hit that button. I really wish that button navigated into a list of workflows needing approval for the PR with boxes to select which to approve.

jeffhandley · 2026-04-03T06:33:17Z

+      suppress_comment:
+        description: 'Dry-run — evaluate but do not post a comment on the PR'


Future-proofing: I suggest renaming to suppress_output in case the output changes (to a PR review for example).

Suggested change

suppress_comment:

description: 'Dry-run — evaluate but do not post a comment on the PR'

suppress_output:

description: 'Dry-run - evaluate but do not post output on the PR'

jeffhandley · 2026-04-03T06:35:24Z

+        type: boolean
+        default: false
+  bots:
+    - "copilot[bot]"


I am not certain what identity ends up getting used here; I experimented and I think it's one of these but not sure which. 😄

copilot

copilot[bot]

app/copilot-swe-agent

copilot-swe-agent

copilot-swe-agent[bot]

jeffhandley · 2026-04-03T06:38:34Z

 if: >-
-  (github.event_name == 'pull_request' && github.event.pull_request.draft == false) ||
+  (github.event_name == 'pull_request_target' && github.event.pull_request.draft == false) ||
  github.event_name == 'workflow_dispatch' ||
  (github.event_name == 'issue_comment' &&
   github.event.issue.pull_request &&


I always guard against forks as well, preventing the workflow from running on forks except for the workflow_dispatch event. Otherwise, PRs within a fork will result in failing workflow runs (vs. starting the workflow and skipping all jobs).

Simple case that needs adapting to your scenario: if: (!github.event.repository.fork) || github.event_name == 'workflow_dispatch'.

jeffhandley · 2026-04-03T06:43:30Z

+When triggered via `workflow_dispatch` with `suppress_comment` = `${{ inputs.suppress_comment }}`:
+- If **true**, perform the full evaluation but **do not** post a comment on the PR. Write the evaluation to the workflow log only. This is useful for testing the skill without spamming the PR.
+- If **false** (default), post the comment as normal.


These expressions get replaced on their way to the model so this would end up embedding the true or false value into the opening statement. I think you need something more like this (but I recommend validating my understanding here). Note I also reflected my suggested input rename from above.

Suggested change

When triggered via `workflow_dispatch` with `suppress_comment` = `${{ inputs.suppress_comment }}`:

- If **true**, perform the full evaluation but **do not** post a comment on the PR. Write the evaluation to the workflow log only. This is useful for testing the skill without spamming the PR.

- If **false** (default), post the comment as normal.

When triggered via `workflow_dispatch`, the `suppress_output` input controls behavior.

- If `${{ inputs.suppress_output }}` == **true**, perform the full evaluation but **do not** post a comment on the PR. Write the evaluation to the workflow log only. This is useful for testing the skill without spamming the PR.

- If `${{ inputs.suppress_output }}` == **false** (default), post the comment as normal.

jeffhandley · 2026-04-03T06:43:56Z

 ## Posting Results

-Call `add_comment` with `item_number` set to the PR number. Wrap the report in a collapsible `<details>` block:
+If dry-run mode is active (`suppress_comment` is true), log the evaluation report to stdout and stop — do **not** call `add_comment`.


Suggested change

If dry-run mode is active (`suppress_comment` is true), log the evaluation report to stdout and stop — do **not** call `add_comment`.

If dry-run mode is active (`suppress_output` is true), log the evaluation report to stdout and stop — do **not** call `add_comment`.

The correct identity for Copilot-authored PRs is copilot-swe-agent[bot] (882 commits in this repo), not copilot[bot] (0 commits). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

gh pr diff fails with HTTP 406 for PRs with 300+ changed files. Fall back to paginated REST API (pulls/files) when diff is too large. Fixes the failure seen on PR #34617 (inflight candidate with 300+ files). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

PureWeen · 2026-04-08T17:21:06Z

Addressing Jeff's Review Feedback

Thanks for the thorough review @jeffhandley! Here's what we applied:

✅ Applied

Renamed suppress_comment → suppress_output — Future-proofs if output changes to a PR review or other format. (commit f3dc6a9)
Disabled no-op run report issues — Added report-as-issue: false under noop: in safe-outputs. Note: the docs suggest this goes under safe-outputs: noop:, not as a top-level property. (commit f3dc6a9)
Improved dry-run mode wording — Clarified how ${{ inputs.suppress_output }} expression replacement works in the agent prompt. (commit f3dc6a9)
Fixed bot identity — Changed bots: from copilot[bot] to copilot-swe-agent[bot]. Git log shows 882 commits from copilot-swe-agent[bot] and 0 from copilot[bot] in this repo. (commit 7bcc980)

🐛 Additional fix discovered

Fixed gate step for large PRs (300+ files) — gh pr diff fails with HTTP 406 for PRs with 300+ changed files (e.g., PR March 30th, Inflight Candidate #34617 inflight candidate). Now falls back to the paginated pulls/files REST API. (commit 04bf387)

📝 Acknowledged (no change needed)

"Approve and run" concern — You're right this shows up for fork contributors with pull_request_target. However, dotnet/maui already has two pull_request_target workflows (dogfood-comment.yml and bump-global-json.yml) — both self-gate to avoid fork triggers. Our workflow is different in that we want it to run on fork PRs, which is why the approval button appears. Fork contributors can alternatively use /evaluate-tests comment or maintainers can use workflow_dispatch to bypass this. We understand your broader direction toward issue_comment-only for org-wide patterns and will track that.

All changes compiled clean with gh aw compile. Running validation now via workflow_dispatch against PR #34882 (small, with tests) and PR #34617 (300+ files, gate fallback test).

Adds hide-older-comments: true to add-comment safe-output. Previous evaluation comments are automatically minimized (collapsed as 'outdated') when a new evaluation is posted, reducing PR noise. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

jeffhandley · 2026-04-10T04:14:49Z

+    github.event_name == 'workflow_dispatch' ||
+    (github.event_name == 'issue_comment' &&
+     github.event.issue.pull_request &&
+     startsWith(github.event.comment.body, '/evaluate-tests'))


Today I learned about the slash_command trigger at Command Triggers | GitHub Agentic Workflows. I suggest trying that out here. This could then change the if condition to use needs.activation.outputs.slash_command instead of the comment body.

Great catch -- implemented in e7fdf22. Replaced issue_comment + startsWith with slash_command: evaluate-tests (scoped to events: [pull_request_comment, issue_comment]). Simplified the if: condition since the platform handles command matching now. Also added an anti-patterns table to the gh-aw instructions file so we don't miss built-in features like this in the future.

@jeffhandley

- Replace manual issue_comment + startsWith with slash_command: trigger (auto emoji reactions, sanitized input, eliminates skipped runs) - Add 'Before You Build' anti-patterns table to gh-aw instructions listing 13 manual patterns that have built-in gh-aw equivalents - Simplify if: condition (platform handles command matching) Addresses review feedback from @jeffhandley (slash_command suggestion). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Add labels: ['pr-review', 'testing'] for gh aw status filtering - Update Fork PR Behavior table in instructions to document that slash_command compiles to issue_comment with platform-managed matching Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

PureWeen · 2026-04-13T22:54:46Z

🔍 Multi-Model Code Review — PR #34678

Reviewed by: 3 independent reviewers + adversarial consensus rounds (3 review cycles)
Files: 4 changed (workflow source .md, compiled .lock.yml, PowerShell script, instruction docs)
CI Status: ⚠️ maui-pr skipping (expected — no runtime code changes), license/cla ✅, dogfood-comment ✅

Final Review (commits `d142b43` → `72356a7`)

All findings from previous review cycles have been addressed. This is the third and final pass.

All Findings — Final Status

#	Finding	Severity	Status
1	`isFork` not enforced in Checkout-GhAwPr.ps1	🟡	✅ FIXED — fork gate at line 64
2	Access gate dead code / stale docs	🟡	✅ FIXED — docs updated for `slash_command`
3	Fork SKILL.md risk via `checkout_pr_branch.cjs`	🟡	✅ DOCUMENTED — accepted residual risk (gh-aw#18481)
4	Restore failure silently swallowed	🟢	✅ FIXED — fatal `exit 1` on failure
5	Gate `exit 1` noise for no-ops	🟢	✅ BY DESIGN — documented as intentional
6	REST fallback fires on "no matches"	🟢	✅ FIXED — `if/else` on `gh pr diff` exit code
7	`if: issue_comment` tautological	🟢	✅ FIXED — removed; compiler generates proper filtering
8	`ConvertFrom-Json` before `LASTEXITCODE`	🟢	✅ FIXED — try/catch with separate exit code check
9	Docs reference `pull_request_target` as active	🟢	✅ FIXED — updated to `slash_command`
10	API errors masked as "PR not OPEN"	🟢	✅ FIXED — exit code checked separately
11	REST API fallback masks errors as "no test files"	🟢	✅ FIXED — fallback errors surfaced with `exit 1`

Score: 11/11 resolved (3/3 reviewers confirm on final pass)

Non-Issues Verified ✅

COPILOT_TOKEN exposure: Documented with layered mitigations (firewall, redaction, threat detection)
Concurrency group: Correct for current issue_comment-only trigger
hide-older-comments: true: Platform feature, correctly used
REST API pagination: --paginate with proper error handling
Permission check: Collaborator API approach is correct
noop report-as-issue: false: Correctly set
PR_NUMBER injection: Integer from github.event.issue.number — safe
Security documentation: Anti-patterns table and defense layers are high-quality additions
Fork gate ordering: Runs before checkout — correct
slash_command compiler output: Proper /evaluate-tests matching + PR-only filtering

Test Coverage

Workflow infrastructure — no automated tests available. Validation via /evaluate-tests slash command invocations is appropriate.

Recommendation

✅ Approve — All 11 findings across 3 review cycles are resolved. No new issues found on final pass (3/3 reviewers agree). PR is ready to merge.

- Remove if: restriction on gate step — now runs for all events (pull_request_target, workflow_dispatch, slash_command) - Unify PR_NUMBER from all event sources in gate step - Bump timeout-minutes from 15 to 20 for complex evaluations - Pass -PrNumber to Gather-TestContext.ps1 in prompt Fixes: agent evaluated wrong files for workflow_dispatch on no-test PRs because gate was skipped, causing 15min timeout. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…it code Gate step: exit 0 instead of exit 1 when no test files found. Sets HAS_TEST_FILES=false env var for the agent to noop quickly. PRs without tests now show clean ✅ in GitHub checks instead of ❌. Checkout script: exit 1 instead of exit 0 on permission denial. Prevents evaluating wrong code when author lacks write access. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…n check - Add HAS_TEST_FILES env var check to Checkout step condition so workflow_dispatch skips checkout when gate found no tests - Prevents Checkout-GhAwPr.ps1 from failing on bot-authored PRs (app/copilot-swe-agent can't be looked up via collaborator API) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

exit 0 allowed the agent container to spin up and waste 20 minutes on PRs with no test files. exit 1 stops the workflow immediately. HAS_TEST_FILES env var is no longer needed since exit 1 halts everything. For pull_request_target, the paths filter prevents triggering on no-test PRs anyway. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Comment out pull_request_target and workflow_dispatch triggers. The workflow now only runs when someone comments /evaluate-tests on a PR. This avoids noisy auto-triggered runs while we stabilize the workflow. The commented-out triggers can be re-enabled later. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Remove issue_comment from slash_command events so /evaluate-tests only works on PR comments, not regular issues - Add early state check to skip evaluation on closed/merged PRs Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Add explicit fork PR rejection in Checkout-GhAwPr.ps1 (isFork gate) - Fix ConvertFrom-Json ordering to prevent masking gh CLI errors - Make infrastructure restore failure fatal (exit 1) instead of warning - Remove pre-delete of .github/skills/ before restore to prevent data loss - Document fork SKILL.md residual risk in workflow with mitigations - Update instructions doc to match current script behavior Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…utological if - Remove tautological if: github.event_name == 'issue_comment' from .md source — compiler generates proper slash_command filtering automatically - Separate gh pr diff exit code from content check so REST API fallback only fires on actual command failure (HTTP 406), not empty results - Distinguish API failures from closed PRs in gate state check — API errors now show a clear error message instead of masking as 'UNKNOWN' Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

When gh pr diff fails (HTTP 406 on large PRs) and the REST API fallback also fails (rate limit, auth, transient 5xx), the gate now shows a clear error message and exits 1 instead of silently treating the failure as 'no test files found'. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Uncomment workflow_dispatch trigger and Checkout-GhAwPr.ps1 step. pull_request_target remains disabled (no auto-runs on PR create/update). Manual triggering via Actions tab is now available alongside /evaluate-tests. Restore dry-run mode, concurrency group, and PR number resolution for workflow_dispatch path. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings March 26, 2026 16:46

Copilot started reviewing on behalf of PureWeen March 26, 2026 16:47 View session

Copilot AI reviewed Mar 26, 2026

View reviewed changes

PureWeen marked this pull request as draft March 26, 2026 17:07

github-actions Bot and others added 5 commits March 26, 2026 12:59

PureWeen marked this pull request as ready for review March 26, 2026 20:29

Copilot started work on behalf of PureWeen March 26, 2026 20:41 View session

dotnet deleted a comment from github-actions Bot Mar 26, 2026

Copilot finished work on behalf of PureWeen March 26, 2026 20:49

github-actions Bot mentioned this pull request Mar 27, 2026

[repo-status] Daily Repo Status - March 27, 2026 #34694

Closed

github-actions Bot and others added 3 commits March 27, 2026 09:26

PureWeen changed the title ~~Switch evaluate-pr-tests to pull_request_target for fork PR support~~ Improve evaluate-pr-tests workflow: fork support, access gating, dry-run Mar 30, 2026

kubaflo previously approved these changes Mar 30, 2026

View reviewed changes

PureWeen marked this pull request as draft March 30, 2026 23:15

PureWeen marked this pull request as ready for review April 2, 2026 13:58

PureWeen dismissed kubaflo’s stale review via 17c0f27 April 2, 2026 15:44

github-actions Bot and others added 3 commits April 2, 2026 15:37

PureWeen changed the title ~~Improve evaluate-pr-tests workflow: fork support, access gating, dry-run~~ Improve evaluate-pr-tests workflow: pull_request_target, access gating, security docs Apr 2, 2026

jeffhandley reviewed Apr 3, 2026

View reviewed changes

github-actions Bot and others added 2 commits April 8, 2026 08:54

Fix bot identity: copilot[bot] → copilot-swe-agent[bot]

7bcc980

The correct identity for Copilot-authored PRs is copilot-swe-agent[bot] (882 commits in this repo), not copilot[bot] (0 commits). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

PureWeen requested review from jeffhandley and kubaflo April 9, 2026 17:38

kubaflo previously approved these changes Apr 9, 2026

View reviewed changes

jeffhandley approved these changes Apr 10, 2026

View reviewed changes

PureWeen dismissed kubaflo’s stale review via e7fdf22 April 13, 2026 22:15

github-actions Bot mentioned this pull request Apr 13, 2026

[aw] Evaluate PR Tests failed #34888

Closed

github-actions Bot and others added 8 commits April 13, 2026 19:05

PureWeen changed the title ~~Improve evaluate-pr-tests workflow: pull_request_target, access gating, security docs~~ Improve evaluate-pr-tests workflow: slash_command trigger, access gating, security hardening Apr 14, 2026

kubaflo previously approved these changes Apr 15, 2026

View reviewed changes

PureWeen dismissed kubaflo’s stale review via 338d7c1 April 15, 2026 15:58

PureWeen changed the title ~~Improve evaluate-pr-tests workflow: slash_command trigger, access gating, security hardening~~ Improve evaluate-pr-tests workflow: slash_command + workflow_dispatch, security hardening Apr 15, 2026

PureWeen merged commit eb0b82f into main Apr 15, 2026
17 of 20 checks passed

PureWeen deleted the fix/evaluate-tests-fork-support branch April 15, 2026 17:21

github-actions Bot mentioned this pull request Apr 16, 2026

[repo-status] 🌟 Daily Repo Status - April 16, 2026 #34990

Closed

	(github.event_name == 'pull_request_target' && github.event.pull_request.draft == false) \|\| github.event_name == 'workflow_dispatch' \|\| (github.event_name == 'issue_comment' &&
	(github.event_name == 'pull_request_target' && github.event.pull_request.draft == false && github.event.pull_request.head.repo.fork == false) \|\| github.event_name == 'workflow_dispatch' \|\| (github.event_name == 'issue_comment' &&

		suppress_comment:
		description: 'Dry-run — evaluate but do not post a comment on the PR'

	If dry-run mode is active (`suppress_comment` is true), log the evaluation report to stdout and stop — do not call `add_comment`.
	If dry-run mode is active (`suppress_output` is true), log the evaluation report to stdout and stop — do not call `add_comment`.

Conversation

PureWeen commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

What Changed

Trigger Behavior

Security Model

Validation

Known Limitations

Uh oh!

github-actions Bot commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

PureWeen commented Apr 8, 2026

Addressing Jeff's Review Feedback

✅ Applied

🐛 Additional fix discovered

📝 Acknowledged (no change needed)

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

PureWeen commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔍 Multi-Model Code Review — PR #34678

Final Review (commits d142b43 → 72356a7)

All Findings — Final Status

Non-Issues Verified ✅

Test Coverage

Recommendation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

PureWeen commented Mar 26, 2026 •

edited

Loading

github-actions Bot commented Mar 26, 2026 •

edited

Loading

PureWeen commented Apr 13, 2026 •

edited

Loading

Final Review (commits `d142b43` → `72356a7`)