[PoC] Code testing agent + tests PoC #433

Merged
JanKrivanek merged 2 commits into main from dev/jankrivanek/code-testing-agent
Mar 30, 2026

Conversation

@JanKrivanek
Member

No description provided.

@JanKrivanek
Member Author

/evaluate

@github-actions
Contributor

Skill Validation Results

Skill Scenario Quality (Isolated) Quality (Plugin) Skills Loaded Agents Invoked Overfit Verdict
csharp-scripts Test a C# language feature with a script 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ csharp-scripts; tools: skill, create / ✅ csharp-scripts; tools: skill, create, edit — / — 🟡 0.32
nuget-trusted-publishing Set up trusted publishing for a new NuGet library 3.0/5 → 5.0/5 🟢 3.0/5 → 4.0/5 🟢 ✅ nuget-trusted-publishing; tools: skill, task / ✅ nuget-trusted-publishing; tools: skill explore / — ✅ 0.12
nuget-trusted-publishing Set up NuGet publishing without mentioning trusted publishing 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ nuget-trusted-publishing; tools: skill, view, glob / ✅ nuget-trusted-publishing; tools: skill, glob, view — / — ✅ 0.12
nuget-trusted-publishing Migrate existing workflow from API key to trusted publishing 3.0/5 → 4.0/5 🟢 3.0/5 → 4.0/5 🟢 ✅ nuget-trusted-publishing; tools: skill / ✅ nuget-trusted-publishing; tools: skill, view, bash — / — ✅ 0.12
dotnet-pinvoke Generate LibraryImport declaration from C header (.NET 8+) 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ dotnet-pinvoke; tools: skill / ✅ dotnet-pinvoke; tools: skill — / — ✅ 0.08
dotnet-pinvoke Generate LibraryImport declaration from C header (.NET Framework) 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ dotnet-pinvoke; tools: skill / ✅ dotnet-pinvoke; tools: skill — / — ✅ 0.08 [1]
dotnet-trace-collect High CPU in Kubernetes on Linux (.NET 8) 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: skill, report_intent, view — / — ✅ 0.13
dotnet-trace-collect .NET Framework on Windows without admin privileges 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: skill / ✅ dotnet-trace-collect; tools: skill — / — ✅ 0.13
dotnet-trace-collect .NET 10 on Linux with root access and native call stacks 1.0/5 → 3.0/5 🟢 1.0/5 → 4.0/5 🟢 ✅ dotnet-trace-collect; tools: skill / ✅ dotnet-trace-collect; tools: skill — / — ✅ 0.13
dotnet-trace-collect Memory leak on Linux (.NET 8) 3.0/5 → 2.0/5 🔴 3.0/5 → 3.0/5 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: skill, report_intent, view — / — ✅ 0.13
dotnet-trace-collect Slow requests on Windows with PerfView 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: skill — / — ✅ 0.13
dotnet-trace-collect Excessive GC on Linux (.NET 8) 5.0/5 → 4.0/5 🔴 5.0/5 → 4.0/5 🔴 ✅ dotnet-trace-collect; tools: skill / ✅ dotnet-trace-collect; tools: skill — / — ✅ 0.13
dotnet-trace-collect Hang or deadlock diagnosis on Linux 3.0/5 → 4.0/5 🟢 3.0/5 → 4.0/5 🟢 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: report_intent, skill, view — / — ✅ 0.13
dotnet-trace-collect Windows container high CPU with PerfView 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: report_intent, skill, view / ✅ dotnet-trace-collect; tools: report_intent, skill, view — / — ✅ 0.13
dotnet-trace-collect Long-running intermittent issue with PerfView triggers 3.0/5 → 4.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: skill, report_intent, view — / — ✅ 0.13
dotnet-trace-collect Linux pre-.NET 10 needing native call stacks 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: report_intent, skill, view — / — ✅ 0.13
dotnet-trace-collect Windows modern .NET with admin high CPU 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: skill, report_intent, view — / — ✅ 0.13
dotnet-trace-collect Memory leak on .NET Framework Windows 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: report_intent, skill, view / ✅ dotnet-trace-collect; tools: skill, report_intent, view — / — ✅ 0.13
dotnet-trace-collect Kubernetes with console access prefers console tools 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: report_intent, skill, view — / — ✅ 0.13 [2]
dotnet-trace-collect Container installation without .NET SDK 3.0/5 → 4.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: skill / ✅ dotnet-trace-collect; tools: skill — / — ✅ 0.13
dotnet-trace-collect HTTP 500s from downstream service on Linux (.NET 8) 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: skill, report_intent, view — / — ✅ 0.13 [3]
dotnet-trace-collect Networking timeouts on Windows with admin (.NET 8) 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: skill, report_intent, view / ✅ dotnet-trace-collect; tools: skill, report_intent, view — / — ✅ 0.13
dotnet-trace-collect Assembly loading failure on Linux (.NET 8) 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dotnet-trace-collect; tools: report_intent, skill, view / ✅ dotnet-trace-collect; tools: skill, report_intent, view — / — ✅ 0.13
microbenchmarking Investigate runtime upgrade performance impact 3.0/5 → 3.0/5 3.0/5 → 4.0/5 🟢 ✅ microbenchmarking; tools: skill / ✅ microbenchmarking; tools: skill, read_bash — / — ✅ 0.11
clr-activation-debugging Diagnose unexpected FOD dialog from native build tool 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ clr-activation-debugging; tools: skill, bash / ✅ clr-activation-debugging; tools: skill — / — ✅ 0.09 [4]
clr-activation-debugging Diagnose FOD suppressed but activation still failing 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ clr-activation-debugging; tools: skill / ✅ clr-activation-debugging; tools: skill — / — ✅ 0.09
clr-activation-debugging Explain why same binary behaves differently under different launch methods 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ clr-activation-debugging; tools: skill / ✅ clr-activation-debugging; tools: skill — / — ✅ 0.09 [5]
clr-activation-debugging Analyze healthy managed EXE activation 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ clr-activation-debugging; tools: skill / ✅ clr-activation-debugging; tools: skill — / — ✅ 0.09 [6]
clr-activation-debugging Identify multiple activation sequences in a single log 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ clr-activation-debugging; tools: skill, bash / ✅ clr-activation-debugging; tools: skill — / — ✅ 0.09
clr-activation-debugging Explain useLegacyV2RuntimeActivationPolicy in activation log 3.0/5 → 3.0/5 3.0/5 → 4.0/5 🟢 ✅ clr-activation-debugging; tools: skill / ✅ clr-activation-debugging; tools: skill — / — ✅ 0.09
clr-activation-debugging Decline non-CLR-activation issue 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ clr-activation-debugging; tools: skill, bash / ✅ clr-activation-debugging; tools: skill — / — ✅ 0.09
analyzing-dotnet-performance Detects compiled regex startup budget and regex chain allocations 1.0/5 ⏰ → 5.0/5 🟢 1.0/5 ⏰ → 5.0/5 🟢 ✅ analyzing-dotnet-performance; tools: skill, bash / ✅ analyzing-dotnet-performance; tools: skill, bash — / — ✅ 0.16
analyzing-dotnet-performance Detects CurrentCulture comparer and compiled regex budget in inflection rules 3.0/5 → 5.0/5 🟢 3.0/5 → 1.0/5 ⏰ 🔴 ✅ analyzing-dotnet-performance; tools: skill, bash / ✅ analyzing-dotnet-performance; tools: skill, bash, stop_bash — / — ✅ 0.16
analyzing-dotnet-performance Finds per-call Dictionary allocation not hoisted to static 3.0/5 ⏰ → 5.0/5 🟢 3.0/5 ⏰ → 5.0/5 🟢 ✅ analyzing-dotnet-performance; tools: skill / ✅ analyzing-dotnet-performance; tools: skill — / — ✅ 0.16
analyzing-dotnet-performance Catches compound allocations in recursive number converter with ToLower 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ analyzing-dotnet-performance; tools: skill, bash / ✅ analyzing-dotnet-performance; tools: skill, bash — / — ✅ 0.16 [7]
analyzing-dotnet-performance Finds StringComparison.Ordinal missing and FrozenDictionary opportunities 5.0/5 → 5.0/5 5.0/5 → 3.0/5 ⏰ 🔴 ✅ analyzing-dotnet-performance; tools: skill, bash / ✅ analyzing-dotnet-performance; tools: skill, bash, read_bash — / — ✅ 0.16
analyzing-dotnet-performance Detects Aggregate+Replace chain and struct missing IEquatable 4.0/5 → 1.0/5 ⏰ 🔴 4.0/5 → 1.0/5 ⏰ 🔴 ✅ analyzing-dotnet-performance; tools: skill, bash, read_bash, stop_bash / ✅ analyzing-dotnet-performance; tools: skill, bash, write_bash, stop_bash — / — ✅ 0.16
analyzing-dotnet-performance Finds branched Replace chain in format string manipulation 3.0/5 → 3.0/5 3.0/5 → 4.0/5 🟢 ✅ analyzing-dotnet-performance; tools: skill, bash / ✅ analyzing-dotnet-performance; tools: skill, bash — / — ✅ 0.16 🟡
analyzing-dotnet-performance Catches LINQ on hot-path string processing and All(char.IsUpper) 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ analyzing-dotnet-performance; tools: skill, bash / ✅ analyzing-dotnet-performance; tools: skill, bash — / — ✅ 0.16 [8]
analyzing-dotnet-performance Detects LINQ pipeline in TimeSpan formatting and collection processing 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ analyzing-dotnet-performance; tools: skill, bash / ✅ analyzing-dotnet-performance; tools: skill, bash, read_bash — / — ✅ 0.16 [9]
analyzing-dotnet-performance Flags Span inconsistencies and compound method chains in truncation library 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ analyzing-dotnet-performance; tools: skill, bash / ✅ analyzing-dotnet-performance; tools: skill, bash — / — ✅ 0.16 [10]
analyzing-dotnet-performance Identifies unsealed leaf classes and locale hierarchy patterns 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ analyzing-dotnet-performance; tools: skill, bash / ✅ analyzing-dotnet-performance; tools: skill, bash — / — ✅ 0.16
android-tombstone-symbolication Symbolicate .NET frames in an Android tombstone 4.0/5 → 4.0/5 4.0/5 → 3.0/5 🔴 ✅ android-tombstone-symbolication; tools: skill, read_bash, stop_bash / ✅ android-tombstone-symbolication; tools: skill, read_bash, stop_bash — / — ✅ 0.11
android-tombstone-symbolication Recognize tombstone with no .NET frames 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ android-tombstone-symbolication; tools: skill, bash / ✅ android-tombstone-symbolication; tools: skill — / — ✅ 0.11 [11]
android-tombstone-symbolication Symbolicate CoreCLR frames in an Android tombstone 2.0/5 ⏰ → 4.0/5 🟢 2.0/5 ⏰ → 3.0/5 🟢 ✅ android-tombstone-symbolication; tools: skill / ✅ android-tombstone-symbolication; tools: skill — / — ✅ 0.11
android-tombstone-symbolication Recognize NativeAOT tombstone with app binary and libSystem.Native.so 3.0/5 → 4.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ android-tombstone-symbolication; tools: skill, stop_bash / ✅ android-tombstone-symbolication; tools: skill, stop_bash — / — ✅ 0.11
android-tombstone-symbolication Symbolicate multi-thread tombstone 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ android-tombstone-symbolication; tools: skill / ✅ android-tombstone-symbolication; tools: skill — / — ✅ 0.11
android-tombstone-symbolication Handle .NET frames with no BuildId metadata 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ android-tombstone-symbolication; tools: skill, bash / ✅ android-tombstone-symbolication; tools: skill, bash — / — ✅ 0.11
android-tombstone-symbolication Symbolicate tombstone with multiple .NET libraries and different BuildIds 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ android-tombstone-symbolication; tools: skill, read_bash, stop_bash / ✅ android-tombstone-symbolication; tools: skill, read_bash, stop_bash — / — ✅ 0.11 [12]
android-tombstone-symbolication Reject iOS crash log as wrong format 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ℹ️ not activated (expected) / ℹ️ not activated (expected) — / — ✅ 0.11
dump-collect Configure automatic crash dumps for CoreCLR app on Linux 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ dump-collect; tools: report_intent, skill, view / ✅ dump-collect; tools: skill, report_intent, view — / — 🟡 0.27 [13]
dump-collect Set up NativeAOT crash dumps with createdump in Kubernetes 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ dump-collect; tools: skill / ✅ dump-collect; tools: skill — / — 🟡 0.27
dump-collect Recover crash dump from macOS NativeAOT without createdump 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ dump-collect; tools: report_intent, skill, view / ✅ dump-collect; tools: report_intent, skill, view — / — 🟡 0.27
dump-collect Configure CoreCLR dump collection in Alpine Docker as non-root 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ dump-collect; tools: skill, report_intent, view / ✅ dump-collect; tools: report_intent, skill, view — / — 🟡 0.27
dump-collect Advisory: macOS NativeAOT crash dump recovery steps 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ dump-collect; tools: skill / ✅ dump-collect; tools: skill — / — 🟡 0.27
dump-collect Advisory: CoreCLR Alpine Docker non-root configuration 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ dump-collect; tools: skill, report_intent, view / ✅ dump-collect; tools: skill, report_intent, view — / — 🟡 0.27
dump-collect Advisory: NativeAOT Kubernetes dump collection setup 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dump-collect; tools: report_intent, skill, view / ✅ dump-collect; tools: report_intent, skill, view — / — 🟡 0.27
dump-collect Detect runtime and configure crash dumps for unknown .NET app on Linux 4.0/5 → 5.0/5 🟢 4.0/5 → 4.0/5 ✅ dump-collect; tools: skill / ✅ dump-collect; tools: skill — / — 🟡 0.27
dump-collect Decline dump analysis request 2.0/5 → 5.0/5 🟢 2.0/5 → 3.0/5 🟢 ℹ️ not activated (expected) / ℹ️ not activated (expected) — / — 🟡 0.27
optimizing-ef-core-queries Optimize bulk operations with EF Core 7+ ExecuteUpdate and ExecuteDelete 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ optimizing-ef-core-queries; tools: skill / ✅ optimizing-ef-core-queries; tools: skill — / — 🟡 0.23 [14]
build-parallelism Analyze build parallelism bottlenecks 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ build-parallelism; tools: skill, task, glob / ✅ build-parallelism; tools: skill, task explore / explore ✅ 0.14
including-generated-files Diagnose generated file inclusion failure 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ including-generated-files; tools: skill / ✅ including-generated-files; tools: skill — / — ✅ 0.11
msbuild-antipatterns Review MSBuild files for anti-patterns and style issues 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ msbuild-antipatterns; tools: skill / ✅ msbuild-antipatterns; tools: skill — / — ✅ 0.06 [15]
build-perf-baseline Establish build performance baseline and recommend optimizations 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ build-perf-baseline; tools: skill / ✅ build-perf-baseline; tools: skill explore / explore 🟡 0.26
msbuild-modernization Modernize legacy project to SDK-style 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ msbuild-modernization; tools: skill, glob / ✅ msbuild-modernization; tools: skill — / — ✅ 0.06 [16]
directory-build-organization Organize build infrastructure for a multi-project repo 3.0/5 → 5.0/5 🟢 3.0/5 → 3.0/5 ✅ directory-build-organization; tools: skill, task / ⚠️ NOT ACTIVATED explore / explore ✅ 0.15 [17]
check-bin-obj-clash Diagnose bin/obj output path clashes 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ check-bin-obj-clash; tools: skill / ✅ check-bin-obj-clash; tools: glob, skill — / — ✅ 0.14 [18]
incremental-build Analyze incremental build issues 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ incremental-build; tools: skill, bash / ✅ incremental-build; tools: skill, bash — / — ✅ 0.12
eval-performance Analyze MSBuild evaluation performance issues 3.0/5 → 4.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ eval-performance; tools: skill, bash / ✅ eval-performance; tools: skill — / — ✅ 0.12
resolve-project-references Explain misleading ResolveProjectReferences time 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ resolve-project-references; tools: skill / ✅ resolve-project-references; tools: skill — / — ✅ 0.12
msbuild-server Recommend MSBuild Server for slow CLI incremental builds 3.0/5 → 4.0/5 🟢 3.0/5 → 4.0/5 🟢 ✅ msbuild-server; tools: skill / ✅ msbuild-server; tools: skill — / — 🟡 0.49
build-perf-diagnostics Diagnose slow build for a small project 4.0/5 → 4.0/5 4.0/5 → 5.0/5 🟢 ⚠️ NOT ACTIVATED / ✅ binlog-generation; build-perf-diagnostics; tools: skill — / — 🟡 0.21 [19]
binlog-generation Build project with /bl flag 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ binlog-generation; tools: skill / ✅ binlog-generation; tools: skill — / — ✅ 0.00
binlog-generation Build with /bl in PowerShell 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ binlog-generation; tools: skill / ✅ binlog-generation; tools: skill — / — ✅ 0.00
binlog-generation Build multiple configurations with unique binlogs 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ binlog-generation; tools: skill / ✅ binlog-generation; tools: skill — / — ✅ 0.00
binlog-failure-analysis Diagnose build failures from binlog only (no source files) 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ binlog-failure-analysis; tools: skill / ✅ binlog-failure-analysis; tools: skill — / — ✅ 0.04
dotnet-maui-doctor Plan macOS MAUI setup with Xcode 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ dotnet-maui-doctor; tools: skill / ✅ dotnet-maui-doctor; tools: skill — / — 🟡 0.24
dotnet-maui-doctor Plan Linux MAUI environment for Android 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dotnet-maui-doctor; tools: skill / ✅ dotnet-maui-doctor; tools: skill — / — 🟡 0.24
dotnet-maui-doctor Guardrail against workload update and repair 1.0/5 → 3.0/5 🟢 1.0/5 → 3.0/5 🟢 ✅ dotnet-maui-doctor; tools: report_intent, skill / ✅ dotnet-maui-doctor; tools: skill — / — 🟡 0.24
dotnet-maui-doctor Diagnose non-Microsoft JDK causing build failure 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dotnet-maui-doctor; tools: skill / ✅ dotnet-maui-doctor; tools: skill — / — 🟡 0.24
dotnet-maui-doctor Plan complete MAUI setup on Windows 4.0/5 → 4.0/5 4.0/5 → 5.0/5 🟢 ✅ dotnet-maui-doctor; tools: skill / ✅ dotnet-maui-doctor; tools: skill — / — 🟡 0.24
dotnet-maui-doctor Prevent incorrect JAVA_HOME configuration 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ dotnet-maui-doctor; tools: skill / ✅ dotnet-maui-doctor; tools: skill — / — 🟡 0.24
dotnet-maui-doctor Determine required Android SDK packages for specific .NET version 2.0/5 → 4.0/5 🟢 2.0/5 → 4.0/5 🟢 ✅ dotnet-maui-doctor; tools: report_intent, skill, view, bash / ✅ dotnet-maui-doctor; tools: report_intent, skill, view, bash — / — 🟡 0.24
dotnet-maui-doctor Fix stale MAUI workloads after SDK update 2.0/5 → 4.0/5 🟢 2.0/5 → 4.0/5 🟢 ✅ dotnet-maui-doctor; tools: skill / ✅ dotnet-maui-doctor; tools: skill — / — 🟡 0.24
technology-selection ML.NET classification on tabular data 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ technology-selection; tools: skill / ✅ technology-selection; tools: skill — / — 🟡 0.32
technology-selection LLM integration with MEAI abstraction 1.0/5 → 4.0/5 🟢 1.0/5 → 1.0/5 ✅ technology-selection; tools: skill, task / ⚠️ NOT ACTIVATED general-purpose / — 🟡 0.32 [20]
technology-selection Reject LLM for tabular classification 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ technology-selection; tools: skill, task / ✅ technology-selection; tools: skill, read_bash, stop_bash general-purpose / — 🟡 0.32
technology-selection Agentic workflow with guardrails 1.0/5 ⏰ → 1.0/5 ⏰ 1.0/5 ⏰ → 1.0/5 ⏰ ✅ technology-selection; tools: skill / ✅ technology-selection; tools: skill explore / explore 🟡 0.32
technology-selection Natural-language scenario decomposition — RAG chatbot 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ technology-selection; tools: skill / ✅ technology-selection; tools: skill — / — 🟡 0.32 [21]
technology-selection RAG pipeline with vector search 4.0/5 → 5.0/5 🟢 4.0/5 → 4.0/5 ✅ technology-selection; tools: skill / ✅ technology-selection; tools: skill, edit — / — 🟡 0.32
convert-to-cpm Decline CPM conversion for packages.config project 1.0/5 → 2.0/5 🟢 1.0/5 → 2.0/5 🟢 ℹ️ not activated (expected) / ℹ️ not activated (expected) — / — ✅ 0.19
convert-to-cpm Recommend CPM when updating packages with version conflicts 2.0/5 → 3.0/5 🟢 2.0/5 → 3.0/5 🟢 ✅ convert-to-cpm; tools: skill, create / ✅ convert-to-cpm; tools: skill, create — / — ✅ 0.19
convert-to-cpm Recommend CPM when updating packages in a complex repository 2.0/5 → 3.0/5 🟢 2.0/5 → 3.0/5 🟢 ✅ convert-to-cpm; tools: skill, glob / ✅ convert-to-cpm; tools: skill, read_agent, glob — / explore ✅ 0.19
convert-to-cpm Convert single project to CPM 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ convert-to-cpm; tools: skill, glob, bash / ✅ convert-to-cpm; tools: skill, task, glob, bash — / explore ✅ 0.19
convert-to-cpm Convert multi-project solution to CPM 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ convert-to-cpm; tools: skill, bash / ✅ convert-to-cpm; tools: skill, bash — / — ✅ 0.19
convert-to-cpm Convert solution with MSBuild property versions to CPM 3.0/5 → 4.0/5 🟢 3.0/5 → 4.0/5 🟢 ✅ convert-to-cpm; tools: skill, bash / ✅ convert-to-cpm; tools: skill, bash — / — ✅ 0.19
convert-to-cpm Convert solution with version conflicts to CPM 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ convert-to-cpm; tools: skill, bash / ✅ convert-to-cpm; tools: skill, task, bash, read_agent — / explore ✅ 0.19
convert-to-cpm Convert complex repository with multiple CPM challenges 3.0/5 → 4.0/5 🟢 3.0/5 → 4.0/5 🟢 ✅ convert-to-cpm; tools: skill / ✅ convert-to-cpm; tools: skill — / — ✅ 0.19
migrate-dotnet9-to-dotnet10 Console app with System.Linq.Async, SIGTERM, and BufferedStream 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ migrate-dotnet9-to-dotnet10; tools: skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: skill, view — / —
migrate-dotnet9-to-dotnet10 Expression tree code broken by C# 14 span overload resolution 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, grep, view / ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, bash, grep — / — [22]
migrate-dotnet9-to-dotnet10 ASP.NET Core app with WebHostBuilder, OpenAPI, and forwarded headers 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet9-to-dotnet10; tools: skill / ✅ migrate-dotnet9-to-dotnet10; tools: skill — / — [23]
migrate-dotnet9-to-dotnet10 ASP.NET Core app with OpenAPI transformers using Microsoft.OpenApi v1 APIs 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: skill, report_intent, view — / —
migrate-dotnet9-to-dotnet10 EF Core app with Azure SQL JSON columns and parameterized collections 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view — / —
migrate-dotnet9-to-dotnet10 EF Core app with dynamic ExecuteUpdate and complex types 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet9-to-dotnet10; tools: skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: skill, view — / — [24]
migrate-dotnet9-to-dotnet10 SQLite app with DateTimeOffset timezone handling 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet9-to-dotnet10; tools: skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: skill, view — / — [25]
migrate-dotnet9-to-dotnet10 Worker service with config null array binding and ProviderAlias assembly change 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet9-to-dotnet10; tools: skill / ✅ migrate-dotnet9-to-dotnet10; tools: skill — / —
migrate-dotnet9-to-dotnet10 Cryptography app with OpenSSL, X.509, and Rfc2898DeriveBytes 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet9-to-dotnet10; tools: skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: skill, view — / — [26]
migrate-dotnet9-to-dotnet10 SDK and NuGet obscure tooling changes 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet9-to-dotnet10; tools: skill / ✅ migrate-dotnet9-to-dotnet10; tools: skill — / —
migrate-dotnet9-to-dotnet10 JSON polymorphism with conflicting property names and XmlSerializer 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view — / —
migrate-dotnet9-to-dotnet10 WinForms and WPF desktop app with System.Drawing and DynamicResource 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ migrate-dotnet9-to-dotnet10; tools: skill / ✅ migrate-dotnet9-to-dotnet10; tools: skill — / —
migrate-dotnet9-to-dotnet10 Containerized single-file app with P/Invoke and IDispatchEx 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view — / —
migrate-dotnet9-to-dotnet10 App using SslStream properties and SystemEvents 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet9-to-dotnet10; tools: skill / ✅ migrate-dotnet9-to-dotnet10; tools: skill, view — / — [27]
migrate-dotnet9-to-dotnet10 Library with NuGet auditing, transitive deps, and InlineArray 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ migrate-dotnet9-to-dotnet10; tools: skill, report_intent, view / ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view — / — [28]
migrate-dotnet9-to-dotnet10 C# 14 compiler breaking changes — field keyword, extension keyword, disposal 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: report_intent, skill, view — / —
migrate-dotnet9-to-dotnet10 Blazor WASM app with generic math shift masking and tar operations 5.0/5 → 5.0/5 5.0/5 → 4.0/5 🔴 ✅ migrate-dotnet9-to-dotnet10; tools: skill, view / ✅ migrate-dotnet9-to-dotnet10; tools: skill, view — / —
migrate-dotnet8-to-dotnet9 App with empty environment variables, ZIP encoding, and keyed DI services 2.0/5 → 4.0/5 🟢 2.0/5 → 4.0/5 🟢 ✅ migrate-dotnet8-to-dotnet9; tools: skill, bash, create / ✅ migrate-dotnet8-to-dotnet9; tools: skill, bash — / — ✅ 0.11
migrate-dotnet8-to-dotnet9 C# 13 compiler breaking changes — InlineArray on record, iterator safe context, collection expressions 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet8-to-dotnet9; tools: report_intent, skill, view / ✅ migrate-dotnet8-to-dotnet9; tools: skill, report_intent, view — / — ✅ 0.11
migrate-dotnet8-to-dotnet9 ASP.NET Core app with DI validation, forwarded headers, and HttpClientFactory casting 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ migrate-dotnet8-to-dotnet9; tools: skill / ✅ migrate-dotnet8-to-dotnet9; tools: skill — / — ✅ 0.11
migrate-dotnet8-to-dotnet9 EF Core app with migration patterns and Cosmos DB discriminator 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet8-to-dotnet9; tools: skill / ✅ migrate-dotnet8-to-dotnet9; tools: skill — / — ✅ 0.11
migrate-dotnet8-to-dotnet9 EF Core Cosmos DB app with existing documents and composite id format 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ migrate-dotnet8-to-dotnet9; tools: skill / ✅ migrate-dotnet8-to-dotnet9; tools: skill — / — ✅ 0.11
migrate-dotnet8-to-dotnet9 App with JsonDocument null deserialization and BinaryFormatter fallback 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet8-to-dotnet9; tools: skill / ✅ migrate-dotnet8-to-dotnet9; tools: skill — / — ✅ 0.11 [29]
migrate-dotnet8-to-dotnet9 CI pipeline with Terminal Logger parsing and version constraints 2.0/5 → 3.0/5 🟢 2.0/5 → 5.0/5 🟢 ℹ️ not activated (expected) / ✅ migrate-dotnet8-to-dotnet9; tools: skill — / — ✅ 0.11
migrate-dotnet8-to-dotnet9 WinForms app with custom UserControls and PictureBox URL loading 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet8-to-dotnet9; tools: skill, bash / ✅ migrate-dotnet8-to-dotnet9; tools: skill, bash — / — ✅ 0.11
migrate-dotnet8-to-dotnet9 Containerized app with zlib dependency and runtime configuration 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ migrate-dotnet8-to-dotnet9; tools: skill, bash / ✅ migrate-dotnet8-to-dotnet9; tools: skill — / — ✅ 0.11
migrate-dotnet8-to-dotnet9 EF Core Cosmos DB app with discriminator and sync I/O 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet8-to-dotnet9; tools: skill, bash / ✅ migrate-dotnet8-to-dotnet9; tools: skill, bash — / — ✅ 0.11
migrate-dotnet8-to-dotnet9 Library with String.Trim span overload, keyed services, and InlineArray 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet8-to-dotnet9; tools: skill, bash / ✅ migrate-dotnet8-to-dotnet9; tools: skill, bash — / — ✅ 0.11 [30]
migrate-dotnet8-to-dotnet9 Containerized app with env var precedence reversal and zlib removal 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet8-to-dotnet9; tools: skill / ✅ migrate-dotnet8-to-dotnet9; tools: skill — / — ✅ 0.11 [31]
thread-abort-migration Worker thread with abort-based cancellation 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ thread-abort-migration; tools: report_intent, skill / ✅ thread-abort-migration; tools: skill — / — ✅ 0.10 [32]
thread-abort-migration Timeout enforcement via Thread.Abort 5.0/5 → 5.0/5 5.0/5 → 4.0/5 🔴 ✅ thread-abort-migration; tools: skill / ✅ thread-abort-migration; tools: skill — / — ✅ 0.10
thread-abort-migration Blocking WaitHandle with Thread.Interrupt 4.0/5 → 3.0/5 🔴 4.0/5 → 4.0/5 ✅ thread-abort-migration; tools: skill, report_intent, bash / ✅ thread-abort-migration; tools: skill — / — ✅ 0.10
thread-abort-migration ASP.NET Response.End and Response.Redirect with Thread.Abort 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ thread-abort-migration; tools: skill, report_intent, create / ✅ thread-abort-migration; tools: report_intent, skill — / — ✅ 0.10
thread-abort-migration Thread.Join and Thread.Sleep only — should not migrate 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ⚠️ NOT ACTIVATED / ✅ thread-abort-migration; tools: skill — / — ✅ 0.10
migrate-nullable-references Enable NRT in a small library with mixed nullability 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-nullable-references; tools: skill / ✅ migrate-nullable-references; tools: skill — / — ✅ 0.13 [33]
migrate-nullable-references File-by-file migration: only modify the targeted file 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ⚠️ NOT ACTIVATED / ⚠️ NOT ACTIVATED — / — ✅ 0.13 [34]
migrate-nullable-references Enable NRT in ASP.NET Core Web API with EF Core 3.0/5 → 3.0/5 3.0/5 → 4.0/5 🟢 ⚠️ NOT ACTIVATED / ⚠️ NOT ACTIVATED — / — ✅ 0.13 [35]
dotnet-aot-compat Make Azure.ResourceManager AOT-compatible 1.0/5 ⏰ → 1.0/5 ⏰ 1.0/5 ⏰ → 3.0/5 ⏰ 🟢 ✅ dotnet-aot-compat; tools: skill, edit, create / ✅ dotnet-aot-compat; tools: skill, edit, create explore / explore, task ✅ 0.16
migrate-dotnet10-to-dotnet11 Console app with compression and TAR operations 4.0/5 → 4.0/5 4.0/5 → 5.0/5 🟢 ✅ migrate-dotnet10-to-dotnet11; tools: skill / ✅ migrate-dotnet10-to-dotnet11; tools: skill — / — ✅ 0.05
migrate-dotnet10-to-dotnet11 C# 15 compiler breaking changes — Span safe-context, nameof, with() 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ migrate-dotnet10-to-dotnet11; tools: skill, report_intent, view / ✅ migrate-dotnet10-to-dotnet11; tools: skill, report_intent, view — / — ✅ 0.05
migrate-dotnet10-to-dotnet11 EF Core app with Cosmos DB provider using sync APIs 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-dotnet10-to-dotnet11; tools: skill, report_intent, view / ✅ migrate-dotnet10-to-dotnet11; tools: report_intent, skill, view — / — ✅ 0.05 [36]
migrate-dotnet10-to-dotnet11 Deployment to older hardware with minimum requirement changes 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet10-to-dotnet11; tools: report_intent, skill, view / ✅ migrate-dotnet10-to-dotnet11; tools: report_intent, skill, view — / — ✅ 0.05
migrate-dotnet10-to-dotnet11 Cryptography app using DSA on macOS 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet10-to-dotnet11; tools: report_intent, skill, view / ✅ migrate-dotnet10-to-dotnet11; tools: report_intent, skill, view — / — ✅ 0.05 [37]
migrate-dotnet10-to-dotnet11 Basic TFM update with Docker and global.json 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-dotnet10-to-dotnet11; tools: skill / ✅ migrate-dotnet10-to-dotnet11; tools: skill — / — ✅ 0.05
migrate-dotnet10-to-dotnet11 C# 15 dynamic operator and ref readonly delegate issues 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ migrate-dotnet10-to-dotnet11; tools: report_intent, skill, view / ✅ migrate-dotnet10-to-dotnet11; tools: skill, report_intent, view — / — ✅ 0.05
exp-test-anti-patterns Detect mixed severity anti-patterns in repository service tests 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ exp-test-anti-patterns; tools: report_intent, skill / ✅ exp-test-anti-patterns; tools: report_intent, skill — / — ✅ 0.06 [38]
exp-test-anti-patterns Detect flakiness indicators and test coupling 2.0/5 → 5.0/5 🟢 2.0/5 → 4.0/5 🟢 ✅ exp-test-anti-patterns; tools: report_intent, skill / ✅ exp-test-anti-patterns; tools: report_intent, skill — / — ✅ 0.06
exp-test-anti-patterns Detect duplicated tests and magic values 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ exp-test-anti-patterns; tools: report_intent, skill / ✅ exp-test-anti-patterns; tools: skill — / — ✅ 0.06
exp-test-anti-patterns Recognize well-written tests without inventing false positives 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ exp-test-anti-patterns; tools: report_intent, skill / ✅ exp-test-anti-patterns; tools: report_intent, skill — / — ✅ 0.06
exp-test-tagging Tag an untagged MSTest test suite 3.0/5 → 5.0/5 🟢 3.0/5 → 4.0/5 🟢 ✅ exp-test-tagging; tools: skill / ✅ exp-test-tagging; tools: skill — / explore, general-purpose 🟡 0.24
exp-test-tagging Tag an untagged xUnit test suite 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ exp-test-tagging; tools: skill / ✅ exp-test-tagging; tools: skill — / — 🟡 0.24
exp-test-tagging Tag an untagged NUnit test suite 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ exp-test-tagging; tools: skill, glob / ✅ exp-test-tagging; tools: skill, glob — / — 🟡 0.24
exp-test-tagging Audit test distribution without modifying files 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ exp-test-tagging; tools: skill / ✅ exp-test-tagging; tools: skill — / — 🟡 0.24
exp-test-tagging Decline request to write new tests 3.0/5 → 3.0/5 3.0/5 → 4.0/5 🟢 ℹ️ not activated (expected) / ℹ️ not activated (expected) — / — 🟡 0.24
exp-crap-score Calculate CRAP score for a single method with partial coverage 3.0/5 → 5.0/5 🟢 3.0/5 → 4.0/5 🟢 ✅ exp-crap-score; tools: skill, bash / ✅ exp-crap-score; tools: skill, bash — / — ✅ 0.08
exp-crap-score Identify riskiest methods across a file 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ exp-crap-score; tools: skill, bash / ✅ exp-crap-score; tools: skill, bash — / — ✅ 0.08
exp-crap-score Generate coverage then compute CRAP score 4.0/5 → 2.0/5 🔴 4.0/5 → 3.0/5 🔴 ✅ exp-crap-score; tools: skill / ✅ exp-crap-score; tools: skill — / — ✅ 0.08
migrate-mstest-v1v2-to-v3 Migrate MSTest v1 project with assembly reference 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v1v2-to-v3; tools: skill / ✅ migrate-mstest-v1v2-to-v3; tools: skill — / — ✅ 0.04
migrate-mstest-v1v2-to-v3 Migrate MSTest v2 NuGet project to v3 3.0/5 → 3.0/5 3.0/5 → 3.0/5 ✅ migrate-mstest-v1v2-to-v3; tools: skill / ✅ migrate-mstest-v1v2-to-v3; tools: skill — / — ✅ 0.04 [39]
migrate-mstest-v1v2-to-v3 Fix Assert.AreEqual object overload errors after v3 upgrade 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v1v2-to-v3; tools: skill / ✅ migrate-mstest-v1v2-to-v3; tools: skill — / — ✅ 0.04
migrate-mstest-v1v2-to-v3 Migrate from .testsettings to .runsettings 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ migrate-mstest-v1v2-to-v3; tools: skill, bash / ✅ migrate-mstest-v1v2-to-v3; tools: skill, bash — / — ✅ 0.04 [40]
migrate-mstest-v1v2-to-v3 Fix DataRow type mismatch errors after v3 upgrade 3.0/5 → 3.0/5 3.0/5 → 3.0/5 ✅ migrate-mstest-v1v2-to-v3; tools: skill / ✅ migrate-mstest-v1v2-to-v3; tools: skill — / — ✅ 0.04 [41]
migrate-mstest-v1v2-to-v3 Migrate to MSTest.Sdk project style 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v1v2-to-v3; tools: skill, bash / ✅ migrate-mstest-v1v2-to-v3; tools: skill, bash — / — ✅ 0.04
migrate-mstest-v1v2-to-v3 Handle dropped target framework during v3 migration 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ⚠️ NOT ACTIVATED / ⚠️ NOT ACTIVATED — / — ✅ 0.04 [42]
migrate-mstest-v1v2-to-v3 Migrate complex MSTest v2 project with testsettings, DataRow issues, and dropped TFM 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v1v2-to-v3; tools: skill / ✅ migrate-mstest-v1v2-to-v3; tools: skill — / — ✅ 0.04
migrate-mstest-v1v2-to-v3 Correctly identify MSTest v1 vs v2 and recommend different migration paths 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ⚠️ NOT ACTIVATED / ✅ migrate-mstest-v1v2-to-v3; tools: skill explore / — ✅ 0.04
code-testing-agent Generate tests for ContosoUniversity ASP.NET Core MVC app 4.0/5 → 3.0/5 🔴 4.0/5 → 4.0/5 ✅ code-testing-agent; tools: skill, glob / ✅ code-testing-agent; tools: skill explore / explore ✅ 0.02
writing-mstest-tests Write unit tests for a service class 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ writing-mstest-tests; tools: skill, task, glob, grep / ⚠️ NOT ACTIVATED explore / general-purpose 🟡 0.28 [43]
writing-mstest-tests Write data-driven tests for a calculator 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ writing-mstest-tests; tools: skill / ✅ writing-mstest-tests; tools: skill — / — 🟡 0.28 [44]
writing-mstest-tests Write async tests with cancellation 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ writing-mstest-tests; tools: skill / ✅ writing-mstest-tests; tools: skill — / — 🟡 0.28
writing-mstest-tests Fix swapped Assert.AreEqual arguments 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ writing-mstest-tests; tools: report_intent, skill / ⚠️ NOT ACTIVATED — / — 🟡 0.28 [45]
writing-mstest-tests Modernize legacy test patterns 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ writing-mstest-tests; tools: skill / ✅ writing-mstest-tests; tools: skill — / — 🟡 0.28
writing-mstest-tests Replace ExpectedException with Assert.Throws 3.0/5 → 4.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ writing-mstest-tests; tools: skill / ✅ writing-mstest-tests; tools: skill — / — 🟡 0.28
writing-mstest-tests Use proper collection assertions 4.0/5 → 3.0/5 🔴 4.0/5 → 2.0/5 🔴 ✅ writing-mstest-tests; tools: skill / ✅ writing-mstest-tests; tools: skill — / — 🟡 0.28
writing-mstest-tests Use proper type assertions instead of casts 2.0/5 → 3.0/5 🟢 2.0/5 → 5.0/5 🟢 ⚠️ NOT ACTIVATED / ✅ writing-mstest-tests; tools: skill — / — 🟡 0.28
writing-mstest-tests Set up test lifecycle correctly 3.0/5 → 4.0/5 🟢 3.0/5 → 4.0/5 🟢 ✅ writing-mstest-tests; tools: skill / ✅ writing-mstest-tests; tools: skill — / — 🟡 0.28
writing-mstest-tests Use DynamicData with ValueTuples over object arrays 1.0/5 → 3.0/5 🟢 1.0/5 → 3.0/5 🟢 ✅ writing-mstest-tests; tools: skill / ✅ writing-mstest-tests; tools: skill — / — 🟡 0.28
migrate-vstest-to-mtp Migrate MSTest project from VSTest to Microsoft.Testing.Platform 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-vstest-to-mtp; tools: skill / ✅ migrate-vstest-to-mtp; tools: skill, report_intent, view — / — ✅ 0.09 [46]
migrate-vstest-to-mtp Migrate NUnit project from VSTest to Microsoft.Testing.Platform 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ migrate-vstest-to-mtp; tools: report_intent, skill / ✅ migrate-vstest-to-mtp; tools: skill — / — ✅ 0.09
migrate-vstest-to-mtp Migrate xUnit.net v2 project from VSTest to Microsoft.Testing.Platform 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ migrate-vstest-to-mtp; tools: skill, report_intent, view / ✅ migrate-vstest-to-mtp; tools: skill — / — ✅ 0.09
migrate-vstest-to-mtp Update Azure DevOps pipeline from VSTest task to MTP 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ migrate-vstest-to-mtp; tools: skill / ✅ migrate-vstest-to-mtp; tools: skill — / — ✅ 0.09
migrate-vstest-to-mtp Migrate MSTest.Sdk project that explicitly uses VSTest 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-vstest-to-mtp; tools: skill / ✅ migrate-vstest-to-mtp; tools: skill — / — ✅ 0.09
migrate-vstest-to-mtp Translate dotnet test VSTest arguments to MTP equivalents 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ migrate-vstest-to-mtp; tools: skill / ✅ migrate-vstest-to-mtp; tools: skill — / — ✅ 0.09 [47]
migrate-vstest-to-mtp Handle exit code 8 when migrating from VSTest to MTP 3.0/5 → 4.0/5 🟢 3.0/5 → 2.0/5 🔴 ✅ migrate-vstest-to-mtp; tools: skill / ⚠️ NOT ACTIVATED — / — ✅ 0.09
migrate-vstest-to-mtp Configure dotnet test MTP mode on .NET 10 SDK 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ migrate-vstest-to-mtp; tools: skill / ✅ migrate-vstest-to-mtp; tools: skill — / — ✅ 0.09
migrate-vstest-to-mtp Migrate xUnit.net VSTest filter syntax to MTP 1.0/5 → 5.0/5 🟢 1.0/5 → 4.0/5 🟢 ✅ migrate-vstest-to-mtp; tools: skill / ✅ migrate-vstest-to-mtp; tools: skill — / — ✅ 0.09
migrate-vstest-to-mtp Full VSTest to MTP migration plan for MSTest solution 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ migrate-vstest-to-mtp; tools: skill / ✅ migrate-vstest-to-mtp; tools: skill, create — / — ✅ 0.09 [48]
run-tests Run tests in a VSTest MSTest project 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ run-tests; tools: skill / ✅ run-tests; tools: skill — / — ✅ 0.19
run-tests Run tests with trx reporting on MTP project (SDK 9) 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ run-tests; tools: skill / ✅ run-tests; tools: skill — / — ✅ 0.19
run-tests Run tests with blame-hang on MTP project (SDK 10) 2.0/5 → 2.0/5 2.0/5 → 2.0/5 ✅ run-tests; tools: skill, bash / ✅ run-tests; tools: skill, bash — / — ✅ 0.19 [49]
run-tests Run tests in a multi-TFM project targeting a specific framework 2.0/5 → 4.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ run-tests; tools: skill, bash / ✅ run-tests; tools: skill, bash — / — ✅ 0.19
run-tests Filter MSTest tests by category on VSTest 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ run-tests; tools: skill, bash / ⚠️ NOT ACTIVATED — / — ✅ 0.19 [50]
run-tests Filter NUnit tests by class name on VSTest 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ run-tests; tools: skill, bash / ⚠️ NOT ACTIVATED — / — ✅ 0.19
run-tests Filter xUnit v3 tests by class on MTP 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ run-tests; tools: skill, bash / ✅ run-tests; tools: skill, bash, grep — / — ✅ 0.19
run-tests Filter xUnit v3 tests by trait on MTP 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ run-tests; tools: skill, view / ✅ run-tests; tools: skill, view — / — ✅ 0.19
run-tests Filter TUnit tests by class using treenode-filter 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ run-tests; tools: skill, bash / ⚠️ NOT ACTIVATED — / — ✅ 0.19
run-tests Combine multiple filter criteria on VSTest MSTest 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ run-tests; tools: skill, bash / ✅ run-tests; tools: skill — / — ✅ 0.19 [51]
run-tests MTP project on SDK 9 must use -- separator for args 1.0/5 → 2.0/5 🟢 1.0/5 → 5.0/5 🟢 ⚠️ NOT ACTIVATED / ✅ run-tests; tools: skill — / — ✅ 0.19
run-tests MTP project on SDK 10 passes args directly 2.0/5 → 4.0/5 🟢 2.0/5 → 4.0/5 🟢 ✅ run-tests; tools: skill / ✅ run-tests; tools: skill, create — / — ✅ 0.19
run-tests Detect test platform from Directory.Build.props 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ run-tests; tools: skill / ✅ run-tests; tools: skill — / — ✅ 0.19
run-tests Negative test: do not use MTP syntax for a VSTest project 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ run-tests; tools: skill, view / ✅ run-tests; tools: skill, view — / — ✅ 0.19 [52]
mtp-hot-reload Suggest hot reload for failing test in MTP project (SDK 9) 1.0/5 → 4.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ mtp-hot-reload; tools: skill / ✅ mtp-hot-reload; tools: skill — / — ✅ 0.11
mtp-hot-reload Suggest hot reload for failing test in MTP project (SDK 10) 1.0/5 → 4.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ mtp-hot-reload; tools: skill, bash, create / ✅ mtp-hot-reload; tools: skill — / — ✅ 0.11
mtp-hot-reload Enable hot reload when package already installed 2.0/5 → 5.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ mtp-hot-reload; tools: skill / ✅ mtp-hot-reload; tools: skill, glob — / — ✅ 0.11
mtp-hot-reload Suggest launchSettings.json configuration for hot reload 1.0/5 → 5.0/5 🟢 1.0/5 → 5.0/5 🟢 ✅ mtp-hot-reload; tools: skill, bash, create / ✅ mtp-hot-reload; tools: skill, bash, create — / — ✅ 0.11
mtp-hot-reload Use dotnet run not dotnet test for hot reload 1.0/5 → 3.0/5 🟢 1.0/5 → 3.0/5 🟢 ✅ mtp-hot-reload; tools: skill / ✅ mtp-hot-reload; tools: skill — / — ✅ 0.11
mtp-hot-reload Negative: VSTest project cannot use MTP hot reload 2.0/5 → 2.0/5 2.0/5 → 5.0/5 🟢 ✅ mtp-hot-reload; tools: skill, create / ✅ mtp-hot-reload; tools: skill — / — ✅ 0.11
mtp-hot-reload Run specific failing test with hot reload filter 1.0/5 → 3.0/5 🟢 1.0/5 → 3.0/5 🟢 ✅ mtp-hot-reload; tools: report_intent, skill, view / ✅ mtp-hot-reload; tools: report_intent, skill, view — / — ✅ 0.11
migrate-mstest-v3-to-v4 Migrate custom TestMethodAttribute from Execute to ExecuteAsync 1.0/5 → 3.0/5 🟢 1.0/5 → 3.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ✅ migrate-mstest-v3-to-v4; tools: skill — / — ✅ 0.06
migrate-mstest-v3-to-v4 Replace ExpectedExceptionAttribute with Assert.ThrowsExactly 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ✅ migrate-mstest-v3-to-v4; tools: skill — / — ✅ 0.06
migrate-mstest-v3-to-v4 Fix multiple v4 breaking changes: Assert, ClassCleanup, TestContext, Timeout 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ✅ migrate-mstest-v3-to-v4; tools: skill — / — ✅ 0.06
migrate-mstest-v3-to-v4 Handle net6.0 target framework dropped in MSTest v4 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ⚠️ NOT ACTIVATED — / — ✅ 0.06
migrate-mstest-v3-to-v4 Fix TestMethodAttribute CallerInfo constructor breaking change 3.0/5 → 4.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ✅ migrate-mstest-v3-to-v4; tools: skill — / — ✅ 0.06
migrate-mstest-v3-to-v4 Understand behavioral changes after MSTest v4 upgrade 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ✅ migrate-mstest-v3-to-v4; tools: skill — / — ✅ 0.06
migrate-mstest-v3-to-v4 Handle MSTest.Sdk and MTP changes in v4 2.0/5 → 3.0/5 🟢 2.0/5 → 3.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ✅ migrate-mstest-v3-to-v4; tools: skill — / — ✅ 0.06
migrate-mstest-v3-to-v4 Full MSTest v3 to v4 migration with multiple breaking changes 3.0/5 → 5.0/5 🟢 3.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ✅ migrate-mstest-v3-to-v4; tools: skill — / — ✅ 0.06
migrate-mstest-v3-to-v4 Migrate MSTest.Sdk v3 project using ManagedType and TestTimeout 3.0/5 → 3.0/5 3.0/5 → 4.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ✅ migrate-mstest-v3-to-v4; tools: skill — / — ✅ 0.06
migrate-mstest-v3-to-v4 Correctly identify MSTest v3 project and recommend v4 migration 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ migrate-mstest-v3-to-v4; tools: skill / ✅ migrate-mstest-v3-to-v4; tools: skill — / — ✅ 0.06
template-authoring Validate a template.json file 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ template-authoring; tools: skill / ✅ template-authoring; tools: skill — / — ✅ 0.06 [53]
template-discovery Find template for web API project 2.0/5 → 4.0/5 🟢 2.0/5 → 5.0/5 🟢 ✅ template-discovery; tools: report_intent, skill, bash / ✅ template-discovery; tools: report_intent, skill, bash — / — 🟡 0.27
template-discovery Inspect template parameters 4.0/5 → 4.0/5 4.0/5 → 4.0/5 ✅ template-discovery; tools: skill / ✅ template-discovery; tools: skill — / — 🟡 0.27 [54]
template-instantiation Create a console application 4.0/5 → 5.0/5 🟢 4.0/5 → 5.0/5 🟢 ✅ template-instantiation; tools: skill / ✅ template-instantiation; tools: skill — / — 🟡 0.35
template-instantiation Create project with specific framework 5.0/5 → 5.0/5 5.0/5 → 5.0/5 ✅ template-instantiation; tools: skill / ✅ template-instantiation; tools: skill — / — 🟡 0.35

[1] (Isolated) Quality unchanged but weighted score is -6.5% due to: tokens (24237 → 49377), tool calls (2 → 3)
[2] (Plugin) Quality unchanged but weighted score is -10.0% due to: tokens (11712 → 53708), tool calls (0 → 4), time (8.8s → 21.3s)
[3] (Plugin) Quality unchanged but weighted score is -10.0% due to: tokens (11827 → 54144), tool calls (0 → 3), time (12.1s → 29.0s)
[4] (Plugin) Quality unchanged but weighted score is -23.9% due to: completion (✓ → ✗), tokens (40471 → 102106), tool calls (5 → 10), time (37.5s → 58.2s)
[5] (Plugin) Quality unchanged but weighted score is -6.4% due to: tokens (38693 → 67427), tool calls (4 → 9)
[6] (Plugin) Quality unchanged but weighted score is -6.0% due to: tokens (36291 → 94113), tool calls (3 → 8), time (22.7s → 45.7s)
[7] (Plugin) Quality unchanged but weighted score is -13.2% due to: tokens (27522 → 113939), quality, tool calls (2 → 9), time (35.9s → 97.2s)
[8] (Plugin) Quality unchanged but weighted score is -7.3% due to: tokens (28886 → 156385), tool calls (2 → 12), time (61.2s → 174.9s)
[9] (Isolated) Quality unchanged but weighted score is -9.3% due to: tokens (29806 → 127300), tool calls (2 → 10), time (65.9s → 112.5s)
[10] (Plugin) Quality unchanged but weighted score is -15.3% due to: quality, tokens (27426 → 148501), tool calls (2 → 10), time (34.0s → 158.1s)
[11] (Isolated) Quality unchanged but weighted score is -6.5% due to: tokens (24360 → 43539), tool calls (2 → 5)
[12] (Plugin) Quality unchanged but weighted score is -10.2% due to: quality, time (124.7s → 170.5s)
[13] (Isolated) Quality unchanged but weighted score is -7.9% due to: tokens (11948 → 41241), tool calls (0 → 4)
[14] (Plugin) Quality unchanged but weighted score is -8.4% due to: tokens (11875 → 26376), tool calls (0 → 1), time (11.0s → 15.0s)
[15] (Plugin) Quality unchanged but weighted score is -4.9% due to: tokens (56520 → 114097)
[16] (Plugin) Quality unchanged but weighted score is -2.4% due to: tokens (71629 → 138377), time (49.8s → 65.3s)
[17] (Plugin) Quality unchanged but weighted score is -30.2% due to: quality, judgment, tokens (56434 → 86041), tool calls (17 → 22)
[18] (Isolated) Quality improved but weighted score is -9.2% due to: tokens (67804 → 296253), tool calls (8 → 25), time (66.7s → 112.8s)
[19] (Isolated) Quality unchanged but weighted score is -10.0% due to: tokens (41176 → 291517), tool calls (6 → 16), time (44.6s → 133.6s)
[20] (Plugin) Quality unchanged but weighted score is -3.8% due to: tokens (111258 → 169763), time (69.0s → 88.5s)
[21] (Plugin) Quality unchanged but weighted score is -4.6% due to: tokens (174573 → 296288), time (108.7s → 137.6s), tool calls (15 → 18)
[22] (Plugin) Quality unchanged but weighted score is -6.2% due to: tokens (14910 → 77552), tool calls (0 → 5)
[23] (Isolated) Quality unchanged but weighted score is -18.7% due to: quality, judgment
[24] (Plugin) Quality unchanged but weighted score is -5.7% due to: tokens (30165 → 56706), tool calls (2 → 3)
[25] (Plugin) Quality unchanged but weighted score is -0.1% due to: efficiency metrics
[26] (Isolated) Quality unchanged but weighted score is -0.9% due to: quality
[27] (Isolated) Quality unchanged but weighted score is -0.3% due to: quality
[28] (Isolated) Quality improved but weighted score is -7.4% due to: tokens (13042 → 55731), tool calls (0 → 5)
[29] (Isolated) Quality unchanged but weighted score is -6.9% due to: tokens (132348 → 245944), time (69.8s → 118.7s), tool calls (17 → 23)
[30] (Plugin) Quality unchanged but weighted score is -10.0% due to: tokens (57067 → 274919), tool calls (13 → 26), time (60.9s → 169.6s)
[31] (Isolated) Quality unchanged but weighted score is -5.0% due to: tokens (60896 → 113737), tool calls (12 → 18)
[32] (Plugin) Quality unchanged but weighted score is -7.7% due to: tokens (13008 → 32275), tool calls (0 → 1)
[33] (Plugin) Quality unchanged but weighted score is -3.5% due to: tokens (125387 → 198873)
[34] (Isolated) Quality unchanged but weighted score is -4.6% due to: tokens (64767 → 109290), time (32.0s → 44.0s)
[35] (Isolated) Quality unchanged but weighted score is -13.0% due to: judgment, quality
[36] (Plugin) Quality unchanged but weighted score is -7.2% due to: tokens (12488 → 49463), tool calls (0 → 4)
[37] (Isolated) Quality improved but weighted score is -7.6% due to: tokens (12702 → 45256), tool calls (0 → 4)
[38] (Plugin) Quality unchanged but weighted score is -7.7% due to: tokens (12978 → 31273), tool calls (0 → 2), time (14.3s → 49.4s)
[39] (Plugin) Quality unchanged but weighted score is -7.3% due to: completion (✓ → ✗)
[40] (Plugin) Quality unchanged but weighted score is -5.5% due to: quality, tokens (50020 → 68393)
[41] (Isolated) Quality unchanged but weighted score is -20.9% due to: judgment, quality, tokens (93012 → 160044), time (48.3s → 62.1s), tool calls (9 → 11)
[42] (Isolated) Quality unchanged but weighted score is -12.1% due to: judgment
[43] (Plugin) Quality unchanged but weighted score is -6.1% due to: tokens (201546 → 311870), time (77.1s → 143.1s), tool calls (17 → 25)
[44] (Plugin) Quality unchanged but weighted score is -7.1% due to: tokens (156380 → 366568), tool calls (17 → 25), time (77.5s → 105.6s)
[45] (Plugin) Quality unchanged but weighted score is -3.2% due to: time (5.7s → 10.8s), tokens (11667 → 13921)
[46] (Plugin) Quality unchanged but weighted score is -6.8% due to: tokens (13269 → 52015), tool calls (0 → 3)
[47] (Plugin) Quality unchanged but weighted score is -4.8% due to: tokens (12653 → 32995), tool calls (0 → 1)
[48] (Isolated) Quality improved but weighted score is -0.3% due to: judgment
[49] (Plugin) Quality unchanged but weighted score is -2.0% due to: tokens (37158 → 731107), tool calls (4 → 33), time (27.2s → 258.7s)
[50] (Plugin) Quality unchanged but weighted score is -2.3% due to: time (8.0s → 12.3s), tokens (23850 → 28382)
[51] (Plugin) Quality unchanged but weighted score is -9.5% due to: tokens (23988 → 65804), tool calls (3 → 6), time (10.4s → 18.7s)
[52] (Plugin) Quality unchanged but weighted score is -1.5% due to: tokens (23534 → 66005), tool calls (2 → 6), time (14.0s → 25.0s)
[53] (Plugin) Quality unchanged but weighted score is -4.8% due to: tokens (23695 → 42011), time (16.6s → 22.9s)
[54] (Isolated) Quality unchanged but weighted score is -1.1% due to: judgment, tokens (25414 → 40777), tool calls (2 → 3)

timeout — run hit the scenario timeout limit; scoring may be impacted by aborting model execution before it could produce its full output

Model: claude-opus-4.6 | Judge: claude-opus-4.6

Full results

github-actions Bot added a commit that referenced this pull request Mar 24, 2026
@JanKrivanek
Member Author

/evaluate

@github-actions
Contributor

Skill Validation Results

Skill Scenario Quality (Isolated) Quality (Plugin) Skills Loaded Agents Invoked Overfit Verdict
code-testing-agent Generate tests for ContosoUniversity ASP.NET Core MVC app 3.7/5 → 4.0/5 🟢 3.7/5 → 3.3/5 🔴 ✅ code-testing-agent; tools: skill, task / ✅ code-testing-agent; tools: skill, task explore / explore ✅ 0.02

Model: claude-opus-4.6 | Judge: claude-opus-4.6

Full results

@JanKrivanek JanKrivanek marked this pull request as ready for review March 30, 2026 13:01
Copilot AI review requested due to automatic review settings March 30, 2026 13:01
Contributor

Copilot AI left a comment

Pull request overview

Adds a proof-of-concept “code testing agent” skill (with supporting sub-agents and prompts) plus a non-trivial ASP.NET Core MVC sample app (ContosoUniversity) and an evaluation scenario to drive test generation against it.

Changes:

  • Introduces code-testing-agent skill docs/prompts and a suite of RPI (Research → Plan → Implement) sub-agent definitions for test generation.
  • Adds a ContosoUniversity ASP.NET Core MVC + EF Core sample app used as the real-world target for generated tests.
  • Adds a dotnet-test evaluation scenario (eval.yaml) to validate that generated tests compile, pass, and produce coverage output.

Reviewed changes

Copilot reviewed 42 out of 42 changed files in this pull request and generated 9 comments.

| File | Description |
| --- | --- |
| tests/dotnet-test/code-testing-agent/eval.yaml | Evaluation scenario that prompts the agent to generate tests and runs dotnet test with coverage collection. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/appsettings.json | Adds app settings + default SQL Server connection string for the sample app. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Services/NotificationService.cs | In-memory notification queue service used by controllers. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Program.cs | Minimal hosting startup configuring EF Core, MVC, session, and static file serving. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/PaginatedList.cs | Pagination helper used by list views/controllers. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/Student.cs | Student entity model. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/SchoolViewModels/InstructorIndexData.cs | ViewModel for instructor index (instructors/courses/enrollments). |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/SchoolViewModels/EnrollmentDateGroup.cs | ViewModel used by Home/About grouping. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/SchoolViewModels/AssignedCourseData.cs | ViewModel for course assignment UI. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/Person.cs | Base class for TPH inheritance (Student/Instructor). |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/OfficeAssignment.cs | OfficeAssignment entity model. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/Notification.cs | Notification entity + EntityOperation enum. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/Instructor.cs | Instructor entity model. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/ErrorViewModel.cs | Error view model. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/Enrollment.cs | Enrollment entity model + Grade enum. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/Department.cs | Department entity model (includes concurrency token). |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/CourseAssignment.cs | Join entity for instructor/course assignments. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Models/Course.cs | Course entity model (includes uploaded-material path). |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Data/SchoolContextFactory.cs | Context factory wiring SQL Server options from configuration. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Data/SchoolContext.cs | EF Core DbContext with TPH config and relationships. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Data/DbInitializer.cs | Seeds initial data on first run via EnsureCreated(). |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Controllers/StudentsController.cs | CRUD controller for students + notification hooks. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Controllers/NotificationsController.cs | Basic endpoints/UI for retrieving/marking notifications. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Controllers/InstructorsController.cs | CRUD controller for instructors + course assignment management. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Controllers/HomeController.cs | Home/About/Contact/Error actions for the sample app. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Controllers/DepartmentsController.cs | CRUD controller for departments + concurrency handling + notifications. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Controllers/CoursesController.cs | CRUD controller for courses + teaching material upload/delete logic. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/Controllers/BaseController.cs | Shared base controller providing DbContext + notification helper. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/ContosoUniversity.sln | Solution file for the sample app. |
| tests/dotnet-test/code-testing-agent/ContosoUniversity/ContosoUniversity.csproj | SDK-style web project targeting net10.0 with EF Core + UI packages. |
| plugins/dotnet-test/skills/code-testing-agent/unit-test-generation.prompt.md | Cross-language unit test generation prompt used as default guidance. |
| plugins/dotnet-test/skills/code-testing-agent/extensions/dotnet.md | .NET-specific guidance for builds/tests, references, and error handling. |
| plugins/dotnet-test/skills/code-testing-agent/SKILL.md | Skill doc describing the multi-agent RPI pipeline and usage guidance. |
| plugins/dotnet-test/plugin.json | Registers the new agent definitions under the dotnet-test plugin. |
| plugins/dotnet-test/agents/code-testing-generator.agent.md | Orchestrator agent for Research/Plan/Implement + final validation steps. |
| plugins/dotnet-test/agents/code-testing-researcher.agent.md | Sub-agent for repository research and convention discovery. |
| plugins/dotnet-test/agents/code-testing-planner.agent.md | Sub-agent for producing phased test implementation plans. |
| plugins/dotnet-test/agents/code-testing-implementer.agent.md | Sub-agent for implementing plan phases and verifying build/test. |
| plugins/dotnet-test/agents/code-testing-builder.agent.md | Sub-agent for compiling projects and reporting errors. |
| plugins/dotnet-test/agents/code-testing-tester.agent.md | Sub-agent for running tests and summarizing pass/fail output. |
| plugins/dotnet-test/agents/code-testing-fixer.agent.md | Sub-agent for addressing compilation errors based on build output. |
| plugins/dotnet-test/agents/code-testing-linter.agent.md | Sub-agent for formatting/linting fixes (polyglot). |
Comments suppressed due to low confidence (2)

tests/dotnet-test/code-testing-agent/ContosoUniversity/Controllers/CoursesController.cs:176

  • Returning ex.Message to the client can leak internal details (file paths, hints about the call stack). Since the exception is already logged, show the user a generic message instead of concatenating the exception text.
                        _logger.LogError(ex, "Error uploading file");
                        ModelState.AddModelError("teachingMaterialImage", "Error uploading file: " + ex.Message);
                        ViewBag.DepartmentID = new SelectList(db.Departments, "DepartmentID", "Name", course.DepartmentID);
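
One way to read the suggestion above, as a minimal sketch rather than the change actually made in this PR: keep the full exception in the server-side log and give the user a fixed string. The `serverLog` list and `modelErrors` dictionary are hypothetical stand-ins for `_logger` and `ModelState` so the snippet runs as a plain console program without ASP.NET Core, and the failing path is invented for illustration.

```csharp
using System;
using System.Collections.Generic;
using System.IO;

// Hypothetical stand-ins for _logger and ModelState (assumptions, not the PR's code).
var serverLog = new List<string>();
var modelErrors = new Dictionary<string, string>();

try
{
    // Simulated upload failure whose message contains an internal path.
    throw new IOException(@"Access to the path 'C:\inetpub\uploads\course1.png' is denied.");
}
catch (Exception ex)
{
    // Full exception details (type, message, stack) stay in the server-side log.
    serverLog.Add($"Error uploading file: {ex}");

    // The user-facing error is a fixed string that contains no exception text.
    modelErrors["teachingMaterialImage"] = "Error uploading file. Please try again.";
}

Console.WriteLine(modelErrors["teachingMaterialImage"]);
```

In the controller itself this would presumably amount to passing a constant message as the second argument of `ModelState.AddModelError` while leaving the `_logger.LogError(ex, ...)` call unchanged.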

tests/dotnet-test/code-testing-agent/ContosoUniversity/Controllers/CoursesController.cs:201

  • Same issue here: .Single() throws an InvalidOperationException when no course matches the id, so the course == null check is never reached. Use SingleOrDefault() or FirstOrDefault() and return NotFound() when the result is null.
            Course course = db.Courses.Include(c => c.Department).Where(c => c.CourseID == id).Single();
            if (course == null)
            {
                return NotFound();
            }
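
The difference can be seen with a small, self-contained sketch using LINQ to Objects. It assumes an in-memory array of anonymous objects as a hypothetical stand-in for `db.Courses`, and a `Lookup` helper that returns a string in place of the controller's `NotFound()` result; neither is part of the PR.

```csharp
using System;
using System.Linq;

// Hypothetical in-memory stand-in for db.Courses (illustration only).
var courses = new[]
{
    new { CourseID = 1050, Title = "Chemistry" },
    new { CourseID = 4022, Title = "Microeconomics" },
};

string Lookup(int id)
{
    // SingleOrDefault returns null when nothing matches, so the null check is reachable.
    var course = courses.SingleOrDefault(c => c.CourseID == id);
    return course == null ? "NotFound" : course.Title;
}

bool singleThrows;
try
{
    // Single throws InvalidOperationException when the filtered sequence is empty.
    courses.Single(c => c.CourseID == 9999);
    singleThrows = false;
}
catch (InvalidOperationException)
{
    singleThrows = true;
}

Console.WriteLine(Lookup(1050));   // prints "Chemistry"
Console.WriteLine(Lookup(9999));   // prints "NotFound"
Console.WriteLine(singleThrows);   // prints "True"
```

`SingleOrDefault` still throws if more than one element matches, which is usually the desired behavior for a primary-key lookup; `FirstOrDefault` would silently pick the first match instead.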


Comment thread tests/dotnet-test/code-testing-agent/ContosoUniversity/Program.cs
@JanKrivanek JanKrivanek requested a review from Evangelink March 30, 2026 13:37
@JanKrivanek JanKrivanek merged commit e4670b3 into main Mar 30, 2026
34 checks passed
@JanKrivanek JanKrivanek deleted the dev/jankrivanek/code-testing-agent branch March 30, 2026 15:57

3 participants