π₯ Daily Health Check β 2026-05-26
Status: π΄ 2 critical Β· π‘ 2 warnings Β· π΅ 1 info
Since yesterday: π 1 new Β· β
3 resolved Β· π 4 unchanged
Summary: A welcome improvement β 3 findings resolved today, including the evaluation failure rate critical and both dotnet-blazor--coordinate-components and dotnet-test failures. The Markdown Linter startup_failure has recurred on main as predicted. The chronic dotnet-msbuild evaluation failure and eval duration critical remain active.
π New Findings (1)
These appeared since the last health check (2026-05-25).
π‘ [P1] Markdown Linter workflow startup_failure on main
Fingerprint: pipeline:markdown-linter:startup:startup_failure
Severity: π‘ Warning | First seen: 2026-05-26
The Markdown Linter workflow (.github/workflows/markdown-linter.lock.yml) had a startup_failure on main with 0 jobs executed.
- Run #23 β 2026-05-25T14:58:56Z (schedule, main)
- Conclusion:
startup_failure β the agentic lock workflow failed before any jobs started
- Triggered by commit:
5810d492 ("test-anti-patterns / test-smell-detection: narrow-specialist re-pitch (Variant B)")
- 0 jobs ran; the workflow never started
Context: This finding was previously resolved on 2026-05-25 (no run in 24h window), but the prior investigation noted the root cause β super-linter/super-linter action not in .github/aw/actions-lock.json β was not fixed. As predicted, the failure recurred on the next scheduled run.
Suggested action: Add super-linter/super-linter to .github/aw/actions-lock.json (or equivalent action lock file) to resolve the startup failure permanently. π Investigation dispatched.
π Investigation Results
Deep investigations are dispatched for new critical/warning findings.
The grooming workflow links results ~3 hours after this run.
β
Resolved Since Yesterday (3)
These were in yesterday's report but are no longer detected.
π΄ Evaluation failed β evaluate (dotnet-test) / Run skill-validator
Fingerprint: pipeline:evaluation:evaluate-(dotnet-test):run-skill-validator:failure
Resolved: The evaluate (dotnet-test) job succeeded in today's scheduled evaluation run (run #3786), run time 2026-05-26T00:54:31Z. The runner preemption issue affecting dotnet-test appears to have been transient or the underlying contention resolved. The chronic evaluate (dotnet-msbuild) failure continues.
π΄ Evaluation failed β evaluate (dotnet-blazor--coordinate-components) / Run skill-validator
Fingerprint: pipeline:evaluation:evaluate-(dotnet-blazor--coordinate-components):run-skill-validator:failure
Resolved: No evaluate (dotnet-blazor--coordinate-components) job failure detected in the last 24h. The dotnet-blazor PR triggering this evaluation appears to have been merged or evaluation conditions changed. The evaluate (dotnet-blazor) job succeeded in run#3786.
π΄ Evaluation failure rate β Critical threshold exceeded
Fingerprint: pipeline:evaluation:failure-rate:critical
Resolved: Evaluation failure rate across all branches in the last 24h has dropped to ~14% (2 failures out of 14 non-cancelled runs), well below the Critical (>30%) and Warning (>15%) thresholds. The improvement is driven by the resolution of the dotnet-blazor--coordinate-components and dotnet-test failures β only the chronic dotnet-msbuild failure remains.
π Existing Findings (4)
These have been present since before today. Sorted by severity then age.
π΄ [P1] Evaluation failed β evaluate (dotnet-msbuild) / Run skill-validator Β· first seen 2026-05-20 Β· 7 occurrences
Fingerprint: pipeline:evaluation:evaluate-(dotnet-msbuild):run-skill-validator:failure
Severity: π΄ Critical
The evaluate (dotnet-msbuild) job continues to fail at the Run skill-validator step in every scheduled evaluation run on main. This is the 7th consecutive day.
Today's run (run #3786):
- Started:
2026-05-26T00:54:31Z, concluded: 2026-05-26T03:03:29Z (~129 min total)
- Job
evaluate (dotnet-msbuild): failed at Run skill-validator step
- All other evaluate jobs (dotnet, dotnet-test, dotnet-blazor, dotnet-aspnet, dotnet-data, dotnet-diag, dotnet-upgrade, dotnet-maui, dotnet-ai, dotnet-template-engine) succeeded
Root cause (from investigation): AITools.BinlogMcp MCP server startup causes resource exhaustion, leading to SIGTERM. The skill-validator evaluation job is killed while parallel vally-evaluate jobs succeed.
Suggested action: Remove or conditionally disable AITools.BinlogMcp during skill-validator evaluation. The BinlogMcp binary downloads at evaluation time from a custom NuGet feed, consuming resources that trigger runner preemption.
π΄ [P3] Evaluation avg duration ~129 min β critical threshold exceeded (>55 min) Β· first seen 2026-05-14 Β· 12 occurrences
Fingerprint: resource:eval-duration:critical
Severity: π΄ Critical
Scheduled evaluation runs consistently exceed the π΄ Critical threshold (>55 min). Today's scheduled run was 129 min.
Today's run (run #3786):
- Total run duration: ~129 min (00:54Z β 03:03Z)
- Note: run ended with a failure (dotnet-msbuild) but other jobs including dotnet-test completed successfully
7-day trend (scheduled runs):
| Date |
Duration |
| 2026-05-26 |
~129 min |
| 2026-05-25 |
124 min |
| 2026-05-24 |
122.5 min |
| 2026-05-23 |
119.5 min |
| 2026-05-22 |
~131.7 min |
| 7d avg |
~125 min |
Root cause (from investigation): The evaluate (dotnet-test) / vally-evaluate (dotnet-test) job now runs ~164 min of complex agent evaluation scenarios, dominating overall run duration.
Suggested action: Split the dotnet-test plugin evaluation into smaller scenario batches, or increase parallelism to bring total run time below 55 min.
π‘ [I8] Orphan plugin: dotnet-experimental not listed in marketplace.json Β· first seen 2026-05-14 Β· 12 occurrences
Fingerprint: infra:orphan-plugin:dotnet-experimental
Severity: π‘ Warning
plugins/dotnet-experimental/ has a valid plugin.json but is not registered in .github/plugin/marketplace.json. The plugin is not discoverable by consumers.
Suggested action: Either register in all three marketplace files (.github/plugin/, .claude-plugin/, .cursor-plugin/) per CONTRIBUTING.md, or remove the directory if it is no longer active.
π΅ [I4] evaluation.yml uses --verdict-warn-only mode Β· first seen 2026-05-16 Β· 10 occurrences
Fingerprint: infra:verdict-warn-only
Severity: π΅ Info
The evaluation workflow passes --verdict-warn-only to skill-validator. Evaluation failures are reported as warnings rather than hard PR-blocking errors. This is intentional configuration for the current rollout phase.
π Trends (7-day)
| Metric |
Today |
7d Avg |
Ξ |
Trend |
| Eval duration β scheduled (min) |
129 |
~125 |
+4 |
βοΈ Increasing (watch) |
| Eval success rate (main, scheduled) |
0% |
~0% |
0% |
β‘οΈ Stable |
| Eval success rate (all branches, 24h) |
~86% |
~40% |
+46% |
β
Improving |
| Eval scheduled cancellation rate |
0% |
0% |
0% |
β‘οΈ Stable |
| Eval failure rate (all branches, 24h) |
~14% |
~45% |
-31% |
β
Improving |
| Workflow failure rate (7d, main) |
~7% |
~15% |
-8% |
β
Improving |
| Compute hours/day |
~3.5h |
~4.5h |
-1h |
β
Improving |
π€ Generated by DevOps Health Check agentic workflow Β· Run #26432322026 Β· 2026-05-26T04:31Z UTC
Generated by DevOps Daily Health Check Β· β 1.7M Β· β·
π Investigation Results
Deep investigations are dispatched for new critical/warning findings.
The grooming workflow links results ~3 hours after this run.
Generated by DevOps Health β Groom Dashboard Β· β 1.2M Β· β·
π₯ Daily Health Check β 2026-05-26
Status: π΄ 2 critical Β· π‘ 2 warnings Β· π΅ 1 info
Since yesterday: π 1 new Β· β 3 resolved Β· π 4 unchanged
π New Findings (1)
π‘ [P1] Markdown Linter workflow startup_failure on main
Fingerprint:
pipeline:markdown-linter:startup:startup_failureSeverity: π‘ Warning | First seen: 2026-05-26
The
Markdown Linterworkflow (.github/workflows/markdown-linter.lock.yml) had astartup_failureonmainwith 0 jobs executed.startup_failureβ the agentic lock workflow failed before any jobs started5810d492("test-anti-patterns / test-smell-detection: narrow-specialist re-pitch (Variant B)")Context: This finding was previously resolved on 2026-05-25 (no run in 24h window), but the prior investigation noted the root cause β
super-linter/super-linteraction not in.github/aw/actions-lock.jsonβ was not fixed. As predicted, the failure recurred on the next scheduled run.Suggested action: Add
super-linter/super-linterto.github/aw/actions-lock.json(or equivalent action lock file) to resolve the startup failure permanently. π Investigation dispatched.π Investigation Results
evaluate (dotnet-msbuild)job was killed by SIGTERM (exit code 143) after ~17 minutes, likely due to PR #673 adding aAITools.BinlogMcpMCP server with adotnet dnxstartup command that downloads a pinned prerelease from a custom NuGet feed at evaluation time; all 11 other evaluate jobs succeeded in the same run.evaluate (dotnet-test)job now takes ~164 minutes per scheduled run due to the large number of complex agent evaluation scenarios in the dotnet-test plugin; this is documented growth, but it caused 7 consecutive daily run cancellations (May 7β13) and the workflow still operates within 16 min of the 180-min job timeout ceiling.β Resolved Since Yesterday (3)
π΄ Evaluation failed β evaluate (dotnet-test) / Run skill-validatorFingerprint:
pipeline:evaluation:evaluate-(dotnet-test):run-skill-validator:failureResolved: The
evaluate (dotnet-test)job succeeded in today's scheduled evaluation run (run #3786), run time 2026-05-26T00:54:31Z. The runner preemption issue affecting dotnet-test appears to have been transient or the underlying contention resolved. The chronicevaluate (dotnet-msbuild)failure continues.π΄ Evaluation failed β evaluate (dotnet-blazor--coordinate-components) / Run skill-validatorFingerprint:
pipeline:evaluation:evaluate-(dotnet-blazor--coordinate-components):run-skill-validator:failureResolved: No
evaluate (dotnet-blazor--coordinate-components)job failure detected in the last 24h. The dotnet-blazor PR triggering this evaluation appears to have been merged or evaluation conditions changed. Theevaluate (dotnet-blazor)job succeeded in run#3786.π΄ Evaluation failure rate β Critical threshold exceededFingerprint:
pipeline:evaluation:failure-rate:criticalResolved: Evaluation failure rate across all branches in the last 24h has dropped to ~14% (2 failures out of 14 non-cancelled runs), well below the Critical (>30%) and Warning (>15%) thresholds. The improvement is driven by the resolution of the dotnet-blazor--coordinate-components and dotnet-test failures β only the chronic dotnet-msbuild failure remains.
π Existing Findings (4)
π΄ [P1] Evaluation failed β evaluate (dotnet-msbuild) / Run skill-validator Β· first seen 2026-05-20 Β· 7 occurrences
Fingerprint:
pipeline:evaluation:evaluate-(dotnet-msbuild):run-skill-validator:failureSeverity: π΄ Critical
The
evaluate (dotnet-msbuild)job continues to fail at the Run skill-validator step in every scheduled evaluation run onmain. This is the 7th consecutive day.Today's run (run #3786):
2026-05-26T00:54:31Z, concluded:2026-05-26T03:03:29Z(~129 min total)evaluate (dotnet-msbuild): failed atRun skill-validatorstepRoot cause (from investigation):
AITools.BinlogMcpMCP server startup causes resource exhaustion, leading to SIGTERM. The skill-validator evaluation job is killed while parallel vally-evaluate jobs succeed.Suggested action: Remove or conditionally disable
AITools.BinlogMcpduring skill-validator evaluation. The BinlogMcp binary downloads at evaluation time from a custom NuGet feed, consuming resources that trigger runner preemption.π΄ [P3] Evaluation avg duration ~129 min β critical threshold exceeded (>55 min) Β· first seen 2026-05-14 Β· 12 occurrences
Fingerprint:
resource:eval-duration:criticalSeverity: π΄ Critical
Scheduled evaluation runs consistently exceed the π΄ Critical threshold (>55 min). Today's scheduled run was 129 min.
Today's run (run #3786):
7-day trend (scheduled runs):
Root cause (from investigation): The
evaluate (dotnet-test)/vally-evaluate (dotnet-test)job now runs ~164 min of complex agent evaluation scenarios, dominating overall run duration.Suggested action: Split the
dotnet-testplugin evaluation into smaller scenario batches, or increase parallelism to bring total run time below 55 min.π‘ [I8] Orphan plugin: dotnet-experimental not listed in marketplace.json Β· first seen 2026-05-14 Β· 12 occurrences
Fingerprint:
infra:orphan-plugin:dotnet-experimentalSeverity: π‘ Warning
plugins/dotnet-experimental/has a validplugin.jsonbut is not registered in.github/plugin/marketplace.json. The plugin is not discoverable by consumers.Suggested action: Either register in all three marketplace files (
.github/plugin/,.claude-plugin/,.cursor-plugin/) per CONTRIBUTING.md, or remove the directory if it is no longer active.π΅ [I4] evaluation.yml uses --verdict-warn-only mode Β· first seen 2026-05-16 Β· 10 occurrences
Fingerprint:
infra:verdict-warn-onlySeverity: π΅ Info
The evaluation workflow passes
--verdict-warn-onlyto skill-validator. Evaluation failures are reported as warnings rather than hard PR-blocking errors. This is intentional configuration for the current rollout phase.π Trends (7-day)
π€ Generated by DevOps Health Check agentic workflow Β· Run #26432322026 Β· 2026-05-26T04:31Z UTC
π Investigation Results
evaluate (dotnet-msbuild)job was killed by SIGTERM (exit code 143) after ~17 minutes, likely due to PR #673 adding aAITools.BinlogMcpMCP server with adotnet dnxstartup command that downloads a pinned prerelease from a custom NuGet feed at evaluation time; all 11 other evaluate jobs succeeded in the same run.evaluate (dotnet-test)job now takes ~164 minutes per scheduled run due to the large number of complex agent evaluation scenarios in the dotnet-test plugin; this is documented growth, but it caused 7 consecutive daily run cancellations (May 7β13) and the workflow still operates within 16 min of the 180-min job timeout ceiling.super-linter/super-linteraction used in thesuper_linterjob is missing from.github/aw/actions-lock.json, causing the gh-aw framework to reject the compiled workflow before any jobs start; fix requires adding the action entry to the lock file and recompiling.