Skip to content

πŸ₯ Repository Health DashboardΒ #288

@github-actions

Description

@github-actions

πŸ₯ Daily Health Check β€” 2026-05-26

Status: πŸ”΄ 2 critical Β· 🟑 2 warnings Β· πŸ”΅ 1 info
Since yesterday: πŸ†• 1 new Β· βœ… 3 resolved Β· πŸ“Œ 4 unchanged

Summary: A welcome improvement β€” 3 findings resolved today, including the evaluation failure rate critical and both dotnet-blazor--coordinate-components and dotnet-test failures. The Markdown Linter startup_failure has recurred on main as predicted. The chronic dotnet-msbuild evaluation failure and eval duration critical remain active.


πŸ†• New Findings (1)

These appeared since the last health check (2026-05-25).

🟑 [P1] Markdown Linter workflow startup_failure on main

Fingerprint: pipeline:markdown-linter:startup:startup_failure
Severity: 🟑 Warning | First seen: 2026-05-26

The Markdown Linter workflow (.github/workflows/markdown-linter.lock.yml) had a startup_failure on main with 0 jobs executed.

  • Run #23 β€” 2026-05-25T14:58:56Z (schedule, main)
  • Conclusion: startup_failure β€” the agentic lock workflow failed before any jobs started
  • Triggered by commit: 5810d492 ("test-anti-patterns / test-smell-detection: narrow-specialist re-pitch (Variant B)")
  • 0 jobs ran; the workflow never started

Context: This finding was previously resolved on 2026-05-25 (no run in 24h window), but the prior investigation noted the root cause β€” super-linter/super-linter action not in .github/aw/actions-lock.json β€” was not fixed. As predicted, the failure recurred on the next scheduled run.

Suggested action: Add super-linter/super-linter to .github/aw/actions-lock.json (or equivalent action lock file) to resolve the startup failure permanently. πŸ”„ Investigation dispatched.


πŸ” Investigation Results

Deep investigations are dispatched for new critical/warning findings.
The grooming workflow links results ~3 hours after this run.

Finding Severity Investigation First Seen Result
Evaluation failed β€” evaluate (dotnet-msbuild) / Run skill-validator πŸ”΄ Critical βœ… Done 2026-05-20 The evaluate (dotnet-msbuild) job was killed by SIGTERM (exit code 143) after ~17 minutes, likely due to PR #673 adding a AITools.BinlogMcp MCP server with a dotnet dnx startup command that downloads a pinned prerelease from a custom NuGet feed at evaluation time; all 11 other evaluate jobs succeeded in the same run.
Evaluation avg duration β€” critical threshold exceeded πŸ”΄ Critical βœ… Done 2026-05-14 The evaluate (dotnet-test) job now takes ~164 minutes per scheduled run due to the large number of complex agent evaluation scenarios in the dotnet-test plugin; this is documented growth, but it caused 7 consecutive daily run cancellations (May 7–13) and the workflow still operates within 16 min of the 180-min job timeout ceiling.
Markdown Linter workflow startup_failure on main 🟑 Warning πŸ”„ Dispatched 2026-05-26 ⏳ Investigation dispatched β€” results arriving shortly...

βœ… Resolved Since Yesterday (3)

These were in yesterday's report but are no longer detected.

πŸ”΄ Evaluation failed β€” evaluate (dotnet-test) / Run skill-validator

Fingerprint: pipeline:evaluation:evaluate-(dotnet-test):run-skill-validator:failure
Resolved: The evaluate (dotnet-test) job succeeded in today's scheduled evaluation run (run #3786), run time 2026-05-26T00:54:31Z. The runner preemption issue affecting dotnet-test appears to have been transient or the underlying contention resolved. The chronic evaluate (dotnet-msbuild) failure continues.

πŸ”΄ Evaluation failed β€” evaluate (dotnet-blazor--coordinate-components) / Run skill-validator

Fingerprint: pipeline:evaluation:evaluate-(dotnet-blazor--coordinate-components):run-skill-validator:failure
Resolved: No evaluate (dotnet-blazor--coordinate-components) job failure detected in the last 24h. The dotnet-blazor PR triggering this evaluation appears to have been merged or evaluation conditions changed. The evaluate (dotnet-blazor) job succeeded in run#3786.

πŸ”΄ Evaluation failure rate β€” Critical threshold exceeded

Fingerprint: pipeline:evaluation:failure-rate:critical
Resolved: Evaluation failure rate across all branches in the last 24h has dropped to ~14% (2 failures out of 14 non-cancelled runs), well below the Critical (>30%) and Warning (>15%) thresholds. The improvement is driven by the resolution of the dotnet-blazor--coordinate-components and dotnet-test failures β€” only the chronic dotnet-msbuild failure remains.


πŸ“Œ Existing Findings (4)

These have been present since before today. Sorted by severity then age.

πŸ”΄ [P1] Evaluation failed β€” evaluate (dotnet-msbuild) / Run skill-validator Β· first seen 2026-05-20 Β· 7 occurrences

Fingerprint: pipeline:evaluation:evaluate-(dotnet-msbuild):run-skill-validator:failure
Severity: πŸ”΄ Critical

The evaluate (dotnet-msbuild) job continues to fail at the Run skill-validator step in every scheduled evaluation run on main. This is the 7th consecutive day.

Today's run (run #3786):

  • Started: 2026-05-26T00:54:31Z, concluded: 2026-05-26T03:03:29Z (~129 min total)
  • Job evaluate (dotnet-msbuild): failed at Run skill-validator step
  • All other evaluate jobs (dotnet, dotnet-test, dotnet-blazor, dotnet-aspnet, dotnet-data, dotnet-diag, dotnet-upgrade, dotnet-maui, dotnet-ai, dotnet-template-engine) succeeded

Root cause (from investigation): AITools.BinlogMcp MCP server startup causes resource exhaustion, leading to SIGTERM. The skill-validator evaluation job is killed while parallel vally-evaluate jobs succeed.

Suggested action: Remove or conditionally disable AITools.BinlogMcp during skill-validator evaluation. The BinlogMcp binary downloads at evaluation time from a custom NuGet feed, consuming resources that trigger runner preemption.

πŸ”΄ [P3] Evaluation avg duration ~129 min β€” critical threshold exceeded (>55 min) Β· first seen 2026-05-14 Β· 12 occurrences

Fingerprint: resource:eval-duration:critical
Severity: πŸ”΄ Critical

Scheduled evaluation runs consistently exceed the πŸ”΄ Critical threshold (>55 min). Today's scheduled run was 129 min.

Today's run (run #3786):

  • Total run duration: ~129 min (00:54Z β†’ 03:03Z)
  • Note: run ended with a failure (dotnet-msbuild) but other jobs including dotnet-test completed successfully

7-day trend (scheduled runs):

Date Duration
2026-05-26 ~129 min
2026-05-25 124 min
2026-05-24 122.5 min
2026-05-23 119.5 min
2026-05-22 ~131.7 min
7d avg ~125 min

Root cause (from investigation): The evaluate (dotnet-test) / vally-evaluate (dotnet-test) job now runs ~164 min of complex agent evaluation scenarios, dominating overall run duration.

Suggested action: Split the dotnet-test plugin evaluation into smaller scenario batches, or increase parallelism to bring total run time below 55 min.

🟑 [I8] Orphan plugin: dotnet-experimental not listed in marketplace.json · first seen 2026-05-14 · 12 occurrences

Fingerprint: infra:orphan-plugin:dotnet-experimental
Severity: 🟑 Warning

plugins/dotnet-experimental/ has a valid plugin.json but is not registered in .github/plugin/marketplace.json. The plugin is not discoverable by consumers.

Suggested action: Either register in all three marketplace files (.github/plugin/, .claude-plugin/, .cursor-plugin/) per CONTRIBUTING.md, or remove the directory if it is no longer active.

πŸ”΅ [I4] evaluation.yml uses --verdict-warn-only mode Β· first seen 2026-05-16 Β· 10 occurrences

Fingerprint: infra:verdict-warn-only
Severity: πŸ”΅ Info

The evaluation workflow passes --verdict-warn-only to skill-validator. Evaluation failures are reported as warnings rather than hard PR-blocking errors. This is intentional configuration for the current rollout phase.


πŸ“Š Trends (7-day)

Metric Today 7d Avg Ξ” Trend
Eval duration β€” scheduled (min) 129 ~125 +4 ↗️ Increasing (watch)
Eval success rate (main, scheduled) 0% ~0% 0% ➑️ Stable
Eval success rate (all branches, 24h) ~86% ~40% +46% βœ… Improving
Eval scheduled cancellation rate 0% 0% 0% ➑️ Stable
Eval failure rate (all branches, 24h) ~14% ~45% -31% βœ… Improving
Workflow failure rate (7d, main) ~7% ~15% -8% βœ… Improving
Compute hours/day ~3.5h ~4.5h -1h βœ… Improving

πŸ€– Generated by DevOps Health Check agentic workflow Β· Run #26432322026 Β· 2026-05-26T04:31Z UTC

Generated by DevOps Daily Health Check Β· ● 1.7M Β· β—·


πŸ” Investigation Results

Deep investigations are dispatched for new critical/warning findings.
The grooming workflow links results ~3 hours after this run.

Finding Severity Investigation First Seen Result
Evaluation failed β€” evaluate (dotnet-msbuild) / Run skill-validator πŸ”΄ Critical βœ… Done 2026-05-20 The evaluate (dotnet-msbuild) job was killed by SIGTERM (exit code 143) after ~17 minutes, likely due to PR #673 adding a AITools.BinlogMcp MCP server with a dotnet dnx startup command that downloads a pinned prerelease from a custom NuGet feed at evaluation time; all 11 other evaluate jobs succeeded in the same run.
Evaluation avg duration β€” critical threshold exceeded πŸ”΄ Critical βœ… Done 2026-05-14 The evaluate (dotnet-test) job now takes ~164 minutes per scheduled run due to the large number of complex agent evaluation scenarios in the dotnet-test plugin; this is documented growth, but it caused 7 consecutive daily run cancellations (May 7–13) and the workflow still operates within 16 min of the 180-min job timeout ceiling.
Markdown Linter workflow startup_failure on main 🟑 Warning βœ… Done 2026-05-26 The super-linter/super-linter action used in the super_linter job is missing from .github/aw/actions-lock.json, causing the gh-aw framework to reject the compiled workflow before any jobs start; fix requires adding the action entry to the lock file and recompiling.

Generated by DevOps Health β€” Groom Dashboard Β· ● 1.2M Β· β—·

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions