This report covers 13 agent workflows active in the Biblioteka repository as of 2026-05-31. Data was collected from workflow run history, PR merge rates, issue tracking, and behavioral pattern analysis via the pattern-detector sub-agent.
Key update since yesterday: Code Simplifier has recovered its run success rate (now 10/10 ✅) after being reported as a silent failure. However, PR merge rate remains 0%, indicating a scope-creep pattern. Two new agents joined the ecosystem: Efficiency Improver (first PR merged successfully) and PR Triage Agent (currently in first run).
Executive Summary
Agents analyzed: 13
Average quality score: 57/100 (↓ 6pts from May 30 — scope expansion)
Average effectiveness score: 57/100
Clean agents: 3/13 — Contribution Check, Go Fan, Efficiency Improver
Top performers: Contribution Check, Go Fan, Efficiency Improver
Needs improvement: Architecture Guardian, Code Refiner, Dead Code Remover, Caveman Optimizer
Performance Rankings
Top Performing Agents 🏆
Contribution Check (Quality: 98/100, Effectiveness: 98/100)
Go Fan (Quality: 85/100, Effectiveness: 88/100)
Efficiency Improver (Quality: 80/100, Effectiveness: 85/100)
Agents Needing Improvement 📉
Architecture Guardian (Quality: 35/100, Effectiveness: 25/100)
Intermittent failure: 3/10 runs succeed (30% success rate)
Failing since ~May 15 with no remediation progress (Day 16+)
Pattern: inconsistency — alternating success/failure with no apparent trigger
Recommendation: Run gh aw compile, check for config drift or external API dependency issues
Code Refiner (Quality: 0/100, Effectiveness: 0/100)
Dead Code Remover (Quality: 30/100, Effectiveness: 20/100)
0/2 PRs merged — both closed without merge
Runs appear to succeed but outputs are consistently rejected
Pattern: under-creation — runs but produces no net-effective change
Recommendation: Review PR rejection reasons; add quality gate before PR creation
Caveman Optimizer (Quality: 45/100, Effectiveness: 30/100)
3/11 PRs merged (27% merge rate) over history
Low-quality suggestions frequently rejected
Patterns: scope-creep + inconsistency
Recommendation: Narrow scope to high-confidence optimizations; add pre-submission check
Code Simplifier (Quality: 55/100, Effectiveness: 50/100)
Inactive / New Agents
PR Triage Agent : Currently in first run — under-creation pattern expected, too early to evaluate
Agentic Maintenance : Under-creation pattern flagged (run count lower than expected given every-2h cadence) but functional
Behavioral Pattern Analysis
Pattern detection run on 2026-05-31 via pattern-detector sub-agent.
Agent
Patterns
Contribution Check
✅ Clean
Go Fan
✅ Clean
Efficiency Improver
✅ Clean
Testify Expert
repetition
Daily Documentation Updater
scope-creep
Code Simplifier
scope-creep, inconsistency
Caveman Optimizer
scope-creep, inconsistency
Function Namer
inconsistency
Architecture Guardian
inconsistency
Code Refiner
under-creation
Dead Code Remover
under-creation
PR Triage Agent
under-creation (new)
Agentic Maintenance
under-creation
Pattern Summary:
✅ Clean: 3 agents (23%)
⚠️ Scope-creep: 3 agents — outputs regularly rejected as out-of-scope
⚠️ Inconsistency: 4 agents — highly variable success/quality rates
⚠️ Under-creation: 4 agents — not producing expected output volume/impact
⚠️ Repetition: 1 agent — duplicate task generation
PR Merge Rate Analysis
Agent Group (branch prefix)
PRs
Merged
Closed
Merge Rate
efficiency (Efficiency Improver)
1
1
0
100% ✅
copilot (Go Fan, Testify Expert, Function Namer)
10
9
1
90% ✅
dependabot
13
10
3
77% ✅
docs (Daily Doc Updater)
3
1
2
33% ⚠️
caveman (Caveman Optimizer)
11
3
8
27% ⚠️
code-simplifier (Code Simplifier)
2
0
2
0% ❌
chore (Dead Code Remover)
2
0
2
0% ❌
refiner (Code Refiner)
1
0
0
0% (open) ❌
The Copilot SWE agent (used by Go Fan, Testify Expert, Function Namer) produces the highest quality PRs at 90% merge rate, indicating the agent-as-orchestrator pattern with Copilot execution is highly effective.
Ecosystem Coverage
Well-Covered Areas
Code quality review (Contribution Check, Code Refiner — though Refiner offline)
Go module updates (Go Fan)
Documentation (Daily Documentation Updater)
Testing quality (Testify Expert)
Gaps
Security vulnerability tracking (no active security agent)
Performance optimization (Efficiency Improver is new but shows promise)
Architecture enforcement (Architecture Guardian intermittent)
New Additions
Efficiency Improver : Fills performance optimization gap — strong debut
PR Triage Agent : Fills PR review automation gap — first run today
Behavioral Patterns
Productive Patterns ✅
Go Fan → Copilot SWE execution : High-quality analysis + execution delivers 90% merge rate
Contribution Check cadence : Every-4h reporting provides consistent health signal
Efficiency Improver debut : First PR (gzip compression) merged immediately — right-sized scope
Problematic Patterns ⚠️
Code Refiner cancellation loop : Triggered on every qualifying PR but always cancelled — consuming trigger quota with no output
Caveman + Code Simplifier scope-creep : Both agents create PRs that get systematically closed — wasted effort and PR noise
Architecture Guardian instability : 30% success rate with no apparent fix applied after 16+ days
Recommendations
High Priority
Diagnose Code Refiner cancellation — 0% success, all runs cancelled on PR trigger
Check concurrency limits on PR-triggered workflows
Verify job dependencies and trigger conditions
Estimated effort: 1-2 hours | Expected: restore functional agent
Scope Architecture Guardian investigation — 30% success, Day 16+ without fix
Previous open issue [aw] Architecture Guardian failed #3033 ; run gh aw compile to check config
Add error logging to distinguish flaky API from config issue
Estimated effort: 2-4 hours | Expected: restore to 90%+ success
Review Code Simplifier PR rejection reasons
Medium Priority
Add Testify Expert deduplication ([testify-expert] Quality Regression: Issue #3046 Output is Empty Placeholder #3052 existing issue)
Agent creating duplicate improvement issues for same files
Add seen-file tracking between runs
Estimated effort: 1-2 hours | Expected: eliminate duplicate issue creation
Narrow Caveman Optimizer scope
Only 27% merge rate — refine prompt to target high-confidence, low-risk optimizations
Estimated effort: 30 min prompt change | Expected: improve to 50%+
Trigger Metrics Collector — 57+ days stale (last: 2026-04-04)
All meta-orchestrators relying on stale April 4 baseline
Critical for accurate trend analysis
Low Priority
Review Dead Code Remover PR quality — both closed, investigate whether detecting truly-dead or still-referenced code
Close stale recovery issues [aw] Daily Documentation Healer failed #3022 , [aw] Daily Documentation Updater failed #3031 — agents stable for 5+ days
Trends (May 30 → May 31)
Metric
May 30
May 31
Δ
Average quality
63/100
57/100
↓ -6 (scope expansion)
Average effectiveness
67/100
57/100
↓ -10 (scope expansion)
Clean agent %
53% (10/19)
23% (3/13)
↓ (new scope — note denominator change)
Architecture Guardian success
3/10
3/10
→ stable (failing)
Code Refiner success
0/10
0/10
→ stable (broken)
Code Simplifier run success
failing
10/10
✅ RECOVERED
Copilot SWE PR merge rate
90%
90%
→ stable
Actions Taken This Run
Next Steps
🚨 Diagnose Code Refiner cancellation loop (root cause unknown)
🚨 Fix Architecture Guardian (16+ days intermittent — escalate)
⚠️ Constrain Code Simplifier and Caveman Optimizer scope
⚠️ Add Testify Expert deduplication
📊 Trigger Metrics Collector to refresh 57-day-stale baseline
👀 Monitor PR Triage Agent first run results
Analysis period: 2026-05-24 to 2026-05-31
Previous report: §26684844700 (May 30)
Next report: 2026-06-07
Metrics baseline: 2026-04-04 (57 days stale — Metrics Collector needs manual trigger)
Warning
Firewall blocked 1 domain
The following domain was blocked by the firewall during workflow execution:
To allow these domains, add them to the network.allowed list in your workflow frontmatter:
network :
allowed :
- defaults
- " localhost"
See Network Configuration for more information.
Generated by Agent Performance Analyzer - Meta-Orchestrator · sonnet46 2.8M · ◷
Add this agentic workflows to your repo
To install this agentic workflow, run
gh aw add github/gh-aw/.github/workflows/agent-performance-analyzer.md@c88e1268ab92fe509e85c0bef376884880613be2
This report covers 13 agent workflows active in the Biblioteka repository as of 2026-05-31. Data was collected from workflow run history, PR merge rates, issue tracking, and behavioral pattern analysis via the pattern-detector sub-agent.
Key update since yesterday: Code Simplifier has recovered its run success rate (now 10/10 ✅) after being reported as a silent failure. However, PR merge rate remains 0%, indicating a scope-creep pattern. Two new agents joined the ecosystem: Efficiency Improver (first PR merged successfully) and PR Triage Agent (currently in first run).
Executive Summary
Performance Rankings
Top Performing Agents 🏆
Contribution Check (Quality: 98/100, Effectiveness: 98/100)
Go Fan (Quality: 85/100, Effectiveness: 88/100)
Efficiency Improver (Quality: 80/100, Effectiveness: 85/100)
Agents Needing Improvement 📉
Architecture Guardian (Quality: 35/100, Effectiveness: 25/100)
inconsistency— alternating success/failure with no apparent triggergh aw compile, check for config drift or external API dependency issuesCode Refiner (Quality: 0/100, Effectiveness: 0/100)
under-creation— zero effective output despite many trigger attemptsDead Code Remover (Quality: 30/100, Effectiveness: 20/100)
under-creation— runs but produces no net-effective changeCaveman Optimizer (Quality: 45/100, Effectiveness: 30/100)
scope-creep+inconsistencyCode Simplifier (Quality: 55/100, Effectiveness: 50/100)
scope-creep+inconsistency— changes fall outside project acceptance criteriaInactive / New Agents
Behavioral Pattern Analysis
Pattern detection run on 2026-05-31 via pattern-detector sub-agent.
repetitionscope-creepscope-creep,inconsistencyscope-creep,inconsistencyinconsistencyinconsistencyunder-creationunder-creationunder-creation(new)under-creationPattern Summary:
PR Merge Rate Analysis
efficiency(Efficiency Improver)copilot(Go Fan, Testify Expert, Function Namer)dependabotdocs(Daily Doc Updater)caveman(Caveman Optimizer)code-simplifier(Code Simplifier)chore(Dead Code Remover)refiner(Code Refiner)The Copilot SWE agent (used by Go Fan, Testify Expert, Function Namer) produces the highest quality PRs at 90% merge rate, indicating the agent-as-orchestrator pattern with Copilot execution is highly effective.
Ecosystem Coverage
Well-Covered Areas
Gaps
New Additions
Behavioral Patterns
Productive Patterns ✅
Problematic Patterns⚠️
Recommendations
High Priority
Diagnose Code Refiner cancellation — 0% success, all runs cancelled on PR trigger
Scope Architecture Guardian investigation — 30% success, Day 16+ without fix
gh aw compileto check configReview Code Simplifier PR rejection reasons
Medium Priority
Add Testify Expert deduplication ([testify-expert] Quality Regression: Issue #3046 Output is Empty Placeholder #3052 existing issue)
Narrow Caveman Optimizer scope
Trigger Metrics Collector — 57+ days stale (last: 2026-04-04)
Low Priority
Trends (May 30 → May 31)
Actions Taken This Run
Next Steps
Warning
Firewall blocked 1 domain
The following domain was blocked by the firewall during workflow execution:
localhostSee Network Configuration for more information.
Add this agentic workflows to your repo
To install this agentic workflow, run