Skip to content

docs(eval): runner injects category definitions, not just labels#77

Merged
evemcgivern merged 1 commit into
devfrom
docs/eval-readme-runner-legend
Jun 17, 2026
Merged

docs(eval): runner injects category definitions, not just labels#77
evemcgivern merged 1 commit into
devfrom
docs/eval-readme-runner-legend

Conversation

@evemcgivern

Copy link
Copy Markdown
Contributor

One-line accuracy fix to the eval data-flow diagram. After #72 the runner injects the cat#N definitions inline (from baseline-categories.md) in addition to the eval-categories label set — that was the crux of the determinism fix. The diagram only mentioned the label set.

Doc-only; no behavior change.

🤖 Generated with Claude Code

…bels

#72 made the runner inject the cat#N definitions inline (from baseline-categories.md),
not only the eval-categories label set — that was the fix for nondeterministic headless
scoring. Update the data-flow diagram to match.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
@evemcgivern evemcgivern requested a review from a team as a code owner June 17, 2026 13:20
@evemcgivern evemcgivern merged commit 163281a into dev Jun 17, 2026
4 checks passed
@evemcgivern evemcgivern deleted the docs/eval-readme-runner-legend branch June 17, 2026 13:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant