feat: pharaoh-sdd skill for gated V-model spec-driven development by bburda · Pull Request #21 · useblocks/pharaoh

bburda · 2026-05-18T17:46:36Z

Summary

Adds pharaoh-sdd, a process-orchestrator skill that drives a feature from a conversation through the full V-model: requirements, architecture, tests, and implementation. Each stage produces traceable sphinx-needs artefacts.

pharaoh-sdd is a non-atomic orchestrator. It does not draft or review artefacts itself. It runs an interactive elicitation phase, then walks the project's V-model tiers, dispatching the existing atomic draft skills (pharaoh-req-draft, pharaoh-arch-draft, pharaoh-vplan-draft), and enforces a human checkpoint plus a sphinx-build -W validation after each tier. It closes with pharaoh-quality-gate.

It exists to stop a specific failure. Handed "add feature X, do spec-driven development", an unaided agent fabricates the requirements (invented thresholds and acceptance criteria), runs the whole chain unattended, reviews nothing, and builds without -W. pharaoh-sdd makes elicitation, per-tier review, validation, and a human gate mandatory.

Tier order is derived from the project's tailoring rather than hardcoded, so the skill adapts to any sphinx-needs V-model.

Checklist

Both Claude Code skill and Copilot agent updated (if applicable)
Tested against tests/fixtures/basic-project/
README skill table updated (if new/renamed skill)
copilot-instructions.md updated (if new/renamed agent)
Commit messages use imperative mood

How to test this

With the Pharaoh skills from this branch available to your agent, work in a sphinx-needs project that has .pharaoh/project/ tailoring (tests/fixtures/basic-project/ in this repo, or a fuller project such as sphinx-needs-demo).

Ask the agent to add a feature, and deliberately leave a value out of the request (a threshold, a timeout, a limit). For example: "add automatic headlight control, the low beams switch on and off by ambient light", with no light level stated.
Expect the agent to trigger pharaoh-sdd and run Phase 0 elicitation. It should ask you for every unstated value rather than inventing one. If it drafts a requirement with a made-up number, that is a failure.
Answer the elicitation questions. Approve the requirement decomposition it presents.
For each V-model tier, expect a draft from the atomic skill with its review attached, a sphinx-build -W that must pass, and a checkpoint that waits for your approval before the next tier.
At the end, expect pharaoh-quality-gate to run and the V-model graph to build clean with every tier linked to the next.

What to stress: step 4 onward (drafting through to the quality gate) has not been run end to end, so a full tier-by-tier walk on a real project is the main thing to exercise. Report any tier where the draft, review, build, or checkpoint does not behave as the skill describes.

Testing done so far

Validated by running the spec-driven-development scenario on a sphinx-needs V-model project (sphinx-needs-demo), both with and without the skill, on the same feature request.

Without the skill, the agent fabricated the unstated requirement values (light thresholds and hysteresis) and proceeded straight to drafting. With the skill, the agent instead asked the developer for those exact values and stopped at the Phase 0 elicitation gate, drafting nothing until the developer answers.

The repo fixture tests/fixtures/basic-project/ was not used. Validation was against sphinx-needs-demo, which has a fuller multi-tier V-model.

Signed-off-by: Bartosz Burda <bartoszburda93@gmail.com>

- Terminal step now invokes pharaoh-quality-gate with project_root and run directory via self_review_coverage invariant; artefacts_summary_path marked optional because pharaoh-sdd runs no mece or coverage-gap tasks - persist step names exact review JSON filename conventions matched by pharaoh-self-review-coverage-check (<id>_review.json, <id>_diagram_review.json, <id>_code_grounding.json) - ubproject.toml added to Input section with note that [needs.links] is read from it for link-field convention; [pharaoh.traceability] references now clearly attributed to pharaoh.toml - data-access and strictness note added: concerns are delegated to atomic skills - needs.json output location clarified with typical demo path - implementation tier gains explicit sphinx-build -W rebuild and developer checkpoint after pharaoh-req-codelink-annotate, consistent with other tiers Signed-off-by: Bartosz Burda <bartoszburda93@gmail.com>

Copilot

Pull request overview

Adds a new non-atomic orchestrator skill (pharaoh-sdd) intended to enforce gated, spec-driven (V-model) development in sphinx-needs projects by requiring elicitation, per-tier review, sphinx-build -W, and human checkpoints, and exposes it via both Claude skill docs and a GitHub Copilot agent entry.

Changes:

Added the skills/pharaoh-sdd/SKILL.md specification describing the gated tier-by-tier orchestration process and quality gate terminal step.
Documented the new entry point in README.md and .github/copilot-instructions.md.
Added the Copilot agent descriptor .github/agents/pharaoh.sdd.agent.md.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

File	Description
skills/pharaoh-sdd/SKILL.md	Defines the new orchestrator’s phases, gating rules, tier loop, and terminal quality gate invocation.
README.md	Adds `pharaoh-sdd` to the “Core workflow” skills list and updates surrounding description text.
.github/copilot-instructions.md	Adds `@pharaoh.sdd` to the “Available Agents” table.
.github/agents/pharaoh.sdd.agent.md	Introduces the Copilot agent entry for the new `@pharaoh.sdd` orchestrator.

+| draft | Dispatch the tier's atomic draft skill once per artefact. The draft skill self-invokes its review and returns `{artefact, review}`. |
+| evaluate | Read the attached review. If `overall: fail`, `overall: needs_work`, or any binary axis has `score: 0`, re-dispatch the draft skill with the review action items folded into the description. Use `pharaoh-req-regenerate` for requirements, re-invoke the draft skill directly for arch and vplan. |
+| normalise | If the project traces with a generic link field (read from `ubproject.toml [needs.links]` and the existing corpus), rewrite the drafted directive's typed link option (`:satisfies:` or `:verifies:`) to that field. If `ubproject.toml` is absent or has no `[needs.links]` table, keep the typed link as-is. |
+| persist | Write the artefact into the docs tree. Write the review JSON into the run directory using the filename convention `<id>_review.json` (for req-review), `<id>_arch_review.json` (for arch-review), and `<id>_vplan_review.json` (for vplan-review). These names are matched by `pharaoh-self-review-coverage-check`. |


+Run `pharaoh-quality-gate` passing `project_root` and the run directory (via
+`gate_spec.invariants.self_review_coverage`). The `self_review_coverage` invariant reads the
+run directory directly and confirms every drafted artefact has a matching review JSON on
+disk. `artefacts_summary_path` is optional and may be omitted here because the pharaoh-sdd
+chain runs draft-and-review per tier but does not run `pharaoh-mece` or
+`pharaoh-coverage-gap`. The deliverable is a V-model graph where every tier traces to the
+next, every artefact has a review on disk, and the build is clean.


bburda added 3 commits May 18, 2026 18:07

feat: pharaoh-sdd orchestrator skill for gated V-model SDD

7fa88c3

Signed-off-by: Bartosz Burda <bartoszburda93@gmail.com>

docs: register pharaoh-sdd skill (agent, README, copilot-instructions)

fd6562d

Signed-off-by: Bartosz Burda <bartoszburda93@gmail.com>

bburda marked this pull request as ready for review May 19, 2026 12:33

Copilot AI review requested due to automatic review settings May 19, 2026 12:33

Copilot started reviewing on behalf of bburda May 19, 2026 12:34 View session

Copilot AI reviewed May 19, 2026

View reviewed changes

patdhlk self-requested a review May 20, 2026 06:42

patdhlk approved these changes May 20, 2026

View reviewed changes

patdhlk merged commit 9cd10ed into main May 20, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: pharaoh-sdd skill for gated V-model spec-driven development#21

feat: pharaoh-sdd skill for gated V-model spec-driven development#21
patdhlk merged 3 commits into
mainfrom
feat/pharaoh-sdd

bburda commented May 18, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

bburda commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Checklist

How to test this

Testing done so far

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

bburda commented May 18, 2026 •

edited

Loading