Skill Drafting — Building and Maintaining Agent Skills

Part of Agent Kaizen — Agent Kaizen is designed to support reliable AI agent workflows, context engineering, and AI systems engineering in VS Code, Codex, and Claude Code. Build reusable agent skills, reduce unnecessary context loading, add validation loops, and apply Spec → Verifier → Environment scaffolding to new and existing projects.

This repository is a skill-drafting skill: a reusable, trigger-rich task handbook that an AI coding agent (OpenAI Codex, Claude Code, etc) loads on demand when a task matches its triggers.

What this skill covers

skill-drafting is a thorough handbook for creating, updating, reviewing, and validating Codex or Claude skills. It guides the full lifecycle: staged interview gates (goal, triggers, package anatomy, quality harness, implementation plan), classification across Anthropic's three skill use-case categories, and progressive-disclosure drafting where the frontmatter description is the trigger surface, SKILL.md is the runtime guide, and references/ open on demand. It documents the deterministic corpus pipeline driven by scripts/skill_builder.py — ingest (mdbook, html, rustdoc, pdf), build (including the --verbatim preset), finalize, split, maintain, lint, a cross-skill index, and a recontext group that turns verbatim ingested docs into original, publishable prose with every identifier preserved — plus validate, policy, and the locked recontext-subagent writer, all exposed as subcommands of that same all-in-one tool, and separate Claude/Codex usage checks. The bundled tools are detailed under Scripts below. Every drafted skill must end with a Gotchas section and an observable Verification section, including trigger tests, structure tests (SKILL.md, references/INDEX.md, and references/topics.json agreement), workflow tests, and a quality harness that can reject bad output.

What's inside

SKILL.md — frontmatter (name + trigger-rich description) and a lean body.
references/ — right-sized topic files the agent loads only when relevant (plus INDEX.md and topics.json).
GOTCHA.md — known pitfalls and edge cases (for example: never bury trigger rules in the body — the frontmatter description is the main trigger surface; never claim quality from self-assessment alone — define an external or observable check before implementation).
scripts/ — the deterministic, stdlib-only tooling that builds, validates, recontextualizes, and maintains skills (detailed under Scripts).

Scripts

All of the tooling is one all-in-one command — scripts/skill_builder.py — compiled from a small, editable component package. Stdlib only; no third-party dependencies. There are two ways to use it, whichever fits the scenario:

The all-in-one: python skill_builder.py <subcommand> … exposes every capability (below). Run any subcommand with --help. One file an agent can read end-to-end to understand the whole system.
A single component: the editable source lives in scripts/builder_components/ (one module per stage, shared helpers under builder_components/util/). Each command module is independently runnable — e.g. python -m builder_components.validate <dir> — which is cheaper for an agent that only needs that one capability.
Edit, then rebuild: builder_components/ is the source of truth; skill_builder.py is generated from it by python builder_components/_assemble.py (or python skill_builder.py --rebuild). python builder_components/_assemble.py --check fails if the committed file is stale — re-run the rebuild after editing a component. See builder_components/README.md for the package map and the contributor workflow.

Subcommands

ingest {html|mdbook|rustdoc|pdf} — source docs → a corpus JSONL of text chunks (a shared HTML→Markdown converter backs the HTML formats).
build — corpus JSONL → a flat or router reference skill, packed into right-sized per-subject files with references/INDEX.md + references/topics.json + a starter SKILL.md. The --verbatim preset reproduces already-clean docs faithfully; --verify audits coverage and residue.
finalize — bring a built skill up to the gold SKILL.md / GOTCHA.md standard.
split — split oversized references/*.md into one file per topic (deterministic; no rewriting).
maintain — in-place gold maintenance: audit conformance, split oversize files, and cross-link, leaving the bespoke SKILL.md / GOTCHA.md intact.
lint — read-only link/topics health check → AI/lint/<skill>.md.
index — build a cross-skill master INDEX.md (with optional covers: seeding).
recontext {clean|extract|splice|gate|triage} and the lifecycle recontext {scan|batch|drain|integrate|finish|reconcile|promote} — recontextualize verbatim ingested docs into original, publishable prose with identifiers preserved. Roots and owner come from a --config JSON or CLI flags; nothing is hardcoded to a skill or path. See references/recontextualization.md.
readme {scaffold|apply} — create a new skill README.md, or re-apply the standard's managed boilerplate regions (the Agent Kaizen intro, the %DEVROOT% "Use it" instructions, the idle-context policy, and the License) to existing READMEs — located by their markdown headings, no markers in the shippable files — never touching authored prose. The README standard is single-sourced in references/readme-standard.md — edit that one file to set your own README standard, then run python skill_builder.py readme apply --all to propagate it across however many skill READMEs you have.
validate [--package] <dir> — the Verification gate for a finished skill: required files, frontmatter, and SKILL.md / INDEX.md / topics.json agreement (--package validates a router plus its product subskills).
policy {audit|plan|preview|apply|restore} — audit and set each installed skill's invocation policy so rarely-used skills cost no idle context while staying explicitly invocable. Full walkthrough under Reducing idle context cost.
recontext-subagent {prepare|show|submit|audit} — the locked, gated artifact writer a rewriting subagent must use: it derives every output path internally, confines all writes to one --work-root, refuses caller-supplied paths, and runs Gate A/B/C before writing, so a PASS is verified, never assumed. The shared, path-agnostic engine it sits on (prose-unit detection; the three gates — A identifiers, B ~13-word verbatim residue, C scrape cruft; clean/extract/splice; triage) lives in builder_components/recontext_core.py and also backs recontext.

`claude_usage_check.py` and `codex_usage_check.py` (+ `codex_usage_bridge.cmd`) — usage checks

Read-only utilization checks for long or batched builds. claude_usage_check.py prints Claude's 5-hour-window utilization. Codex usage needs the host-side bridge (codex_usage_bridge.cmd, or codex_usage_check.py --run-bridge) because the Codex agent sandbox blocks the usage endpoint by default; both exit cleanly without leaking token data if credentials are absent.

Use it

This skill is one git repo in a skills store — the lowercase skills\ folder that holds every skill as its own repo. Keep that folder wherever you like; you wire skills into a project by linking from it into the project's .agents\skills\ and .claude\skills\. Link one skill at a time (recommended) so the agent only lists what a task needs and you don't spend idle context on skills you're not using; linking the whole skills\ folder at once loads every skill and isn't recommended outside a skills-dev project.

The commands below use %DEVROOT% — the DEVROOT environment variable pointing at the folder that holds your projects. Set it once (no admin) by running SetDevRoot.cmd from the root of the Agent Kaizen project — a reusable VS Code-project template that ships the script; a new shell then resolves %DEVROOT% automatically. Replace <SKILLS-PROJECT> with whatever you named the folder that contains your skills\ store.

Link just this skill (recommended) — a Windows directory junction, no admin:

mklink /J .agents\skills\skill-drafting  "%DEVROOT%\<SKILLS-PROJECT>\skills\skill-drafting"
mklink /J .claude\skills\skill-drafting  "%DEVROOT%\<SKILLS-PROJECT>\skills\skill-drafting"

Or link the whole store at once (loads every skill — not recommended outside a skills-dev project):

mklink /J .agents\skills  "%DEVROOT%\<SKILLS-PROJECT>\skills"
mklink /J .claude\skills  "%DEVROOT%\<SKILLS-PROJECT>\skills"

Remove a link (the store copy is untouched):

rmdir .agents\skills\skill-drafting
rmdir .claude\skills\skill-drafting

The author keeps the skills\ store inside its own dedicated VS Code project (the <SKILLS-PROJECT> folder) with local-only build and maintenance scripts, because the same skills are reused across many different projects. If you work across several repos, consider the same: one central skills project gives every skill a single home, and an improvement to the shared scripts benefits all of your projects at once.

The agent (OpenAI Codex, Claude Code) then auto-loads this skill whenever a task matches its triggers. Built and validated with skill-drafting to the Agent Kaizen gold standard.

The linked-skills workflow

skill-drafting is the documentation home for how Agent Kaizen skills are stored, published, and wired into agents. The linking commands live in Use it above; this section is the rationale.

Store. Every skill lives in a private, local-only store at %DEVROOT%\SKILLS\skills\<name>\ — each its own git repo, published individually as github.com/DirectorGunner/AI-SKILL-<name> (one exception: this skill-drafting skill publishes as AI-skill-drafting, to avoid doubling the word). Both agents read that one copy through per-skill junctions (.agents/skills/<name>, .claude/skills/<name>), so the mirrors can't drift and the framework repo carries zero skill payload. Each LLM folder also has its own generated INDEX.md. (%DEVROOT% is the DEVROOT env var — set it once with SetDevRoot.cmd.)

Why per skill, not the whole store. Claude Code loads every available skill's name + description, and these trigger-rich descriptions are large — linking the whole catalog (100+ skills) blows the ~1% skillListingBudgetFraction, truncates descriptions, and degrades routing. Link the 3–8 skills a project actually uses. A whole-store link is only for a dev/everything workspace like the SKILLS store project itself.

Skill catalog

Skill	Repository
`skill-drafting`	AI-skill-drafting
`ableton-live`	AI-SKILL-ableton-live
`adobe-products`	AI-SKILL-adobe-products
`blender`	AI-SKILL-blender
`chrome-extensions`	AI-SKILL-chrome-extensions
`cli-design`	AI-SKILL-cli-design
`davinci-resolve`	AI-SKILL-davinci-resolve
`discord-developers`	AI-SKILL-discord-developers
`gimp`	AI-SKILL-gimp
`git`	AI-SKILL-git
`github`	AI-SKILL-github
`lumberyard`	AI-SKILL-lumberyard
`powershell-vsdevshell`	AI-SKILL-powershell-vsdevshell
`pymeasure`	AI-SKILL-pymeasure
`pyvisa`	AI-SKILL-pyvisa
`tauri-develop`	AI-SKILL-tauri-develop
`turso-db`	AI-SKILL-turso-db

Main project: agent-kaizen.

Reducing idle context cost (skill invocation policy)

Every installed skill costs a little context on every session: the agent sees each skill's name and description before you ever use it. If you rarely use this skill in a given project, you can keep it installed and still explicitly invocable while hiding it from the model's automatic listing.

Doing so does not modify this skill's source repo — the policy lives in your local agent settings. The Agent Kaizen skill-drafting repo ships the policy manager as the policy subcommand of its all-in-one scripts/skill_builder.py, which sets this for all your skills at once:

python skill_builder.py policy audit     # list every skill + its current policy + idle cost
python skill_builder.py policy plan      # write a decision file with recommendations (nothing applied)
# edit that decision file: set selected_policy + approved:true for the skills you choose
python skill_builder.py policy preview   # show the exact change
python skill_builder.py policy apply     # apply ONLY what you approved (backup + rollback recorded)
python skill_builder.py policy restore   # roll back

Claude Code (works today): writes skillOverrides: { "<skill>": "user-invocable-only" } to .claude/settings.local.json — zero idle listing cost, still available from the /skills menu. Start a new session for it to take effect; invoke it any time via /skills.
Codex (currently unreliable): explicit-only Codex skills are affected by an open bug (openai/codex#23454) where $skill invocation of an explicit-only local skill can fail. Until it's fixed, leave Codex skills implicit, or fully disable rarely-used ones with [[skills.config]] enabled = false in config.toml. The manager audits Codex but does not change Codex policy.

License

Licensed under AGPL-3.0, matching the Agent Kaizen project.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Skill Drafting — Building and Maintaining Agent Skills

What this skill covers

What's inside

Scripts

Subcommands

`claude_usage_check.py` and `codex_usage_check.py` (+ `codex_usage_bridge.cmd`) — usage checks

Use it

The linked-skills workflow

Skill catalog

Reducing idle context cost (skill invocation policy)

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
references		references
scripts		scripts
.gitignore		.gitignore
GOTCHA.md		GOTCHA.md
LICENSE		LICENSE
README.md		README.md
SKILL.md		SKILL.md

Folders and files

Latest commit

History

Repository files navigation

Skill Drafting — Building and Maintaining Agent Skills

What this skill covers

What's inside

Scripts

Subcommands

claude_usage_check.py and codex_usage_check.py (+ codex_usage_bridge.cmd) — usage checks

Use it

The linked-skills workflow

Skill catalog

Reducing idle context cost (skill invocation policy)

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`claude_usage_check.py` and `codex_usage_check.py` (+ `codex_usage_bridge.cmd`) — usage checks

Packages