Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 6 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,11 @@
# Changelog

## 0.70.0 - 2026-06-01

- Add Claude Code settings, agentic workflow, and GitHub Actions hardening checks to `contextforge doctor`.
- Expand proof-pack and scorecard evidence commands so first-readiness reports point directly at `claude-audit`, `workflow-audit`, and `actions-audit` SARIF/Markdown proof.
- Refresh README, doctor/proof-pack/scorecard docs, LLM discovery files, and research notes around one-command hardening evidence for Codex and Claude handoffs.

## 0.69.0 - 2026-06-01

- Add `contextforge actions-audit --summary contextforge-actions-audit.md --sarif contextforge-actions.sarif` for GitHub Actions hardening proof.
Expand Down
24 changes: 16 additions & 8 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,17 +22,23 @@ safely?**
## 30-second proof

```bash
contextforge doctor --summary contextforge-doctor.md
contextforge scorecard --output contextforge-scorecard.md
contextforge audit --min-context-score 70 --min-cache-score 70 --min-security-score 80
contextforge surface-map --output contextforge-agent-surface-map.md
contextforge surface-inventory --output contextforge-agent-surface-inventory.md
contextforge surface-diff --base main --output contextforge-agent-surface-diff.md
```

The audit gates context health, cache stability, and prompt/context poisoning.
The surface map shows exactly which agent-facing files are covered before a
maintainer has to read every doc. The inventory shows the agent-readable files
that are actually present in the current repository. The diff shows which
agent-readable files changed in a PR before reviewers trust the new context.
The doctor report now gives one first-readiness answer across context health,
security benchmark fixtures, MCP exposure, Claude Code settings, agentic
workflows, GitHub Actions hardening, public proof, launch assets, and community
health. The scorecard is the one-screen README/PR view. The audit gates context
health, cache stability, and prompt/context poisoning. The surface map shows
exactly which agent-facing files are covered before a maintainer has to read
every doc. The inventory shows the agent-readable files that are actually
present in the current repository. The diff shows which agent-readable files
changed in a PR before reviewers trust the new context.

| Agent stack | Surfaces ContextForge checks |
| --- | --- |
Expand Down Expand Up @@ -353,7 +359,7 @@ contextforge pack --task "review auth regression" --budget 20000 --sessions --ou
Or use the GitHub Action before npm publishing is complete:

```yaml
- uses: grnbtqdbyx-create/contextforge@v0.69.0
- uses: grnbtqdbyx-create/contextforge@v0.70.0
with:
min-context-score: 60
min-cache-score: 60
Expand Down Expand Up @@ -492,7 +498,7 @@ contextforge cost-estimate [--demo] [--json] [--summary contextforge-cost-estima
contextforge review-kit [--demo] [--base main] [--output contextforge-review-kit.md]
contextforge artifact-map [--output docs/artifacts.md]
contextforge publish-readiness [--json] [--summary contextforge-publish-readiness.md]
contextforge init [--all] [--github-action] [--pr-comment-workflow] [--agents-md] [--claude-md] [--copilot-instructions] [--project-name "My App"] [--action-ref grnbtqdbyx-create/contextforge@v0.69.0] [--force]
contextforge init [--all] [--github-action] [--pr-comment-workflow] [--agents-md] [--claude-md] [--copilot-instructions] [--project-name "My App"] [--action-ref grnbtqdbyx-create/contextforge@v0.70.0] [--force]
```

Local session scans are bounded by default. Use `--max-session-files` and
Expand Down Expand Up @@ -577,12 +583,13 @@ See [docs/research/adjacent-tools.md](docs/research/adjacent-tools.md).

## Current Status

ContextForge v0.69.0 is a public MVP CLI with:
ContextForge v0.70.0 is a public MVP CLI with:

- Claude Code and Codex JSONL fixture scanners
- bounded local session scanning fallbacks
- first-run `contextforge doctor` readiness report with JSON output
- shareable `contextforge doctor --summary` Markdown reports
- doctor, proof-pack, and scorecard hardening checks for Claude settings, agentic workflows, and GitHub Actions release safety
- shareable `contextforge proof-pack` readiness packets for launch, PR, and OSS evidence
- generated `contextforge adoption-brief` evaluator pages for first-time maintainers
- one-screen `contextforge scorecard` readiness snapshots for README, PR, and CI artifact readers
Expand Down Expand Up @@ -720,6 +727,7 @@ ContextForge v0.69.0 is a public MVP CLI with:
- **v0.67.0:** agentic workflow audits catch untrusted GitHub event text flowing into privileged AI workflows.
- **v0.68.0:** workflow audits expand attacker-controlled coverage to titles and branch/ref text.
- **v0.69.0:** GitHub Actions audits catch mutable action refs, pwn-request checkout, missing permissions, and direct script interpolation.
- **v0.70.0:** doctor, proof-pack, and scorecard reports surface Claude settings, agentic workflow, and GitHub Actions hardening evidence in one readiness path.
- **Next:** first approved npm publish and external launch outreach.

Release preparation lives in [docs/release-checklist.md](docs/release-checklist.md).
Expand Down
4 changes: 2 additions & 2 deletions contextforge-publish-readiness.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,11 +2,11 @@

Status: **warn**

Package: `contextforge@0.69.0`
Package: `contextforge@0.70.0`

| Check | Status | Detail |
| --- | --- | --- |
| Package metadata | pass | contextforge@0.69.0 is public-package ready with bin dist/cli.js |
| Package metadata | pass | contextforge@0.70.0 is public-package ready with bin dist/cli.js |
| Package provenance metadata | pass | repository, homepage, and issue tracker point at grnbtqdbyx-create/contextforge for npm provenance readers |
| Trusted publishing workflow | pass | npm Trusted Publishing uses GitHub OIDC, manual dispatch, dry-run default, and environment approval |
| Release artifact attestation | pass | GitHub artifact attestation covers the packed npm tarball before the same tarball is published |
Expand Down
6 changes: 6 additions & 0 deletions contextforge-scorecard.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,9 @@ A one-screen snapshot for maintainers, reviewers, and coding agents deciding whe
| Cache stability | pass | 100/100 with no local sessions scanned |
| Context security | pass | 100/100 from repo instruction files |
| Security benchmark | pass | 4/4 benchmark cases passing |
| Claude Code settings | pass | 100/100 with no Claude Code settings found |
| Agentic workflows | pass | 100/100 across .github/workflows/ci.yml, .github/workflows/contextforge-audit.yml, .github/workflows/npm-publish.yml |
| GitHub Actions hardening | pass | 100/100 across .github/workflows/ci.yml, .github/workflows/contextforge-audit.yml, .github/workflows/npm-publish.yml |
| GitHub workflows | pass | ci.yml, contextforge-audit.yml present |
| Public proof surfaces | pass | README.md, LICENSE, CONTRIBUTING.md, CHANGELOG.md, llms.txt, llms-full.txt, examples/demo-output.md, examples/pr-comment.md, examples/review-kit.md present |
| Launch profile surfaces | pass | demo-terminal.svg, contextforge-report.png, docs/launch-post.md, docs/comparison.md, docs/artifacts.md present |
Expand All @@ -42,6 +45,9 @@ contextforge scorecard --output contextforge-scorecard.md
contextforge proof-pack --output contextforge-proof-pack.md
contextforge review-kit --base main --output contextforge-review-kit.md
contextforge mcp-audit --summary contextforge-mcp-audit.md
contextforge claude-audit --summary contextforge-claude-audit.md --sarif contextforge-claude.sarif
contextforge workflow-audit --summary contextforge-workflow-audit.md --sarif contextforge-workflow.sarif
contextforge actions-audit --summary contextforge-actions-audit.md --sarif contextforge-actions.sarif
contextforge artifact-map --output contextforge-artifact-map.md
```

2 changes: 1 addition & 1 deletion docs/architecture.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ ContextForge is split into small modules:
- `pack`: create task-specific context packs under a token budget.
- `improve`: turn audit findings into repo-rule suggestions.
- `report`: write local HTML reports.
- `doctor`: compose first-run readiness checks across audits, benchmark fixtures, and GitHub workflow presence.
- `doctor`: compose first-run readiness checks across audits, benchmark fixtures, MCP exposure, Claude settings, agentic workflows, GitHub Actions hardening, and GitHub workflow presence.
- `security`: ignore risky paths and redact common secrets.

The CLI composes these modules without network calls by default.
Expand Down
12 changes: 12 additions & 0 deletions docs/doctor.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,12 @@ The report checks:
- context security score
- public security benchmark status
- MCP exposure status for committed MCP server configs
- Claude Code settings status for shared permissions, hooks, HTTP allowlists,
and sensitive-file deny rules
- agentic workflow status for untrusted GitHub event text reaching model-backed
jobs
- GitHub Actions hardening status for SHA pins, token permissions,
`pull_request_target`, pwn-request checkout, and direct script interpolation
- GitHub workflow presence for CI and ContextForge audit artifacts
- public proof surfaces: README, license, contribution guide, changelog,
demo output, PR comment preview, review-kit preview, and LLM discovery files
Expand Down Expand Up @@ -60,6 +66,12 @@ shell installers, unpinned package launches, auto-approval, broad tool
permissions, and symlinked config files so maintainers can review tool access as
part of the same first-run readiness report.

For Codex/Claude handoffs, the Claude settings, agentic workflow, and GitHub
Actions hardening checks keep the first-run answer honest: a repo can have
great README proof and still be unsafe if shared agent settings weaken
permissions or CI lets untrusted PR text reach privileged model or release
steps.

The Markdown summary keeps the first-run proof portable. It uses the same
doctor result as terminal and JSON output, so maintainers can publish a report
without hand-copying or reinterpreting the readiness checks.
4 changes: 2 additions & 2 deletions docs/github-action.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,7 +36,7 @@ refuses to overwrite existing files by default:

```bash
contextforge init --github-action --force
contextforge init --github-action --action-ref grnbtqdbyx-create/contextforge@v0.69.0
contextforge init --github-action --action-ref grnbtqdbyx-create/contextforge@v0.70.0
```

`contextforge init --pr-comment-workflow` writes a separate
Expand Down Expand Up @@ -71,7 +71,7 @@ jobs:
- uses: actions/checkout@93cb6efe18208431cddfb8368fd83d5badbf9bfd # v5
with:
fetch-depth: 0
- uses: grnbtqdbyx-create/contextforge@v0.69.0
- uses: grnbtqdbyx-create/contextforge@v0.70.0
with:
min-context-score: 60
min-cache-score: 60
Expand Down
4 changes: 3 additions & 1 deletion docs/proof-pack.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,5 +35,7 @@ The proof pack includes:
- doctor status and every doctor check
- audit status, context health, cache stability, context security, and cache hit ratio
- top next actions from doctor and audit evidence
- commands to rerun doctor, audit, security benchmark, and context pack creation
- commands to rerun doctor, audit, security benchmark, Claude settings audit,
agentic workflow audit, GitHub Actions hardening audit, and context pack
creation
- a short Codex/Claude handoff paragraph for the next agent session
6 changes: 6 additions & 0 deletions docs/research/adjacent-tools.md
Original file line number Diff line number Diff line change
Expand Up @@ -570,3 +570,9 @@ permissions, `pull_request_target`, pwn-request checkout, and direct shell
interpolation of untrusted GitHub contexts. ContextForge dogfoods the feature by
pinning its own workflows to full action SHAs and uploading the new Actions
SARIF beside MCP, Claude settings, and agentic workflow alerts.
ContextForge v0.70.0 folds Claude settings, agentic workflow, and GitHub
Actions hardening checks into `contextforge doctor`, then points proof-pack and
scorecard readers at the matching Markdown/SARIF rerun commands. The product
reason is simple: a first-time maintainer, Codex session, or Claude session
should not need to remember every specialized audit command before it can tell
whether the repository is safe enough for agent-assisted work.
13 changes: 9 additions & 4 deletions docs/scorecard.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,15 +18,20 @@ reader needs a fast answer to one question:
It combines:

- agent readiness score from context health, cache stability, and context security
- doctor checks for public proof, launch profile, community health, MCP exposure, and workflows
- doctor checks for public proof, launch profile, community health, MCP
exposure, Claude settings, agentic workflows, GitHub Actions hardening, and
workflow presence
- next best actions
- links to the deeper proof pack, review kit, surface diff, artifact map, and action plan
- links to the deeper proof pack, review kit, hardening audits, surface diff,
artifact map, and action plan

## CI Artifact

The reusable GitHub Action, generated audit workflow, and ContextForge dogfood
workflow upload `contextforge-scorecard.md` next to the MCP audit, proof pack,
review kit, surface diff, artifact map, SARIF, summary, badge, and JSON report.
workflow upload `contextforge-scorecard.md` next to the MCP audit, Claude
settings audit, agentic workflow audit, GitHub Actions hardening audit, proof
pack, review kit, surface diff, artifact map, SARIF, summary, badge, and JSON
report.

Use `contextforge-scorecard.md` as the first artifact to open. If it is clean,
open `contextforge-proof-pack.md` for evidence and `contextforge-review-kit.md`
Expand Down
6 changes: 3 additions & 3 deletions llms-full.txt
Original file line number Diff line number Diff line change
Expand Up @@ -134,9 +134,9 @@ cases so maintainers can see what the scanner is expected to catch.
- `docs/use-cases.md`: maintainer scenarios with commands, artifacts, and success signals
- `contextforge-suggestions.json`: machine-readable improvement suggestions for Codex, Claude, bots, and CI scripts
- `contextforge-badge.svg`: compact audit status badge generated from context, cache, and security scores
- `contextforge-doctor.md`: shareable first-run readiness checklist from `doctor --summary`
- `contextforge-proof-pack.md`: shareable doctor, audit, command, and handoff evidence from `proof-pack`
- `contextforge-scorecard.md`: one-screen Codex/Claude readiness snapshot from `scorecard`
- `contextforge-doctor.md`: shareable first-run readiness checklist from `doctor --summary`, including MCP exposure, Claude settings, agentic workflow, and GitHub Actions hardening status
- `contextforge-proof-pack.md`: shareable doctor, audit, hardening command, and handoff evidence from `proof-pack`
- `contextforge-scorecard.md`: one-screen Codex/Claude readiness snapshot from `scorecard`, including hardening checks
- `contextforge-agent-surface-map.md`: cross-agent support matrix from `surface-map`
- `contextforge-agent-surface-inventory.md`: repo-specific list of actual agent-readable files from `surface-inventory`
- `contextforge-agent-surface-diff.md`: PR-specific list of changed agent-readable files from `surface-diff`
Expand Down
13 changes: 7 additions & 6 deletions llms.txt
Original file line number Diff line number Diff line change
Expand Up @@ -5,17 +5,18 @@
ContextForge is a local-first TypeScript CLI for maintainers who use AI coding
agents. It measures token usage, audits repo context files, checks prompt-cache
stability, scans for malicious repo instructions, audits committed MCP configs,
reviews Claude Code subagents and custom slash commands, checks Codex/Claude
trace efficiency, builds task-specific context packs, and produces CI artifacts
that Codex and Claude can act on.
reviews Claude Code settings, subagents, and custom slash commands, checks
agentic GitHub workflows and GitHub Actions hardening, checks Codex/Claude trace
efficiency, builds task-specific context packs, and produces CI artifacts that
Codex and Claude can act on.

## Quick Start

- `contextforge init --all --project-name "My Repo"`: scaffold the recommended repo setup, including `AGENTS.md`, `CLAUDE.md`, and `.github/copilot-instructions.md`.
- `contextforge doctor --demo --summary contextforge-doctor.md`: run first-readiness checks and write a shareable Markdown report.
- `contextforge doctor`: verifies public proof files, launch profile assets, community health files, and next actions.
- `contextforge proof-pack --output contextforge-proof-pack.md`: combine doctor, audit, evidence commands, and Codex/Claude handoff guidance into one Markdown proof packet.
- `contextforge scorecard --output contextforge-scorecard.md`: write a one-screen Codex/Claude readiness snapshot for README, PR, and CI artifact readers.
- `contextforge doctor`: verifies public proof files, launch profile assets, community health files, MCP exposure, Claude settings, agentic workflow, GitHub Actions hardening, and next actions.
- `contextforge proof-pack --output contextforge-proof-pack.md`: combine doctor, audit, hardening evidence commands, and Codex/Claude handoff guidance into one Markdown proof packet.
- `contextforge scorecard --output contextforge-scorecard.md`: write a one-screen Codex/Claude readiness snapshot with hardening checks for README, PR, and CI artifact readers.
- `contextforge surface-map --output contextforge-agent-surface-map.md`: write a support matrix for audited Codex, Claude Code, GitHub Copilot, MCP, Cursor, Cline, Gemini CLI, and Windsurf repo surfaces.
- `contextforge surface-inventory --output contextforge-agent-surface-inventory.md`: write the actual agent-readable files present in this repo and the commands that audit them.
- `contextforge surface-diff --base main --output contextforge-agent-surface-diff.md`: write the changed agent-readable files in a PR, affected ecosystems, and follow-up checks.
Expand Down
2 changes: 1 addition & 1 deletion package.json
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
{
"name": "contextforge",
"version": "0.69.0",
"version": "0.70.0",
"description": "Agent context gate for Codex, Claude Code, GitHub Copilot, MCP, Cursor, Cline, Gemini, and Windsurf repos.",
"type": "module",
"packageManager": "pnpm@11.2.2",
Expand Down
2 changes: 1 addition & 1 deletion src/cli.ts
Original file line number Diff line number Diff line change
Expand Up @@ -920,7 +920,7 @@ Usage:
contextforge surface-inventory [--json] [--output contextforge-agent-surface-inventory.md]
contextforge surface-diff [--base main] [--json] [--output contextforge-agent-surface-diff.md]
contextforge publish-readiness [--json] [--summary contextforge-publish-readiness.md]
contextforge init [--all] [--github-action] [--pr-comment-workflow] [--agents-md] [--claude-md] [--copilot-instructions] [--project-name "My App"] [--action-ref grnbtqdbyx-create/contextforge@v0.69.0] [--force]
contextforge init [--all] [--github-action] [--pr-comment-workflow] [--agents-md] [--claude-md] [--copilot-instructions] [--project-name "My App"] [--action-ref grnbtqdbyx-create/contextforge@v0.70.0] [--force]

Session scan safety:
--max-session-files 50 newest JSONL files to scan per provider
Expand Down
Loading
Loading