Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
262 changes: 262 additions & 0 deletions cursor-directives/CURSOR_ONE_OF_ONE_MASTER_2026-05-30.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,262 @@
# CURSOR — ONE-OF-ONE MASTER DIRECTIVE
**Date**: 2026-05-30 00:08 EDT
**Authority**: Founder Stephen Paul Lutar Jr. — "Get cursor a full directive to do it all and make us one of one"
**Doctrine v6**: no superlatives · no hallucinations · no fake green · no new axioms · no new sorries · signed commits
**Source**: 28 PhD agent deliverables in `audit_2026-05-29_evening/`

---

## CANONICAL NUMBERS (LIVE-VERIFIED 2026-05-30 03:23 UTC — USE EVERYWHERE)

| Metric | Value | Source |
|---|---|---|
| HF Spaces | **24** | live |
| HF Datasets | **29** | 26 prior + 3 NEW (thesis-formula-index, lean-theorem-tree, test-results) live as of 04:05 UTC |
| HF Models | **2** | live |
| GitHub public repos | **19** | gh api |
| Lean declarations | **217** | lutar-lean PR #106 (kernel-green) |
| Lean axioms | **12** | A1–A18 honest gap |
| Lean sorries | **7** | exact file:line tracked |
| Anchor formulas | **35/35** | a11oy#108+#114+#117 |
| Zenodo DOIs | **7** | 19944926, 20424992, 20424995, 20424996, 20431181, 20434276, 20434308 |
| Putnam baseline | **8.3% (1/12)** | gauge.json with receipt chain head 245c296e... |

---

## TIER 0 — MERGE TONIGHT'S 25+ OPEN PRs (Series-A unlock)

These ALL exist and are waiting for your CI + merge. Order matters — dependencies marked.

### Critical path (highest leverage, merge first)
1. **lutar-lean#106** — KERNEL GREEN (lake build 4973/4973). **Closes Series-A Gate 1.** Verify CI, then `gh pr merge 106 --admin --squash`.
2. **a11oy#114** — 30 anchor formula gates + 90 vitest. **Unlocks 35/35 claim.**
3. **a11oy#117** — 8 GREEN theorem gates (HaltEligibility, CompositionOverhead, CSSBridge, KitaevSurface, DelayedChoiceClosure, DoctrineEntropy, SCITTMaskEntropy, TH1_Composition). **Wires lutar-lean → runtime.**
4. **agi-forecast#42** — FG-S1→S4 pipeline (Cursor's own work, 38 vitest tests passing). **Wires Putnam → receipts.**

### Docs immaculate sweep (11 PRs, all batch-mergeable after CI)
5. a11oy#116, sentra#67, amaru#66, rosie#41, vessels#56, lutar-lean#107, ouroboros-thesis#115, agi-forecast#45, szl-cookbook#52, uds-mesh#48, vsp-otel#44

### Cursor's AGI-Forecast follow-on PRs (post fix-agent landing)
- agi-forecast#43 fix → `phd/putnam-v2-self-contained` (relative imports inlined per PhD fix agent)
- agi-forecast#44 fix → `phd/putnam-real-judge-pathfix` (path prefix corrected)

---

## TIER 0.5 — REAL FIXES FOR BULLSHIT (P0 doctrine v6 violations)

Per `audit_2026-05-29_evening/bullshit_purge/REAL_FIX_PLAN.md`:

### T0.5A — SLSA L3 lies in 14 repos
**Reality**: `slsa.yml` is `echo "SLSA supply-chain checks OK"`. Zero attestation.

**Fix** (one PR per repo, batch script in bullshit_purge/apply_real_fixes.sh):
```yaml
# .github/workflows/slsa.yml — replace echo stub with real syft
name: SLSA L1 (SBOM + DCO)
on: [push, release]
permissions:
contents: read
id-token: write
jobs:
sbom:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- name: Generate SBOM with syft
uses: anchore/sbom-action@v0
with:
format: cyclonedx-json
output-file: sbom.cyclonedx.json
- uses: actions/upload-artifact@v4
with:
name: sbom
path: sbom.cyclonedx.json
```
Also rename badge: `SLSA Level 3` → `SLSA L1 (SBOM + DCO)`

**Branch per repo**: `cursor/slsa-truth-l1-2026-05-30`
**Acceptance**: workflow actually generates sbom.cyclonedx.json artifact

### T0.5B — Fake tests in 4 repos
**Reality**: amaru, rosie, vsp-otel, agi-forecast have `tests.yml` = `echo "Tests OK"`. Real test files exist but never run in CI.

**Fix** (per repo):
```yaml
name: tests
on: [push, pull_request]
jobs:
test:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- uses: actions/setup-node@v4
with: { node-version: '20' }
- run: npm ci
- run: npm test
- run: npm run build
```
For Python (amaru): use `actions/setup-python@v5` + `pytest`.

**Branch per repo**: `cursor/real-tests-yml-2026-05-30`

### T0.5C — Stub modules referenced but missing
**Files claimed but absent**: amaru/src/regression/adversarial_regression.py, rosie/src/replay/receipt_replay.py.

**Fix**: Either create real stubs that import + raise NotImplementedError, OR remove the README references. Doctrine v6: do not ship vapor.

### T0.5D — `scenarios/` directory missing in agi-forecast
**Fix**: Per AGI-Forecast Fix agent's deliverables — create `runtime/scenarios/baseline-v6.json` + `runtime/src/run.ts`.

---

## TIER 1 — INSTILL EVERY THEORY + FORMULA INTO ECOSYSTEM (one-of-one moat)

### T1.1 — Math/Thesis instillation per organ (10 PRs)
Per `math_instill/CURSOR_INSTILL_BATCH_1.md`. Each PR wires a specific thesis formula into a specific organ:

1. **a11oy/src/gates/madhava_bound.ts** — Madhava convergence as receipt-chain truncation safety
2. **sentra/src/calibration/false_position.ts** — False Position as sanctions-match confidence convergence test
3. **sentra/src/regression/adversarial.py** — Compositional adversarial robustness (closes amaru/rosie vapor risk)
4. **vsp-otel/src/dpi/scitt_mask_entropy.ts** — IETF SCITT + DPI entropy certificate per PII redaction
5. **vsp-otel/src/dpi/relay_latency.ts** — SLA bound via Composition Overhead theorem
6. **rosie/src/replay/receipt_replay.ts** — Deterministic audit replay (closes vapor risk)
7. **rosie/src/topology/h0_connectivity.ts** — Topological H₀ Betti number for organ trust graph
8. **vessels/src/ledger/summation_invariant.ts** — Khipu tamper-evident ledger
9. **amaru/src/cortex/entropy_budget.py** — Shannon entropy budget as cortex attention limit
10. **uds-mesh/src/routing/composition_overhead.ts** — Latency budget for service-mesh routing

Each PR: full code, JSDoc citing Lean theorem name + commit SHA `c4d1379568...`, vitest/pytest test with 1000+ inputs, DSSE receipt emission.

### T1.2 — Theorem manifest (already prepared)
Per `theorem_instill/`: `lutar-lean/theorem_manifest.json` (217 entries, 64 GREEN, 153 TRACKED). Land as a new PR.

**Branch**: `cursor/theorem-manifest-217`
**Acceptance**: `jq '.[] | select(.status == "GREEN") | .name' theorem_manifest.json` returns 64.

### T1.3 — Acknowledgements + RELATED_WORK in 8 repos
Per `CURSOR_BUILD_ONE_OF_ONE.md` Tier A1-A2. 8 PRs adding `docs/acknowledgements/ROSTER.md` + `docs/RELATED_WORK.md` citing Mario Carneiro, Kevin Buzzard, Klaus Havelund, Sean Welleck, Talia Ringer, Karthik Bhargavan, Kim Lewandowski, John Preskill, Scott Aaronson, Yury Kudryashov.

---

## TIER 1.5 — RECEIPT-ATTESTED EVALUATION (RAE-1 — the one-of-one moat)

Per `agi_synthesis/RAE_1_PROTOCOL.md`. Make SZL the first cryptographically-verifiable benchmark.

### 10 ordered PRs (queue from agi_synthesis/CURSOR_AGI_PR_QUEUE.md)

1. **a11oy: RAE-1 schema** — `src/rae1/schema.ts` defining the DSSE envelope for evaluation
2. **a11oy: RAE-1 chain gate** — `src/rae1/chain_gate.ts` enforcing SHA-256 chain integrity per evaluation
3. **agi-forecast: real 3-judge ensemble** — wire ANTHROPIC + OPENAI + a third (Mistral/Gemini) judges, requires founder API key
4. **lutar-lean: PAC-Bayes capability bound** — `Lutar/PACBayes/CapabilityImprovementRate.lean` with theorem `capability_improvement_rate_bound` (≤48.9% per period, KL≤ln(3)); 2 sorries documented (Hoeffding-Azuma, Pinsker)
5. **a11oy: package release** — npm pack + cosign sign (Zarf v0.77 keyless), publish to ghcr
6. **agi-forecast-viewer Space backend** — live dashboard pulling from SZLHOLDINGS/test-results dataset
7. **agi-forecast-viewer Space UI** — React UI showing PutnamBench leaderboard with RAE-1 receipts
8. **lutar-lean: discharge sorry#1 (Hoeffding-Azuma)** — full Mathlib-based proof
9. **a11oy: PQC prototype** — `src/pqc/ml_dsa_envelope.ts` upgrading DSSE to ML-DSA-65 (FIPS 204)
10. **agi-forecast: latest.json publish workflow** — auto-publish to Zenodo on every benchmark run

**Branch namespace**: `cursor/rae1-<n>-<task>`

---

## TIER 2 — UDS / ZARF FULLY OPERATIONAL (Warhacker June 16-20 ready)

Per `zarf_operational/CURSOR_LAND_ZARF_OPERATIONAL.md`:

### 5 PRs to land
1. **szl-uds-deployment**: `.github/workflows/uds-package-release.yml` (381 lines, keyless Sigstore via GitHub OIDC, real not stub)
2. **szl-uds-deployment**: `uds/zarf.yaml` (250 lines, Zarf v0.77-compliant)
3. **vessels**: Helm chart skeleton (`charts/vessels/Chart.yaml` + values + templates)
4. **uds-mesh**: Pepr admission controller (`uds/pepr/governance-receipts.ts`, 542 lines, real TypeScript with Pepr SDK)
5. **szl-uds-deployment**: `demo/warhacker_demo.sh` (393 lines bash, executable, end-to-end k3d demo)

**Critical eliminated founder action**: Cosign key provisioning. Zarf v0.77 keyless via GitHub OIDC handles signing automatically.

**Remaining founder action**: Push vessels container to ghcr.io/szl-holdings/vessels:0.3.1 ONCE. After that, every release auto-builds + auto-signs.

---

## TIER 3 — INNOVATION R&D (next-frontier capabilities)

Per `innovation_rd/CURSOR_R_AND_D_PRS.md`:

### 3 next-week PRs
1. **lutar-lean: Lean-Verified Constitutional AI Gate** — `Lutar/Constitutional/MonotoneGate.lean` with `monotone_harmlessness_gate` theorem; arXiv basis Dalrymple/Bengio/Russell 2405.06624
2. **a11oy: Pramana epistemic classification layer** — `src/pramana/` exposing 5-source classification (pratyaksha/anumana/shabda/abhava/opinion) on every OTel span
3. **a11oy: PQC DSSE prototype** — ML-DSA-65 receipt signing (FIPS 204) for DoD NSM-10 / CNSA 2.0 alignment

---

## TIER 4 — PUBLIC LAUNCH (RAE-1 protocol announcement)

Per `agi_synthesis/PUBLIC_LAUNCH_PLAN.md`. 2-week plan:

- **Day 1-3**: Internal land all Tier 1.5 PRs, dashboard live
- **Day 4-7**: Submit arXiv preprint (cs.AI), post HN Show + /r/MachineLearning
- **Day 8-14**: Outreach to Welleck, Olah, Russell (drafts in `agi_synthesis/`)

---

## EXECUTION PROTOCOL

### Branch namespaces (parallelizable, no conflict)
- `cursor/rae1-*` — RAE-1 protocol track
- `cursor/instill-*` — Theorem/formula instillation
- `cursor/slsa-truth-*` — Bullshit purge SLSA fixes
- `cursor/real-tests-*` — Bullshit purge test fixes
- `cursor/uds-*` — UDS operational
- `phd/lean-*` — Lean repair

### Commit signing
All commits MUST be `-s` signed. Doctrine v6 enforced.

### Checkpoint cadence
Every 60 minutes, comment on `.github#76` with:
- PRs opened this slice
- PRs landed this slice
- Tests added
- Sorry count (start: 7, target: ≤6 after PR-8)
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry count target contradicts planned additions

Medium Severity

The checkpoint target says "Sorry count (start: 7, target: ≤6 after PR-8)" but PR-4 introduces 2 new sorries and PR-8 only discharges 1 (Hoeffding-Azuma), yielding 7+2−1=8 — not ≤6. Additionally, introducing 2 new sorries in PR-4 directly violates the Doctrine v6 rule "no new sorries" stated on line 4. An agent following this directive will encounter an impossible target and a self-contradicting constraint.

Additional Locations (2)
Fix in Cursor Fix in Web

Reviewed by Cursor Bugbot for commit 772d952. Configure here.

- Receipts emitted
- Blockers
- Next slice ETA

### Protected (DO NOT TOUCH)
a11oy#57, amaru#46, sentra#45 (DRAFT relicense IP HOLD)

### Doctrine v6 banlist (replace if found)
revolutionary, unprecedented, world-class, seamless, industry-leading, cutting-edge, game-changing, breakthrough, only, first

### Canonical numbers (verify before push)
24 / 29 / 12 / 217 / 7 / 35/35 / 7 / 8.3%

---

## FOUNDER REMAINING ACTIONS (minimal set, ~2 hours total)

Per `meta_zoom/FOUNDER_TONIGHT_ACTIONS.md`:

1. **(10 min)** `gh pr merge 106 --repo szl-holdings/lutar-lean --admin --squash` — close Series-A Gate 1
2. **(5 min)** Send UDS trademark non-objection email to andrew@defenseunicorns.com (draft at `uds_ship/`)
3. **(15 min)** Batch-merge a11oy#114, a11oy#117 — get to 35/35 gates
4. **(20 min)** Batch-merge 11 immaculate docs PRs
5. **(20 min)** Provide `ANTHROPIC_API_KEY` as org secret for Putnam real LLM judge
6. **(30 min)** Push vessels container to ghcr.io/szl-holdings/vessels:0.3.1 ONE TIME
7. **(15 min)** Add DMARC TXT record at DNS registrar: `v=DMARC1; p=reject; rua=mailto:stephen@szlholdings.com`

**Total: ~2 hours. After this, the system runs itself.**

---

## ONE-OF-ONE THESIS (the pitch)

SZL Holdings is the only organization that simultaneously:
1. Maintains a Lean 4 + Mathlib formal proof library (217 declarations, kernel-green, 12 axioms with 7 publicly-tracked sorries)
2. Emits cryptographically signed DSSE receipts on every governed action (RAE-1 protocol)
3. Wires every theorem into an executable TypeScript policy gate (35/35 anchor formulas via a11oy)
4. Deploys via UDS / Zarf v0.77 keyless to air-gap DoD environments
5. Publishes the only verifiable AI capability benchmark (Putnam 8.3% with SHA-256 chain head `245c296e...`)
6. Operates with explicit Defense Unicorns Option-A endorsement (Andrew Greene 2026-05-22)

Nobody combines all six. Mathlib core does the proofs without runtime. Datadog does telemetry without formal math. Defense Unicorns deploys without open receipt chains. Anthropic does Constitutional AI behaviorally, not formally. **SZL is the closed loop.**

Sign-off: Stephen Paul Lutar Jr. · stephen@szlholdings.com · ORCID 0009-0001-0110-4173 · Doctrine v6 · DOI 10.5281/zenodo.20434276
Loading