Abilityai · dolho · Jun 25, 2026 · Jun 25, 2026
@@ -17,16 +17,30 @@ name: Schema Parity Gate
 # doing work. When a relevant file changes, the real test runs and can fail.
 # Change detection is pure `git diff` — no third-party action and no token
 # scope beyond `contents: read`.
+#
+# CROSS-TRACK ALEMBIC GUARD (#1342): the pytest above guards only the SQLite
+# track. PostgreSQL is a SECOND track — `init_database()` runs
+# `alembic_runner.upgrade_to_head()`, which applies revision files only (it does
+# NOT autogenerate from `tables.py`). A SQLite-only schema change therefore
+# merges green yet PG-broken. The extra step below (`scripts/ci/check_alembic_parity.py`)
+# fails a PR that ADDS schema DDL to db/{migrations,schema,tables}.py without a
+# net-new revision under src/backend/migrations/versions/. Heuristic & false-
+# positive guard (comments / data-only / down migrations carry no DDL keyword,
+# so they don't trip) are documented in that script's module docstring. PR-only
+# (it needs the PR base/head to diff); pure stdlib, no extra deps.
 
 on:
   pull_request:
   push:
     branches: [main, dev]
     paths:
       - 'src/backend/db/**'
+      - 'src/backend/migrations/**'
       - 'src/backend/database.py'
       - 'src/backend/utils/helpers.py'
       - 'tests/unit/test_schema_parity.py'
+      - 'tests/unit/test_alembic_parity_guard.py'
+      - 'scripts/ci/check_alembic_parity.py'
       - 'tests/requirements-test.txt'
       - '.github/workflows/schema-parity.yml'
   workflow_dispatch:
@@ -42,6 +56,15 @@ jobs:
         with:
           fetch-depth: 0  # need base history to diff the PR's changed paths
 
+      # Cross-track guard (#1342): SQLite-track DDL must ship a paired Alembic
+      # revision. PR-only — needs the PR base/head SHAs to diff. Pure stdlib.
+      - name: Alembic (Postgres) parity guard
+        if: github.event_name == 'pull_request'
+        env:
+          PR_BASE_SHA: ${{ github.event.pull_request.base.sha }}
+          PR_HEAD_SHA: ${{ github.event.pull_request.head.sha }}
+        run: python3 scripts/ci/check_alembic_parity.py
+
       - name: Detect schema-relevant changes
         id: changes
         env:
@@ -64,7 +87,7 @@ jobs:
           fi
           CHANGED="$(git diff --name-only "$PR_BASE_SHA...$PR_HEAD_SHA")"
           echo "Changed files in PR:"; echo "$CHANGED"
-          if printf '%s\n' "$CHANGED" | grep -qE '^(src/backend/db/|src/backend/database\.py$|src/backend/utils/helpers\.py$|tests/unit/test_schema_parity\.py$|tests/requirements-test\.txt$|\.github/workflows/schema-parity\.yml$)'; then
+          if printf '%s\n' "$CHANGED" | grep -qE '^(src/backend/db/|src/backend/database\.py$|src/backend/utils/helpers\.py$|tests/unit/test_schema_parity\.py$|tests/unit/test_alembic_parity_guard\.py$|scripts/ci/check_alembic_parity\.py$|tests/requirements-test\.txt$|\.github/workflows/schema-parity\.yml$)'; then
             echo "relevant=true" >> "$GITHUB_OUTPUT"
             echo "schema-relevant files changed — running the parity check."
           else
@@ -98,4 +121,4 @@ jobs:
           REDIS_URL: "redis://test:test@redis:6379"
           REDIS_PASSWORD: "test"
           REDIS_BACKEND_PASSWORD: "test"
-        run: pytest tests/unit/test_schema_parity.py -v --tb=short
+        run: pytest tests/unit/test_schema_parity.py tests/unit/test_alembic_parity_guard.py -v --tb=short
diff --git a/docs/memory/architecture.md b/docs/memory/architecture.md
@@ -785,7 +785,7 @@ These are structural patterns that must be preserved. Breaking them causes casca
 
 2. **DB Layer: Class-per-domain with Mixin Composition** — Each `db/` file defines an `XOperations` class. Agent-specific settings use mixins (`db/agent_settings/`) composed into `AgentOperations`. New agent settings → new mixin, not a bigger class.
 
-3. **Schema in `db/schema.py`, Migrations in `db/migrations.py`** — All OSS table DDL lives in `schema.py`. Schema changes require a versioned migration in `migrations.py` (tracked in the `schema_migrations` table). Never create tables ad-hoc in service code. **Runner safety (#1160):** `init_database()` wraps both migration passes + `init_schema` in a cross-process `flock` (`db/migration_lock.py`) so concurrent uvicorn workers + the scheduler container can't race the suite; table-rebuild migrations use `_atomic_rebuild` (rename-swap inside an explicit `BEGIN`/`COMMIT`) so a crash mid-rebuild rolls back instead of losing rows between an autocommitted `DROP` and the re-insert; a failed migration is named on its traceback (`add_note`) and surfaced as `first_pending` in the `/health` 503. **Backend split (#1183):** the `db/migrations.py` bespoke runner (PRAGMA + `INSERT OR IGNORE`) is **SQLite-only**. The PostgreSQL backend is owned by **Alembic** — `init_database()`'s non-SQLite branch calls `db/alembic_runner.upgrade_to_head()` (`src/backend/migrations/` + `alembic.ini`; `env.py` targets the `db/tables.py` MetaData). A fresh PG DB is built by the `0001_baseline` revision (which reuses the exact `init_schema_postgres` head DDL); a pre-Alembic PG DB is stamped at baseline, not rebuilt. The two systems coexist during the SQLite→Postgres transition, so a schema change currently lands in **both** `migrations.py` (SQLite) and a new Alembic revision (Postgres) until SQLite reaches **end-of-support on September 1, 2026** (decision #1278; migration guide `docs/migrations/SQLITE_TO_POSTGRES.md`) — after which the long-term goal (#746) is `tables.py` MetaData as the single source with autogenerated revisions. **Two-track migrations (open-core):** enterprise modules own only `enterprise_*` tables and migrate them through a **separate** runner (`enterprise/backend/_migrations.py`) tracked in `enterprise_schema_migrations` — never the OSS `schema_migrations`, so the two version-lines can't collide. Enterprise authors one file per migration in the module's `migrations/` package (`NNNN_slug.py` with `NAME` + `upgrade(cursor, conn)`, auto-discovered in filename order). Enterprise migrations may FK-into OSS tables but must **never ALTER** an OSS table — anything OSS must enforce goes through an OSS migration as an edition-agnostic primitive (e.g. `users.suspended_at`, #995). The enterprise runner is invoked from `register_enterprise` *after* OSS `init_database`, so OSS tables already exist.
+3. **Schema in `db/schema.py`, Migrations in `db/migrations.py`** — All OSS table DDL lives in `schema.py`. Schema changes require a versioned migration in `migrations.py` (tracked in the `schema_migrations` table). Never create tables ad-hoc in service code. **Runner safety (#1160):** `init_database()` wraps both migration passes + `init_schema` in a cross-process `flock` (`db/migration_lock.py`) so concurrent uvicorn workers + the scheduler container can't race the suite; table-rebuild migrations use `_atomic_rebuild` (rename-swap inside an explicit `BEGIN`/`COMMIT`) so a crash mid-rebuild rolls back instead of losing rows between an autocommitted `DROP` and the re-insert; a failed migration is named on its traceback (`add_note`) and surfaced as `first_pending` in the `/health` 503. **Backend split (#1183):** the `db/migrations.py` bespoke runner (PRAGMA + `INSERT OR IGNORE`) is **SQLite-only**. The PostgreSQL backend is owned by **Alembic** — `init_database()`'s non-SQLite branch calls `db/alembic_runner.upgrade_to_head()` (`src/backend/migrations/` + `alembic.ini`; `env.py` targets the `db/tables.py` MetaData). A fresh PG DB is built by the `0001_baseline` revision (which reuses the exact `init_schema_postgres` head DDL); a pre-Alembic PG DB is stamped at baseline, not rebuilt. The two systems coexist during the SQLite→Postgres transition, so a schema change currently lands in **both** `migrations.py` (SQLite) and a new Alembic revision (Postgres) — both tracks CI-guarded by the `schema-parity` job (SQLite parity pytest + the `scripts/ci/check_alembic_parity.py` cross-track guard that fails a DDL change missing its Alembic revision, #1342) — until SQLite reaches **end-of-support on September 1, 2026** (decision #1278; migration guide `docs/migrations/SQLITE_TO_POSTGRES.md`) — after which the long-term goal (#746) is `tables.py` MetaData as the single source with autogenerated revisions. **Two-track migrations (open-core):** enterprise modules own only `enterprise_*` tables and migrate them through a **separate** runner (`enterprise/backend/_migrations.py`) tracked in `enterprise_schema_migrations` — never the OSS `schema_migrations`, so the two version-lines can't collide. Enterprise authors one file per migration in the module's `migrations/` package (`NNNN_slug.py` with `NAME` + `upgrade(cursor, conn)`, auto-discovered in filename order). Enterprise migrations may FK-into OSS tables but must **never ALTER** an OSS table — anything OSS must enforce goes through an OSS migration as an edition-agnostic primitive (e.g. `users.suspended_at`, #995). The enterprise runner is invoked from `register_enterprise` *after* OSS `init_database`, so OSS tables already exist.
 
 4. **Router Registration Order Matters** — In `main.py`, static routes like `/api/agents/context-stats` must come before `/{name}` catch-all. New collection-level agent endpoints must be registered before parameterized routes.
 

diff --git a/scripts/ci/check_alembic_parity.py b/scripts/ci/check_alembic_parity.py
@@ -0,0 +1,247 @@
+#!/usr/bin/env python3
+"""Cross-track schema-parity guard: SQLite DDL change ⇒ Alembic revision (#1342).
+
+The `schema-parity` pytest suite only validates the **SQLite** track
+(`db/migrations.py` ↔ `db/schema.py`). Per Architectural Invariant #3, every
+schema change must ALSO ship a new **Alembic** revision under
+`src/backend/migrations/versions/` — otherwise PostgreSQL breaks, because
+`init_database()` runs `alembic_runner.upgrade_to_head()`, which applies
+revision files only (it does not autogenerate from `tables.py`). With no guard,
+a SQLite-only schema change merges green yet PG-broken.
+
+Heuristic (documented for the false-positive guard, AC #4):
+
+  A PR that registers a NEW SQLite migration carrying schema DDL MUST also ADD
+  a net-new file under `src/backend/migrations/versions/`. Two conjuncts:
+
+  (1) NEW MIGRATION — an added line in `db/migrations.py` registers a net-new
+      entry in the `MIGRATIONS` list: `("<name>", _migrate_<fn>),`. This is the
+      unambiguous "I'm shipping a schema change" act; runner refactors, lock
+      changes, and table-rebuild helpers do NOT add an entry.
+  (2) DDL — an *added* (`+`), *non-comment* diff line in
+      `db/{migrations,schema,tables}.py` carries a schema keyword:
+        - SQL (migrations.py / schema.py): CREATE TABLE/INDEX/TRIGGER,
+          ALTER TABLE, ADD/DROP COLUMN, RENAME COLUMN/TO, DROP TABLE/INDEX.
+        - SQLAlchemy (tables.py): Column(/Table(/Index(/UniqueConstraint(/
+          ForeignKey(/PrimaryKeyConstraint(.
+
+  Requiring BOTH is what keeps the false-positive rate down — it does NOT trip on:
+    - comment / docstring-only edits (comment lines are skipped),
+    - data-only or down migrations (a new entry, but no DDL keyword),
+    - migration-runner refactors / table REBUILDS that re-emit CREATE TABLE for
+      an *existing* table without registering a new migration (conjunct 1 false).
+
+  A real column/table add always registers a new `MIGRATIONS` entry AND emits
+  `ADD COLUMN` / `CREATE TABLE` (migrations.py) or a new `Column(`/`Table(`
+  (tables.py), so it is caught. A schema.py-only DDL edit that forgets the
+  SQLite migration is caught separately (red) by the existing schema-parity
+  pytest, which then forces the migration — and this guard then forces the
+  revision.
+
+Usage:
+    check_alembic_parity.py <base_sha> <head_sha>
+    # or, with no args, reads PR_BASE_SHA / PR_HEAD_SHA from the environment.
+
+Exit codes: 0 = pass (no DDL, or DDL + revision present), 1 = missing revision,
+2 = usage / git error. Pure stdlib; no third-party deps.
+"""
+from __future__ import annotations
+
+import os
+import re
+import subprocess
+import sys
+
+# Files whose DDL changes require a paired Alembic revision.
+SQL_DDL_FILES = ("src/backend/db/migrations.py", "src/backend/db/schema.py")
+SQLALCHEMY_DDL_FILE = "src/backend/db/tables.py"
+WATCHED_FILES = (*SQL_DDL_FILES, SQLALCHEMY_DDL_FILE)
+
+# Net-new revision target.
+REVISION_DIR = "src/backend/migrations/versions/"
+
+# Raw-SQL DDL keywords (case-insensitive). Anchored on word boundaries so e.g.
+# "created_at" does not match "CREATE".
+_SQL_DDL = re.compile(
+    r"\b("
+    r"CREATE\s+(?:UNIQUE\s+)?INDEX"
+    r"|CREATE\s+TABLE"
+    r"|CREATE\s+TRIGGER"
+    r"|ALTER\s+TABLE"
+    r"|ADD\s+COLUMN"
+    r"|DROP\s+COLUMN"
+    r"|DROP\s+TABLE"
+    r"|DROP\s+INDEX"
+    r"|RENAME\s+COLUMN"
+    r"|RENAME\s+TO"
+    r")\b",
+    re.IGNORECASE,
+)
+
+# SQLAlchemy schema constructs in tables.py (the Alembic MetaData source).
+_SQLALCHEMY_DDL = re.compile(
+    r"\b(Column|Table|Index|UniqueConstraint|ForeignKey|PrimaryKeyConstraint)\s*\("
+)
+
+# A net-new entry in the `MIGRATIONS = [...]` list in migrations.py:
+#   ("agent_loops_tables", _migrate_agent_loops_tables),
+# Trinity registers every migration here and names the fn `_migrate_*`.
+_MIGRATIONS_ENTRY = re.compile(r"""\(\s*["'][^"']+["']\s*,\s*_migrate_\w+""")
+
+
+def _patterns_for(path: str) -> re.Pattern | None:
+    if path == SQLALCHEMY_DDL_FILE:
+        return _SQLALCHEMY_DDL
+    if path in SQL_DDL_FILES:
+        return _SQL_DDL
+    return None
+
+
+def _is_comment(content: str) -> bool:
+    """A line we treat as a comment for DDL-detection purposes."""
+    stripped = content.lstrip()
+    return stripped.startswith("#")
+
+
+def iter_added_lines(diff_text: str):
+    """Yield (path, content) for every added line of a unified diff.
+
+    Tracks the current file from `+++ b/<path>` headers; skips the `+++`/`---`
+    headers themselves. Robust to multi-file diffs.
+    """
+    current: str | None = None
+    for line in diff_text.splitlines():
+        if line.startswith("+++ "):
+            target = line[4:].strip()
+            if target == "/dev/null":
+                current = None
+            else:
+                # strip the "b/" prefix git uses
+                current = target[2:] if target.startswith("b/") else target
+            continue
+        if line.startswith("--- ") or line.startswith("diff --git"):
+            continue
+        if line.startswith("+") and not line.startswith("+++"):
+            if current is not None:
+                yield current, line[1:]
+
+
+def find_ddl_evidence(diff_text: str) -> list[str]:
+    """Return human-readable evidence strings for added DDL lines."""
+    evidence: list[str] = []
+    for path, content in iter_added_lines(diff_text):
+        pattern = _patterns_for(path)
+        if pattern is None or _is_comment(content):
+            continue
+        match = pattern.search(content)
+        if match:
+            evidence.append(f"{path}: + {content.strip()[:120]}")
+    return evidence
+
+
+def diff_has_ddl(diff_text: str) -> bool:
+    return bool(find_ddl_evidence(diff_text))
+
+
+def added_migration_entries(diff_text: str) -> list[str]:
+    """Net-new `MIGRATIONS` list entries added in db/migrations.py."""
+    entries: list[str] = []
+    for path, content in iter_added_lines(diff_text):
+        if path != "src/backend/db/migrations.py" or _is_comment(content):
+            continue
+        if _MIGRATIONS_ENTRY.search(content):
+            entries.append(content.strip())
+    return entries
+
+
+def added_revision_files(name_status_lines: list[str]) -> list[str]:
+    """Net-new (`A`) files under the Alembic versions dir, excluding __init__."""
+    found: list[str] = []
+    for raw in name_status_lines:
+        parts = raw.split("\t")
+        if len(parts) < 2:
+            continue
+        status, path = parts[0], parts[-1]
+        if not status.startswith("A"):
+            continue
+        if path.startswith(REVISION_DIR) and path.endswith(".py") and not path.endswith("__init__.py"):
+            found.append(path)
+    return found
+
+
+def is_schema_change(ddl_diff_text: str) -> bool:
+    """A new SQLite migration registering schema DDL (conjuncts 1 AND 2)."""
+    return bool(added_migration_entries(ddl_diff_text)) and diff_has_ddl(ddl_diff_text)
+
+
+def would_block(ddl_diff_text: str, name_status_lines: list[str]) -> bool:
+    """Core decision: a DDL-bearing new migration with no net-new revision."""
+    return is_schema_change(ddl_diff_text) and not added_revision_files(name_status_lines)
+
+
+def _git(args: list[str]) -> str:
+    return subprocess.run(
+        ["git", *args], check=True, capture_output=True, text=True
+    ).stdout
+
+
+def main(argv: list[str]) -> int:
+    base = argv[1] if len(argv) > 2 else os.environ.get("PR_BASE_SHA", "")
+    head = argv[2] if len(argv) > 2 else os.environ.get("PR_HEAD_SHA", "")
+    null_sha = "0" * 40
+    if not base or not head or base == null_sha:
+        print("alembic-parity: no PR base/head available — skipping (pass).")
+        return 0
+
+    rng = f"{base}...{head}"
+    try:
+        ddl_diff = _git(["diff", rng, "--", *WATCHED_FILES])
+        name_status = _git(["diff", "--name-status", rng]).splitlines()
+    except subprocess.CalledProcessError as exc:
+        print(f"alembic-parity: git diff failed: {exc.stderr}", file=sys.stderr)
+        return 2
+
+    new_migrations = added_migration_entries(ddl_diff)
+    evidence = find_ddl_evidence(ddl_diff)
+    if not new_migrations:
+        print(
+            "alembic-parity: no net-new MIGRATIONS entry — runner refactor / rebuild /\n"
+            "non-schema edit, nothing to guard (pass)."
+        )
+        return 0
+    if not evidence:
+        print(
+            "alembic-parity: new migration is data-only (no DDL keyword) — "
+            "no Alembic revision required (pass)."
+        )
+        return 0
+
+    revisions = added_revision_files(name_status)
+    print("alembic-parity: new schema migration detected in this PR:")
+    for entry in new_migrations[:10]:
+        print(f"  + {entry}")
+    print("alembic-parity: carrying DDL:")
+    for line in evidence[:20]:
+        print(f"  • {line}")
+    if revisions:
+        print("alembic-parity: paired Alembic revision(s) added:")
+        for rev in revisions:
+            print(f"  ✓ {rev}")
+        print("alembic-parity: PASS — both tracks updated.")
+        return 0
+
+    print(
+        "\nalembic-parity: FAIL — this PR adds schema DDL on the SQLite track but\n"
+        f"adds no new Alembic revision under {REVISION_DIR}\n"
+        "PostgreSQL applies revision files only (it does NOT autogenerate from\n"
+        "tables.py), so this change would break PG. Add a revision per\n"
+        "Architectural Invariant #3 (see docs/migrations/SQLITE_TO_POSTGRES.md).\n"
+        "If this is a data-only/down migration or a non-DDL edit, ensure the\n"
+        "added lines carry no DDL keyword (see the heuristic in this script).",
+        file=sys.stderr,
+    )
+    return 1
+
+
+if __name__ == "__main__":
+    raise SystemExit(main(sys.argv))