PerryTS · proggeramlug · Jun 18, 2026 · Jun 18, 2026 · coderabbitai · Jun 18, 2026
diff --git a/.github/workflows/test.yml b/.github/workflows/test.yml
@@ -19,6 +19,13 @@ on:
       - '*.md'
       - '!CLAUDE.md'
       - '!CHANGELOG.md'
+  schedule:
+    # Nightly full-workspace cargo-test safety net (04:00 UTC). The per-PR
+    # cargo-test gate only exercises crates affected by the diff
+    # (scripts/ci_test_scope.py); a cross-crate regression that slips a scoped
+    # PR is caught here within a day. `schedule` is not `pull_request`, so the
+    # cargo-test job runs the full workspace.
+    - cron: '0 4 * * *'
   # Manual escape hatch for the opt-in jobs. Maintainers (write access)
   # can dispatch the workflow against any ref with `run_extended_tests=true`
   # to run parity / compile-smoke / package smokes / doc-tests on demand
@@ -288,6 +295,8 @@ jobs:
           # test builds accumulated in target/debug.
           CARGO_PROFILE_TEST_DEBUG: "0"
           CARGO_PROFILE_DEV_DEBUG: "0"
+          # For `gh pr view` (PR changed-file list → affected-crate scope).
+          GH_TOKEN: ${{ github.token }}
         run: |
           (
             while sleep 60; do
@@ -297,65 +306,93 @@ jobs:
           cargo_test_heartbeat_pid=$!
           trap 'kill "$cargo_test_heartbeat_pid" 2>/dev/null || true' EXIT
 
+          # Test scope: a per-PR run only exercises the crates the diff can
+          # affect (changed crates + their reverse-dependency closure, plus a
+          # `perry` edge for runtime-linked stdlib/ext archives) instead of the
+          # whole workspace (~90 min). Release tags, the nightly cron, and
+          # workflow_dispatch run the FULL workspace as the safety net. See
+          # scripts/ci_test_scope.py for the rules.
+          if [ "${{ github.event_name }}" = "pull_request" ]; then
+            changed_files="$(gh pr view "${{ github.event.pull_request.number }}" \
+              --json files --jq '.files[].path')"
+            echo "Changed files in PR:"; printf '%s\n' "$changed_files"
+            scope="$(printf '%s\n' "$changed_files" | python3 scripts/ci_test_scope.py)"
+          else
+            scope="$(python3 scripts/ci_test_scope.py --full </dev/null)"
+          fi
+          echo "Packages in test scope:"; printf '%s\n' "$scope"
+          if [ -z "$scope" ]; then
+            echo "No crates affected by this diff — nothing to test."
+            exit 0
+          fi
+
           # #1444: perry-runtime's tests share process-global state — the
           # per-thread arena/GC, the timer queues, and the `NOTIFIED` flag are
           # process singletons. Running them across the default test-harness
           # thread pool lets one test's `js_notify_main_thread` / timer
           # scheduling perturb another's wait budget (the event_pump timing
           # flakes) and races the GC/threading tests into intermittent SIGSEGV.
           # Run perry-runtime single-threaded so the tests can't interfere.
-          RUST_TEST_THREADS=1 cargo test -p perry-runtime
-          # `cargo test` only builds lib/bin/test targets — NOT the `staticlib`
-          # crate-type — so libperry_runtime.a / libperry_stdlib.a are never
-          # produced by the steps above; they only exist if restored from the
-          # cache. A PR that touches perry-runtime/perry-stdlib invalidates the
-          # cached staticlib and cargo never rebuilds it, so integration tests
-          # that compile with PERRY_NO_AUTO_OPTIMIZE=1 (e.g.
-          # functional_batch2_regressions, which link the prebuilt archive
-          # directly) fail with "Could not find libperry_runtime.a". Build the
-          # staticlibs explicitly so those tests are deterministic regardless of
-          # cache state.
-          cargo build -p perry-runtime -p perry-stdlib
-          find target/debug/deps -maxdepth 1 -type f -perm -111 ! -name '*.so' -delete
-          # The remaining workspace includes large `perry` / `perry-stdlib`
-          # test binaries. Keep Cargo build jobs serialized so the runner
-          # does not link several of those large test binaries at once, then
-          # run packages one at a time and prune linked test executables so
-          # target/debug/deps does not exhaust the runner disk mid-job.
-          export CARGO_BUILD_JOBS=1
-          workspace_packages="$(
-            cargo metadata --no-deps --format-version 1 | python3 -c '
-          import json
-          import sys
-
-          excluded = {
-              "perry-runtime",
-              "perry-ui-macos",
-              "perry-ui-ios",
-              "perry-ui-visionos",
-              "perry-ui-tvos",
-              "perry-ui-watchos",
-              "perry-ui-gtk4",
-              "perry-ui-android",
-              "perry-ui-windows",
-              "perry-ui-windows-winui",
-              "perry-doc-fixture-my-bindings",
-          }
-          metadata = json.load(sys.stdin)
-          workspace_members = set(metadata["workspace_members"])
-          for package in metadata["packages"]:
-              if package["id"] in workspace_members and package["name"] not in excluded:
-                  print(package["name"])
-          '
-          )"
-
-          for package in $workspace_packages; do
-            echo "::group::cargo test -p $package"
-            cargo test -p "$package"
-            echo "::endgroup::"
-            cargo clean -p "$package" || true
+
+          if [ "${{ github.event_name }}" != "pull_request" ]; then
+            # ---- FULL run: release tags / nightly cron / workflow_dispatch ----
+            # Every target, including the slow auto-optimize integration tests.
+            if printf '%s\n' "$scope" | grep -qx 'perry-runtime'; then
+              RUST_TEST_THREADS=1 cargo test -p perry-runtime
+            fi
+            # `cargo test` only builds lib/bin/test targets — NOT the `staticlib`
+            # crate-type — so libperry_runtime.a / libperry_stdlib.a are never
+            # produced by the steps above; they only exist if restored from the
+            # cache. Integration tests that compile with PERRY_NO_AUTO_OPTIMIZE=1
+            # (e.g. functional_batch2_regressions) link the prebuilt archive
+            # directly and fail with "Could not find libperry_runtime.a" if the
+            # cached staticlib was invalidated. Build them explicitly.
+            if printf '%s\n' "$scope" | grep -qE '^(perry|perry-stdlib)$'; then
+              cargo build -p perry-runtime -p perry-stdlib
+            fi
             find target/debug/deps -maxdepth 1 -type f -perm -111 ! -name '*.so' -delete
-          done
+            # Large perry / perry-stdlib integration-test binaries: serialize
+            # builds and prune linked executables between packages so the runner
+            # disk doesn't exhaust mid-job.
+            export CARGO_BUILD_JOBS=1
+            for package in $(printf '%s\n' "$scope" | grep -vx 'perry-runtime'); do
+              echo "::group::cargo test -p $package"
+              cargo test -p "$package"
+              echo "::endgroup::"
+              cargo clean -p "$package" || true
+              find target/debug/deps -maxdepth 1 -type f -perm -111 ! -name '*.so' -delete
+            done
+          else
+            # ---- FAST per-PR run (<10 min target) ----
+            # Unit / lib / bin tests for the affected crates only. The slow
+            # auto-optimize *integration* tests (tests/*.rs — each shells out to
+            # `perry compile`, ~4–6 min apiece, ~30 of them) are NOT run per-PR;
+            # they run in the nightly full job, on release tags, and on demand
+            # via the `run-extended-tests` label. Unit-test binaries are small,
+            # so builds parallelize safely (no serialization / clean churn) and
+            # there is no staticlib dependency to build.
+            #
+            # perry-runtime first, single-threaded (process-global state); it is
+            # a lib-only crate, so filter to --lib.
+            if printf '%s\n' "$scope" | grep -qx 'perry-runtime'; then
+              RUST_TEST_THREADS=1 cargo test --lib -p perry-runtime
+            fi
+            rest="$(printf '%s\n' "$scope" | grep -vx 'perry-runtime' || true)"
+            if [ -n "$rest" ]; then
+              # One parallel invocation across the rest of the scope. `cargo test
+              # --lib` errors if NO selected package has a library (a perry-only
+              # diff → just the bin-only `perry` crate), so include --lib only
+              # when some selected crate has one; --bins is always safe.
+              pkg_args="$(printf '%s\n' "$rest" | sed 's/^/-p /' | tr '\n' ' ')"
+              if printf '%s\n' "$rest" | python3 scripts/ci_test_scope.py --has-lib; then
+                target_filter="--lib --bins"
+              else
+                target_filter="--bins"
+              fi
+              echo "Running: cargo test $target_filter $pkg_args"
+              cargo test $target_filter $pkg_args
+            fi
+          fi
 
   # ---------------------------------------------------------------------------
   # GC write-barrier stress (optional / non-blocking)

diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -1,4 +1,37 @@
-## v0.5.1184 — fix: unblock main CI — dead-stripped symbol + cargo-test staticlib + oversized link/mod.rs
+## v0.5.1185 — perf(ci): make per-PR cargo-test fast (<10 min) — unit tests for affected crates; integration tests move to nightly/tags
+
+The `cargo-test` gate took ~90 min: it built the **entire workspace** in debug,
+serially (`CARGO_BUILD_JOBS=1` + `cargo clean` between packages, a 14 GB-disk
+workaround), and 30/40 perry **integration** test files (`tests/*.rs`) each shell
+out to `perry compile` on the **auto-optimize** path — a whole-program optimized
+rebuild, ~4–6 min apiece, and concurrent auto-opt builds thrash a shared target
+so they can't be parallelized. That made the gate roughly worthless on every PR.
+
+Per-PR `cargo-test` is now two things at once — **scoped** and **unit-only**:
+
+1. **Scoped to the diff** (`scripts/ci_test_scope.py`): test only each changed
+   `crates/<dir>` plus its **reverse-dependency closure** (a foundational-crate
+   change still fans out). Runtime-linked crates (`perry-stdlib`, `perry-ffi`,
+   `perry-ext-*`) add a `perry` edge (the driver links those archives at runtime,
+   not via cargo). Infra changes (`.github/`, `scripts/`, `rust-toolchain*`) or
+   any unrecognized path → full; metadata-only changes (`CHANGELOG.md`,
+   `CLAUDE.md`, `*.md`, `docs/`, root `Cargo.toml`/`Cargo.lock`) → nothing (a
+   version-bump PR is instantly green).
+2. **Unit / lib / bin tests only** (`cargo test --lib --bins`): the slow
+   auto-optimize integration tests are **not** run per-PR. Unit-test binaries are
+   small, so builds parallelize safely (no serialization / clean churn) and there
+   is no staticlib to build. This is the part that bounds the per-PR wall-clock.
+
+The **full** suite — including every integration test — runs on **release tags**,
+a new **nightly `schedule`** (04:00 UTC), `workflow_dispatch`, and any PR labeled
+`run-extended-tests`. Release tags gate publishing, so nothing ships untested;
+the nightly run is the cross-crate / integration regression backstop (main pushes
+don't trigger Tests today). `test.yml` branches the cargo-test step on
+`github.event_name`: `pull_request` → fast path; everything else → full.
+
+Trade-off (chosen deliberately, prioritizing a usable per-PR gate): a regression
+only an integration test would catch lands on a PR and is caught by the nightly /
+release-tag full run rather than at PR time.
 
 Three pre-existing CI fragilities on `main` were turning required checks red for
 PRs. Fixed together since all are "make main's CI green again."

diff --git a/CLAUDE.md b/CLAUDE.md
@@ -8,7 +8,7 @@ This file provides guidance to Claude Code (claude.ai/code) when working with co
 
 Perry is a native TypeScript compiler written in Rust that compiles TypeScript source code directly to native executables. It uses SWC for TypeScript parsing and LLVM for code generation.
 
-**Current Version:** 0.5.1184
+**Current Version:** 0.5.1185
 
 
 ## TypeScript Parity Status