tests_v2: expand Linux socket-test coverage beyond the current stable baseline #216

@Jesssullivan

Summary

Track expansion of Linux tests_v2 socket coverage beyond the current stable baseline.

Current state as of 2026-04-21

  • scripts/run-socket-tests.sh now uses an explicit BASELINE allowlist instead of the old blacklist model
  • the stable baseline currently contains 15 tests; failures here fail the self-hosted Linux job
  • CMUX_TEST_PHASE1=1 now enables 14 additional candidate tests as non-fatal observations
  • .github/workflows/test-socket.yml currently enables that candidate gate on the self-hosted Linux runner
  • tests_v2/ currently has 123 test files on disk, but most remain outside the Linux baseline because they depend on browser, debug-helper, CLI, or remote/fleet parity that Linux does not yet have
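
The gating described above (fatal baseline, non-fatal candidates behind an environment flag) can be sketched roughly as follows. This is a Python illustration of the logic only, not the contents of scripts/run-socket-tests.sh. The test names, the `PHASE1_CANDIDATES` list, the `run_one` callback, and `run_suite` itself are hypothetical; only the `BASELINE` name and the `CMUX_TEST_PHASE1` variable come from this issue.

```python
from typing import Callable, Dict, List

# Hypothetical test names for illustration; the real lists live in the runner script.
BASELINE: List[str] = ["test_ping", "test_list_workspaces"]
PHASE1_CANDIDATES: List[str] = ["test_pane_resize", "test_system_tree"]


def run_suite(run_one: Callable[[str], bool], env: Dict[str, str]) -> int:
    """Return a process-style exit code: nonzero only if a baseline test fails."""
    baseline_failures = [name for name in BASELINE if not run_one(name)]

    # Candidates run only when the gate is set, and never affect the exit code.
    if env.get("CMUX_TEST_PHASE1") == "1":
        for name in PHASE1_CANDIDATES:
            status = "PASS" if run_one(name) else "FAIL (non-fatal)"
            print(f"candidate {name}: {status}")

    return 1 if baseline_failures else 0
```

Under these assumptions a failing candidate is reported but leaves the exit code at 0, while any baseline failure returns 1 regardless of whether the candidate gate is enabled.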

What this issue now covers

  1. keep the Linux baseline small, explicit, and trustworthy
  2. observe candidate results on real CI and promote consistently green tests into BASELINE
  3. land the next highest-yield Linux command coverage without pretending the entire suite is ready
  4. keep runner-selection mechanics separate from larger parity families

Current promotion lane

The current phase-1 candidate set already includes:

  • surface.action.* coverage slices
  • workspace.action
  • system_tree
  • auth and notification helper coverage
  • pane.resize
  • the current sprint-B parity bundle in test_sprint_b_core_parity

The immediate next move is not another runner rewrite. It is recording candidate behavior and promoting the stable subset.
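
One possible way to turn recorded candidate behavior into a promotion list is sketched below. The issue does not define "consistently green", so the rule used here (a pass in each of the last `min_runs` observed CI runs, defaulting to 5) is an assumption, as are the `promotable` function and the shape of the `history` mapping.

```python
from typing import Dict, List


def promotable(history: Dict[str, List[bool]], min_runs: int = 5) -> List[str]:
    """Return candidates that passed in each of their last `min_runs` observed runs.

    `history` maps a candidate test name to its pass/fail results, oldest first.
    Tests with fewer than `min_runs` observations are never promoted.
    """
    stable = []
    for name, results in sorted(history.items()):
        recent = results[-min_runs:]
        if len(recent) == min_runs and all(recent):
            stable.append(name)
    return stable
```

Requiring a minimum number of observations keeps a test with only a handful of green runs from being promoted before a full cohort has been seen.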

Separate follow-on lanes

These remain parallel or follow-on work and should be tracked on their own rather than folded into runner changes:

  • #206 Linux WebAuthn bridge completion
  • broader browser automation parity
  • command-palette and debug-helper parity
  • Linux CLI parity beyond the current socket baseline
  • remote/fleet transport work under #201

Exit criteria

  • at least one full candidate cohort has been observed on CI
  • stable candidate tests have been promoted into BASELINE
  • Linux socket-test coverage is broader than the current 15-test baseline without regressing the green self-hosted lane
  • remaining unsupported families are explicitly tracked as parity follow-up, not mislabeled as “already covered by tests”
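
The last criterion could be checked mechanically with something like the sketch below, which sorts every test file into exactly one bucket and surfaces anything that is neither run nor tracked. The `classify` function, the bucket names, and the `follow_up` mapping (test name to tracking reference) are hypothetical and not part of the existing runner.

```python
from typing import Dict, Iterable, List, Set


def classify(
    test_files: Iterable[str],
    baseline: Set[str],
    candidates: Set[str],
    follow_up: Dict[str, str],
) -> Dict[str, List[str]]:
    """Place each test in the first matching bucket; leftovers are 'untracked'."""
    report: Dict[str, List[str]] = {
        "baseline": [],
        "candidate": [],
        "follow_up": [],
        "untracked": [],
    }
    for name in sorted(test_files):
        if name in baseline:
            report["baseline"].append(name)
        elif name in candidates:
            report["candidate"].append(name)
        elif name in follow_up:
            report["follow_up"].append(name)
        else:
            report["untracked"].append(name)
    return report
```

A non-empty `untracked` bucket would indicate tests that are neither run on Linux nor attributed to a parity follow-up.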

References

  • scripts/run-socket-tests.sh
  • .github/workflows/test-socket.yml
  • docs/distro-testing-readiness-plan.md
  • docs/linear-qa-shard-punchlist.md
