tests_v2: expand Linux socket-test coverage beyond the current stable baseline

## Summary

Track expansion of Linux `tests_v2` socket coverage beyond the current stable baseline.

## Current state as of 2026-04-21

- `scripts/run-socket-tests.sh` now uses an explicit `BASELINE` allowlist instead of the old blacklist model
- the stable baseline currently contains **15** tests; failures here fail the self-hosted Linux job
- `CMUX_TEST_PHASE1=1` now enables **14** additional candidate tests as non-fatal observations
- `.github/workflows/test-socket.yml` currently enables that candidate gate on the self-hosted Linux runner
- `tests_v2/` currently has 123 test files on disk, but most remain outside the Linux baseline because they still depend on broader browser, debug-helper, CLI, or remote/fleet parity

## What this issue now covers

1. keep the Linux baseline small, explicit, and trustworthy
2. observe candidate results on real CI and promote consistently green tests into `BASELINE`
3. land the next highest-yield Linux command coverage without pretending the entire suite is ready
4. keep runner-selection mechanics separate from larger parity families

## Current promotion lane

The current phase-1 candidate set already includes:

- `surface.action.*` coverage slices
- `workspace.action`
- `system_tree`
- auth and notification helper coverage
- `pane.resize`
- the current sprint-B parity bundle in `test_sprint_b_core_parity`

The immediate next move is not another runner rewrite. It is recording candidate behavior and promoting the stable subset.

## Separate follow-on lanes

These remain parallel or follow-on work and should not be hidden inside runner wording:

- `#206` Linux WebAuthn bridge completion
- broader browser automation parity
- command-palette and debug-helper parity
- Linux CLI parity beyond the current socket baseline
- remote/fleet transport work under `#201`

## Exit criteria

- at least one full candidate cohort has been observed on CI
- stable candidate tests have been promoted into `BASELINE`
- Linux socket-test coverage is broader than the current 15-test baseline without regressing the green self-hosted lane
- remaining unsupported families are explicitly tracked as parity follow-up, not mislabeled as “already covered by tests”

## References

- `scripts/run-socket-tests.sh`
- `.github/workflows/test-socket.yml`
- `docs/distro-testing-readiness-plan.md`
- `docs/linear-qa-shard-punchlist.md`


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests_v2: expand Linux socket-test coverage beyond the current stable baseline #216

Summary

Current state as of 2026-04-21

What this issue now covers

Current promotion lane

Separate follow-on lanes

Exit criteria

References

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

tests_v2: expand Linux socket-test coverage beyond the current stable baseline #216

Description

Summary

Current state as of 2026-04-21

What this issue now covers

Current promotion lane

Separate follow-on lanes

Exit criteria

References

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions