diff --git a/PLAN.md b/PLAN.md
new file mode 100644
index 00000000..84558f2d
--- /dev/null
+++ b/PLAN.md
@@ -0,0 +1,554 @@
+# Apra Fleet -- OS Service Lifecycle Implementation Plan
+
+> Make apra-fleet behave like a normal OS service: top-level start/stop/restart/status
+> verbs, per-user service registration folded into install/uninstall, and cross-platform
+> support for Windows (schtasks), Linux (systemd --user), and macOS (launchd LaunchAgent)
+> -- all without elevation or admin rights. Extends PR #273 on feat/mcp-sse-transport.
+
+---
+
+## Design Summary
+
+The implementation adds three new layers to the existing codebase:
+
+1. **Service Manager Adapter** (`src/services/service-manager/`) -- a single TypeScript
+   interface (`ServiceManager`) with three OS-specific implementations (Windows schtasks,
+   Linux systemd, macOS launchd). The factory selects the right adapter at runtime via
+   `process.platform`. All adapters operate in per-user scope with no elevation.
+
+2. **Graceful Shutdown Endpoint** -- a `POST /shutdown` handler on the existing HTTP
+   server (localhost-only, same trust boundary as `/mcp`). This enables cross-platform
+   graceful stop without relying on OS signal semantics (Windows cannot send SIGTERM to
+   external processes).
+
+3. **CLI Verbs** (`src/cli/start.ts`, `stop.ts`, `restart.ts`, `status.ts`) -- thin
+   command modules wired into the existing dispatch table in `src/index.ts`. Each verb
+   is idempotent. `start` goes through the service manager when a unit is installed,
+   otherwise spawns the process directly. `stop` always uses HTTP POST /shutdown directly
+   -- it is service-agnostic (stopping the process is the same whether a service unit
+   exists or not) and never routes through the adapter. `status` queries both
+   server.json/health and the service manager.
+
+**Stop call path (unified):** The CLI `stop` verb, and all internal stop flows
+(adapter unregister, uninstall cleanup), use the same mechanism: read server.json for
+the server URL, POST /shutdown to trigger graceful exit (code 0), poll pid for up to 5s,
+fallback force-kill. Because the process exits cleanly (code 0), service managers
+configured with Restart=on-failure (systemd) and KeepAlive.SuccessfulExit=false (launchd)
+will NOT auto-restart. The adapter interface includes a `stop()` method for use within
+`unregister()` and for completeness, but the CLI stop verb bypasses the adapter -- it
+calls the POST /shutdown flow directly since the mechanism is identical regardless of
+whether a service unit is installed.
+
+**Service unit configuration by OS:**
+- **Windows:** Per-user Scheduled Task "ApraFleet" with at-logon trigger, /rl limited
+  (no elevation). A wrapper .bat script in BIN_DIR handles stdout/stderr redirection to
+  the log file (schtasks cannot redirect output natively).
+- **Linux:** systemd user unit at `~/.config/systemd/user/apra-fleet.service`.
+  Type=simple, Restart=on-failure, StandardOutput/StandardError=append:logPath.
+  loginctl enable-linger attempted with a warning on failure.
+- **macOS:** LaunchAgent plist at `~/Library/LaunchAgents/com.apra-fleet.server.plist`.
+  RunAtLoad=true, KeepAlive.SuccessfulExit=false (restart on crash, not on clean exit).
+  StandardOutPath/StandardErrorPath point to the log file.
+
+**Graceful stop mechanism:** The server's existing SIGINT/SIGTERM handlers exit with
+code 0 after cleaning up server.json, lock file, and connections. Service managers
+configured with Restart=on-failure (systemd) and KeepAlive.SuccessfulExit=false (launchd)
+will NOT restart the process after a clean exit. This means the CLI `stop` command
+(which triggers a clean exit via /shutdown) is compatible with managed services.
+
+---
+
+## Verb x OS Matrix
+
+The table below defines the exact behavior for every verb on every OS, both when a
+service unit is installed and when it is not. No "and similarly for X" -- each cell is
+explicit.
+
+### start
+
+| OS      | Service Installed                                          | No Service Installed                                    |
+|---------|------------------------------------------------------------|---------------------------------------------------------|
+| Windows | `schtasks /run /tn "ApraFleet"`                            | Spawn detached: `apra-fleet.exe --transport http`       |
+| Linux   | `systemctl --user start apra-fleet`                        | Spawn detached: `apra-fleet --transport http`           |
+| macOS   | `launchctl kickstart gui/<uid>/com.apra-fleet.server`      | Spawn detached: `apra-fleet --transport http`           |
+
+All: Idempotent -- checkRunningInstance() first; if already running, report and exit 0.
+When spawning directly, stdout/stderr redirect to ~/.apra-fleet/data/fleet.log.
+
+### stop
+
+| OS      | Behavior                                                                              |
+|---------|---------------------------------------------------------------------------------------|
+| Windows | Read server.json -> POST /shutdown -> wait up to 5s for exit -> fallback taskkill /F  |
+| Linux   | Read server.json -> POST /shutdown -> wait up to 5s for exit -> fallback kill -TERM   |
+| macOS   | Read server.json -> POST /shutdown -> wait up to 5s for exit -> fallback kill -TERM   |
+
+All: Idempotent -- if not running (server.json missing or pid dead), report and exit 0.
+Clean up stale server.json and lock file if found. The HTTP /shutdown approach is used
+on all OSes for consistency; service managers do not restart because the process exits 0.
+
+### restart
+
+| OS      | Behavior              |
+|---------|-----------------------|
+| Windows | stop (above) then start (above) |
+| Linux   | stop (above) then start (above) |
+| macOS   | stop (above) then start (above) |
+
+### status
+
+| OS      | Behavior                                                                              |
+|---------|---------------------------------------------------------------------------------------|
+| Windows | server.json + GET /health + `schtasks /query /tn "ApraFleet" /fo csv /nh`             |
+| Linux   | server.json + GET /health + `systemctl --user is-active` + `is-enabled`               |
+| macOS   | server.json + GET /health + `launchctl print gui/<uid>/com.apra-fleet.server`         |
+
+All: Works whether or not service unit is installed. Reports: running/stopped, pid, port,
+url, version, uptime, active sessions, service unit state (installed/not, enabled/not).
+
+### install (extended)
+
+| OS      | Additional Steps (after existing install)                                             |
+|---------|---------------------------------------------------------------------------------------|
+| Windows | Write wrapper.bat to BIN_DIR. `schtasks /create /tn "ApraFleet" /tr "<wrapper>" /sc onlogon /rl limited /f`. `schtasks /run /tn "ApraFleet"`. |
+| Linux   | Write unit file to ~/.config/systemd/user/apra-fleet.service. `systemctl --user daemon-reload`. `systemctl --user enable apra-fleet`. `systemctl --user start apra-fleet`. Attempt `loginctl enable-linger $USER` (warn on failure). |
+| macOS   | Write plist to ~/Library/LaunchAgents/com.apra-fleet.server.plist. `launchctl bootout gui/<uid>/com.apra-fleet.server` (tolerate "not loaded" error). Then `launchctl bootstrap gui/<uid> <plist>`. |
+
+All: Only when --transport http (default). Skipped for --transport stdio. Skipped in
+dev mode (non-SEA). Server is running immediately after install.
+
+### uninstall (extended)
+
+| OS      | Additional Steps (before existing uninstall)                                          |
+|---------|---------------------------------------------------------------------------------------|
+| Windows | POST /shutdown (graceful stop). `schtasks /delete /tn "ApraFleet" /f`. Remove wrapper.bat. |
+| Linux   | `systemctl --user stop apra-fleet`. `systemctl --user disable apra-fleet`. Remove unit file. `systemctl --user daemon-reload`. |
+| macOS   | `launchctl bootout gui/<uid>/com.apra-fleet.server`. Remove plist file.               |
+
+All: Idempotent -- each step tolerates "not found" errors. Replaces the existing
+isApraFleetRunning/killApraFleet approach with graceful /shutdown + service cleanup.
+
+---
+
+## Tasks
+
+### Phase 1: Platform Service Foundation
+
+Front-loads the two riskiest assumptions: (a) per-user service management without
+elevation on all three OSes, (b) cross-platform graceful stop. If schtasks/systemctl/
+launchctl cannot be called without elevation, this phase fails immediately -- before
+any CLI verb or install integration work is done.
+
+#### Task 1: Shutdown endpoint and service constants
+
+- **Change:** Add a POST /shutdown endpoint to the HTTP server in http-transport.ts.
+  When hit, send a 200 JSON response (`{ "status": "shutting-down" }`), then trigger
+  graceful shutdown after a 100ms delay by emitting the process SIGINT event (which
+  fires the existing shutdown handler chain in index.ts). Add LOG_FILE_PATH constant
+  to paths.ts (`~/.apra-fleet/data/fleet.log`). Create the service-manager types file
+  with service name constants: WINDOWS_TASK_NAME="ApraFleet",
+  LINUX_UNIT_NAME="apra-fleet.service",
+  MACOS_PLIST_LABEL="com.apra-fleet.server".
+- **Files:** src/services/http-transport.ts, src/paths.ts,
+  src/services/service-manager/types.ts (new)
+- **Tier:** cheap
+- **Done when:** POST to /shutdown triggers clean server shutdown (server.json deleted,
+  lock released, process exits 0). LOG_FILE_PATH and service name constants exported.
+- **Blockers:** None -- builds on existing HTTP handler infrastructure.
+
+#### Task 2: ServiceManager interface and factory
+
+- **Change:** Define the ServiceManager interface with methods: register(binaryPath,
+  args, logPath), unregister(), start(), stop(), query() returning ServiceStatus,
+  isInstalled() returning boolean. ServiceStatus includes fields: installed, running,
+  pid (optional), enabled (optional). Create a factory function getServiceManager()
+  that returns the correct adapter based on process.platform ('win32' -> Windows,
+  'linux' -> Linux, 'darwin' -> macOS). For unsupported platforms, return a no-op stub
+  that logs a warning and returns safe defaults (installed=false, running=false).
+- **Files:** src/services/service-manager/types.ts (extend),
+  src/services/service-manager/index.ts (new)
+- **Tier:** standard
+- **Done when:** Interface compiles. Factory returns per-platform implementation. Stub
+  adapter returns safe defaults without throwing on unsupported platforms.
+- **Blockers:** None.
+
+#### Task 3: Windows Scheduled Task adapter
+
+- **Change:** Implement WindowsServiceManager class.
+  - register(binaryPath, args, logPath): Write a wrapper batch script to BIN_DIR
+    (`apra-fleet-service.bat`) that runs the binary with args and redirects
+    stdout/stderr to logPath. Create a per-user Scheduled Task via
+    `schtasks /create /tn "ApraFleet" /tr "<wrapper-path>" /sc onlogon /rl limited /f`.
+    No elevation required for per-user tasks.
+  - unregister(): `schtasks /delete /tn "ApraFleet" /f`. Remove wrapper script.
+    Tolerate "task not found" error.
+  - start(): `schtasks /run /tn "ApraFleet"`.
+  - stop(): Read server.json for URL. POST /shutdown. Wait up to 5s for process exit
+    (poll pid). Fallback: `taskkill /F /PID <pid>`.
+  - query(): Parse `schtasks /query /tn "ApraFleet" /fo csv /nh` output. Extract
+    status (Running/Ready/Disabled) and combine with server.json data.
+  - isInstalled(): Run `schtasks /query /tn "ApraFleet"` -- success means installed.
+- **Files:** src/services/service-manager/windows.ts (new)
+- **Tier:** standard
+- **Done when:** All methods implemented. No UAC prompt triggered. Commands use
+  child_process.execFile (not shell) where possible for safety.
+- **Blockers:** "Log on as a batch job" right -- may be restricted on domain-joined
+  machines. See risk register.
+
+#### Task 4: Linux systemd user unit adapter
+
+- **Change:** Implement LinuxServiceManager class.
+  - register(binaryPath, args, logPath): Write a systemd user unit file to
+    `~/.config/systemd/user/apra-fleet.service` with [Unit] Description, [Service]
+    Type=simple, ExecStart=<binaryPath> <args>, Restart=on-failure,
+    StandardOutput=append:<logPath>, StandardError=append:<logPath>, [Install]
+    WantedBy=default.target. Run `systemctl --user daemon-reload` then
+    `systemctl --user enable apra-fleet`. Attempt `loginctl enable-linger $USER` and
+    warn (not error) if it fails.
+  - unregister(): First stop the server via POST /shutdown (same as stop() above).
+    Then `systemctl --user disable apra-fleet`,
+    `systemctl --user stop apra-fleet` (tolerate not-running -- informs systemd the
+    unit is being removed), remove unit file, `systemctl --user daemon-reload`.
+  - start(): `systemctl --user start apra-fleet`.
+  - stop(): Read server.json for URL. POST /shutdown. Wait up to 5s for process exit
+    (poll pid). Fallback: kill -TERM <pid>. This matches the Windows and macOS adapters
+    and the CLI stop verb for cross-platform consistency. (systemctl --user stop would
+    also work since it sends SIGTERM, but POST /shutdown is preferred so all three
+    adapters share the same contract.) Tolerate not-running.
+  - query(): `systemctl --user is-active apra-fleet` (active/inactive/failed),
+    `systemctl --user is-enabled apra-fleet` (enabled/disabled).
+  - isInstalled(): Check if unit file exists at the expected path.
+  - Non-systemd detection: Before any operation, check for systemd user bus
+    (XDG_RUNTIME_DIR + /run/user/<uid>/systemd). If absent, throw with clear message:
+    "systemd user mode is not available. Service management requires systemd."
+- **Files:** src/services/service-manager/linux.ts (new)
+- **Tier:** standard
+- **Done when:** All methods implemented. Non-systemd systems get a clear, actionable
+  error. loginctl linger is attempted with a non-fatal warning on failure.
+- **Blockers:** loginctl enable-linger may need root. See risk register.
+
+#### Task 5: macOS launchd LaunchAgent adapter
+
+- **Change:** Implement MacOSServiceManager class.
+  - register(binaryPath, args, logPath): Write a plist to
+    `~/Library/LaunchAgents/com.apra-fleet.server.plist` with Label, ProgramArguments
+    (array: [binaryPath, ...args]), RunAtLoad=true, KeepAlive with
+    SuccessfulExit=false, StandardOutPath=logPath, StandardErrorPath=logPath. Before
+    loading, call `launchctl bootout gui/<uid>/com.apra-fleet.server` and tolerate
+    "not loaded" / "no such process" errors -- this makes register() idempotent
+    (launchctl bootstrap fails with "service already loaded" if called twice without
+    bootout). Then load via `launchctl bootstrap gui/<uid> <plist-path>`.
+  - unregister(): `launchctl bootout gui/<uid>/com.apra-fleet.server`. Remove plist.
+    Tolerate "not loaded" error.
+  - start(): `launchctl kickstart gui/<uid>/com.apra-fleet.server`.
+  - stop(): POST /shutdown to server URL from server.json (same as Windows approach --
+    clean exit 0 prevents KeepAlive restart). Wait up to 5s, fallback kill -TERM.
+  - query(): Parse output of `launchctl print gui/<uid>/com.apra-fleet.server` for
+    pid and state. If command fails (not loaded), return installed=false.
+  - isInstalled(): Check plist file exists at expected path.
+  - uid retrieval: Use `id -u` or process.getuid() to get the current user's uid for
+    the gui/<uid> domain specifier.
+- **Files:** src/services/service-manager/macos.ts (new)
+- **Tier:** standard
+- **Done when:** All methods implemented. No elevation required. bootstrap/bootout
+  API used (available since macOS 10.10).
+- **Blockers:** None significant. See risk register for macOS version note.
+
+#### Task 6: Service manager unit tests
+
+- **Change:** Write vitest tests for all three adapters. Use vi.mock to mock
+  child_process.execFile and child_process.execFileSync. For each adapter, test:
+  register (verifies correct command/args), unregister (verifies cleanup commands),
+  start (verifies start command), stop (verifies graceful shutdown attempt), query
+  (mock command output, verify parsed ServiceStatus), isInstalled (mock success/failure).
+  Test edge cases: already registered (idempotent register), not installed (idempotent
+  unregister), process not running (stop is no-op), non-systemd Linux (clear error
+  thrown). Use vi.hoisted for mock definitions per existing test conventions.
+- **Files:** tests/service-manager.test.ts (new)
+- **Tier:** standard
+- **Done when:** Tests cover all adapter methods and key error paths. All pass.
+  Existing test suite (npm test) stays fully green.
+- **Blockers:** None -- tests mock OS commands, no real services created.
+
+#### Task 6.5: MCP session capability logging (apra-fleet-projects-78g)
+
+- **Change:** In http-transport.ts, extract clientInfo and capabilities from the
+  initialize request body (parsedBody is already in scope in the isInitializeRequest
+  block). After sessions.set() in onsessioninitialized, call logLine to record:
+  session ID, client name, client version, client capability keys, and whether
+  experimental['claude/channel'] was declared. Import logLine from
+  utils/log-helpers.js (already used in the file). Example format:
+  `logLine('session', 'new sid=<sid> client=<name>/<version> caps=<list> channel=true')`.
+  Store clientInfo on a local variable captured in the initialize block closure so it
+  is available in onsessioninitialized. Also log session close with the same sid so
+  sessions can be correlated in logs.
+- **Files:** src/services/http-transport.ts
+- **Tier:** cheap
+- **Done when:** Each new MCP session logs client name, version, capabilities, and
+  channel flag. Session close is logged. Existing tests pass. ASCII-only.
+- **Blockers:** None.
+
+#### VERIFY: Platform Service Foundation
+- Run full test suite (npm test)
+- Confirm all Phase 1 changes compile cleanly
+- Confirm no regressions in existing tests
+- Report: tests passing, adapter coverage, any issues
+
+---
+
+### Phase 2: CLI Verbs
+
+Build the four new top-level commands. Each is a thin module in src/cli/ wired into
+the dispatch table in src/index.ts.
+
+#### Task 7: start and stop commands
+
+- **Change:** Create src/cli/start.ts with exported runStart(args). Logic:
+  (1) checkRunningInstance() -- if running, log "Server already running at <url>
+  pid=<pid>" and exit 0 (idempotent). (2) Get service manager via getServiceManager().
+  If service is installed, call serviceManager.start(). (3) If no service installed,
+  spawn the binary in detached mode with stdout/stderr redirected to LOG_FILE_PATH.
+  **Binary path resolution:** In SEA mode, the binary is at the stable installed path
+  (`BIN_DIR + 'apra-fleet'` or `'apra-fleet.exe'` from src/cli/config.ts). In dev mode
+  (non-SEA), the command is `process.execPath` (the Node.js binary) with args
+  `[path.join(findProjectRoot(), 'dist', 'index.js'), '--transport', 'http']` -- using
+  the same `findProjectRoot()` function from src/cli/install.ts that walks up from
+  __dirname looking for version.json. Import `findProjectRoot` from install.ts (it is
+  already exported) or extract it to a shared util. Both modes append
+  `['--transport', 'http']` to the args. Wait 2s then verify server started via
+  checkRunningInstance. Report success or failure.
+  Create src/cli/stop.ts with exported runStop(args). Logic: (1) checkRunningInstance()
+  -- if not running, log "Server is not running." and exit 0 (idempotent). (2) Read URL
+  from server.json. POST /shutdown to the URL. The stop verb does NOT go through the
+  service manager adapter -- stopping the running process is service-agnostic. (3) Poll
+  pid alive every 500ms for up to 5s. (4) If process still alive after timeout (or if
+  /shutdown returned an error -- e.g. the running binary predates the /shutdown endpoint),
+  force kill: process.kill(pid, 'SIGTERM') on Unix, taskkill /F /PID on Windows.
+  (5) Clean up stale server.json and lock file. Report "Server stopped."
+  Note on version skew: if an older binary without the /shutdown endpoint is running, the
+  POST will fail (404 or connection error). The fallback force-kill path handles this
+  correctly -- the 5s poll detects the process is still alive and proceeds to kill it.
+  Wire both commands into src/index.ts dispatch: `arg === 'start'` and `arg === 'stop'`
+  with dynamic imports, same pattern as existing install/uninstall/secret/auth dispatch.
+- **Files:** src/cli/start.ts (new), src/cli/stop.ts (new), src/index.ts
+- **Tier:** cheap
+- **Done when:** `apra-fleet start` starts the server (or reports already running).
+  `apra-fleet stop` stops the server gracefully (or reports not running). Both are
+  idempotent with exit code 0.
+- **Blockers:** Depends on Phase 1 (service manager, /shutdown endpoint).
+
+#### Task 8: restart command
+
+- **Change:** Create src/cli/restart.ts with exported runRestart(args). Import and call
+  runStop(args) then runStart(args). Wire into src/index.ts dispatch table.
+- **Files:** src/cli/restart.ts (new), src/index.ts
+- **Tier:** cheap
+- **Done when:** `apra-fleet restart` stops then starts the server. Works whether or not
+  the server was running (stop is idempotent).
+- **Blockers:** Depends on T7.
+
+#### Task 9: status command
+
+- **Change:** Create src/cli/status.ts with exported runStatus(args). Logic:
+  (1) Read server.json -- if present and pid alive, GET /health to obtain version,
+  uptime, sessions, port, url. (2) Query service manager via getServiceManager().query()
+  for unit state (installed, enabled, running from service perspective). (3) Format
+  output:
+  ```
+  apra-fleet status
+    State:    running | stopped
+    PID:      <pid>
+    Port:     <port>
+    URL:      <url>
+    Version:  <version>
+    Uptime:   <Xh Ym Zs>
+    Sessions: <count>
+    Service:  installed (enabled) | installed (disabled) | not installed
+  ```
+  If server is not running, show "State: stopped" and omit pid/port/url/uptime/sessions.
+  Service line always shown regardless of server state.
+  Wire into src/index.ts dispatch table.
+- **Files:** src/cli/status.ts (new), src/index.ts
+- **Tier:** standard
+- **Done when:** `apra-fleet status` shows all required fields. Works correctly whether
+  server is running or not, and whether service unit is installed or not.
+- **Blockers:** Depends on Phase 1 (service manager query).
+
+#### Task 10: CLI verb tests and --help update
+
+- **Change:** Update the --help output in src/index.ts to include the four new verbs:
+  ```
+  apra-fleet start                    Start the fleet server
+  apra-fleet stop                     Stop the fleet server
+  apra-fleet restart                  Restart the fleet server
+  apra-fleet status                   Show server and service status
+  ```
+  Write tests in tests/cli-verbs.test.ts covering: start when already running (idempotent),
+  start when not running (spawns process or uses service manager), stop when running
+  (sends /shutdown), stop when not running (idempotent), restart (stop then start),
+  status with running server (full output), status with stopped server (partial output),
+  status with/without service installed. Mock checkRunningInstance, HTTP calls, service
+  manager, and child_process.spawn.
+- **Files:** src/index.ts, tests/cli-verbs.test.ts (new)
+- **Tier:** standard
+- **Done when:** --help lists all verbs. Tests cover all verb logic and edge cases. All
+  tests pass. Pre-commit ASCII hook passes.
+- **Blockers:** None.
+
+#### VERIFY: CLI Verbs
+- Run full test suite (npm test)
+- Verify `apra-fleet --help` includes all new verbs
+- Confirm no regressions in existing tests
+- Report: tests passing, verb behavior verified
+
+---
+
+### Phase 3: Install/Uninstall Integration
+
+Wire the service manager adapter into the existing install and uninstall commands. The
+existing install steps (binary, hooks, scripts, settings, MCP, skills) are unchanged;
+service registration is additive. For uninstall, service removal is prepended.
+
+#### Task 11: Extend install to register and start service
+
+- **Change:** In src/cli/install.ts, after the existing final step (Beads tracker
+  install + permissions + install-config.json), add a new step:
+  (1) If transport === 'http' and isSea() (installed binary exists), call
+  serviceManager.register(binaryPath, ['--transport', 'http'], LOG_FILE_PATH) then
+  serviceManager.start(). (2) If transport === 'stdio', skip service registration (stdio
+  transport is per-client, not a persistent service). (3) In dev mode (!isSea()), skip
+  service registration but optionally start the server directly.
+  Update the install output to include a "Service: registered and running" line.
+  Update totalSteps calculation to include the new step.
+- **Files:** src/cli/install.ts
+- **Tier:** standard
+- **Done when:** `apra-fleet install` registers the per-user service and the server is
+  running immediately afterward. A fresh MCP client connects without any manual step.
+  The existing install behavior (binary, hooks, settings, MCP, skills) is unchanged.
+- **Blockers:** Service manager adapter must be complete (Phase 1).
+
+#### Task 12: Extend uninstall to stop and remove service
+
+- **Change:** In src/cli/uninstall.ts, before the existing provider cleanup loop, add:
+  (1) If server is running, stop it gracefully via POST /shutdown (replacing the existing
+  isApraFleetRunning/killApraFleet approach -- which does a hard kill -- with the graceful
+  /shutdown endpoint). Wait for exit. (2) Call serviceManager.unregister() to remove the
+  service unit on the current OS. Tolerate "not installed" (idempotent).
+  The existing cleanup steps (settings cleanup, skill removal, binary removal) remain
+  unchanged. The --force flag triggers the graceful /shutdown approach instead of the
+  old killApraFleet hard kill.
+- **Files:** src/cli/uninstall.ts
+- **Tier:** standard
+- **Done when:** `apra-fleet uninstall` stops the server gracefully, removes the service
+  unit, and removes MCP config. No orphaned service units, plist files, or scheduled
+  tasks remain.
+- **Blockers:** Depends on T11 (service registration during install).
+
+#### Task 13: Install/uninstall service integration tests
+
+- **Change:** Add or extend tests covering: (1) install with HTTP transport calls
+  serviceManager.register and start, (2) install with stdio transport skips service
+  registration, (3) install in dev mode skips service registration, (4) uninstall calls
+  graceful /shutdown and serviceManager.unregister, (5) uninstall with no service
+  installed is idempotent (unregister tolerates "not found"), (6) uninstall with server
+  not running skips /shutdown (idempotent). Mock service manager and HTTP calls.
+- **Files:** tests/install.test.ts (extend or new), tests/uninstall.test.ts (new)
+- **Tier:** standard
+- **Done when:** Tests verify service lifecycle during install/uninstall. Existing
+  install tests remain unchanged and passing.
+- **Blockers:** None.
+
+#### VERIFY: Install/Uninstall Integration
+- Run full test suite (npm test)
+- Confirm install registers service and server starts
+- Confirm uninstall removes service cleanly with no orphans
+- Report: tests passing, no regressions
+
+---
+
+### Phase 4: Documentation
+
+#### Task 14: Update README with service model and verbs
+
+- **Change:** Add a "Service Management" section to README.md documenting:
+  - The four new verbs: start, stop, restart, status (with usage examples)
+  - Automatic service registration during install (per-user, no elevation)
+  - Per-OS mechanisms at a glance (schtasks, systemd, launchd)
+  - Log file location (~/.apra-fleet/data/fleet.log)
+  - Troubleshooting: how to check logs, restart after issues, verify service state
+  Update the existing command reference table to include the new verbs. Update the
+  install/uninstall sections to mention service registration/removal.
+- **Files:** README.md
+- **Tier:** cheap
+- **Done when:** README documents all service verbs and behavior. ASCII-only.
+- **Blockers:** None.
+
+#### Task 15: Update architecture docs with service manager adapter
+
+- **Change:** Add a "Service Manager" section to docs/architecture.md documenting:
+  - The adapter pattern: ServiceManager interface + per-OS implementations
+  - How install/uninstall interact with the service manager
+  - The /shutdown endpoint and why it exists (cross-platform graceful stop)
+  - The verb -> adapter -> OS command flow
+  - How the singleton lifecycle interacts with service management (startup lock,
+    server.json, clean exit preventing auto-restart)
+  Update the existing "Singleton lifecycle" paragraph to reference service management.
+  Update the ASCII diagram to show the service manager layer.
+- **Files:** docs/architecture.md
+- **Tier:** cheap
+- **Done when:** Architecture docs explain the service manager design. ASCII-only.
+- **Blockers:** None.
+
+#### VERIFY: Documentation
+- Confirm ASCII-only in all doc files (pre-commit hook)
+- Confirm docs accurately reflect the planned implementation
+- Report: docs updated, hook passes
+
+---
+
+## Risk Register
+
+| Risk | Impact | Mitigation |
+|------|--------|------------|
+| Windows schtasks /end is TerminateProcess (hard kill, not SIGTERM) | High | Never use schtasks /end for graceful stop. Always use HTTP /shutdown endpoint. Force kill via taskkill only as last-resort fallback. |
+| loginctl enable-linger may require root on some Linux distros | Medium | Attempt and warn (non-fatal). Server starts on login but may not persist across reboots without an active session on those systems. Document the manual sudo command in README. |
+| Non-systemd Linux (Alpine, older distros, containers, WSL1) | Medium | Detect systemd absence at register time. Return clear, actionable error. start/stop/status CLI verbs still work via direct process management and HTTP /shutdown -- only automatic service registration is unavailable. |
+| Windows "Log on as a batch job" right restricted (domain-joined) | Medium | Detect schtasks /create failure. Provide actionable error naming the specific right. start/stop/status still work via direct process management. |
+| launchctl API differences across macOS versions | Low | Use bootstrap/bootout/kickstart API (available since macOS 10.10 Yosemite, 2014). All currently-supported macOS versions have this API. |
+| Binary path changes after update break service unit | Medium | install command re-registers the service unit (updates binary path). update command calls install --force, which also re-registers. Document this interaction. |
+| Backward compat: existing install/uninstall behavior changes | Medium | Service registration is purely additive -- all existing install steps unchanged. Service removal is prepended to uninstall. All existing tests must pass. |
+| Concurrent start race (two starts at the same time) | Low | Existing claimStartupLock prevents double-start. The binary exits 0 when another instance is running (checkRunningInstance). No change needed. |
+| /shutdown endpoint security | Low | Localhost-only binding (127.0.0.1). Same trust boundary as the /mcp endpoint, which has full tool access. No auth token needed (parity with existing MCP surface). |
+| Stale server.json after crash or kill -9 | Low | Existing checkRunningInstance validates pid + /health and cleans up stale files. No change needed. |
+
+---
+
+## Deferred Items
+
+Items explicitly out of scope for this sprint but tracked for follow-up:
+
+- **Log rotation:** The fleet service log (~/.apra-fleet/data/fleet.log) will grow
+  unboundedly. A future task should add log rotation -- either a size-based rotation at
+  server startup (rename fleet.log to fleet.log.1, cap at N files) or integration with
+  OS-native log rotation (logrotate on Linux, newsyslog on macOS). For this sprint, the
+  log file is append-only with no rotation.
+- **TLS / auth on HTTP endpoint:** The /shutdown and /mcp endpoints are localhost-only
+  (127.0.0.1) and share the same trust boundary. No authentication is added in this
+  sprint. A follow-up could add a bearer token if the threat model changes.
+
+---
+
+## Notes
+
+- Each task should result in a git commit
+- Verify tasks are checkpoints -- stop and report after each one
+- Base branch: feat/mcp-sse-transport (extends PR #273 -- no new branch)
+- Implementation branch: feat/mcp-sse-transport (commit directly onto this branch)
+- Service name constants:
+  - Windows task: "ApraFleet"
+  - Linux unit: "apra-fleet.service"
+  - macOS label: "com.apra-fleet.server"
+- Log file: ~/.apra-fleet/data/fleet.log
+- /shutdown endpoint: POST http://127.0.0.1:<port>/shutdown (localhost-only)
+- The /shutdown endpoint reuses the existing SIGINT handler chain in index.ts
+- ASCII-only in all committed files (pre-commit hook enforced)
diff --git a/README.md b/README.md
index f8fe4529..c94e8cd8 100644
--- a/README.md
+++ b/README.md
@@ -233,6 +233,105 @@ reviewer  Opus 4.7        final review
 Provider strengths, role recommendations, and gotchas:
 [docs/provider-guide.md](docs/provider-guide.md).
 
+## Transport
+
+Fleet runs as a singleton service on your machine. When you start it, the server
+listens on port 7523 by default and multiple LLM clients (Claude Code, Gemini,
+Copilot, Codex) connect concurrently to the same fleet instance.
+
+### HTTP+SSE Transport (default)
+
+By default, fleet uses the **HTTP+SSE transport** -- clients connect over HTTP and
+receive server-push notifications over Server-Sent Events (SSE).
+
+```bash
+apra-fleet                  # Start HTTP server (default)
+apra-fleet --transport http # Explicitly use HTTP
+```
+
+When the server starts, it writes a `server.json` file to `~/.apra-fleet/` containing:
+```json
+{
+  "pid": 12345,
+  "port": 7523,
+  "url": "http://localhost:7523/mcp",
+  "version": "x.y.z",
+  "startedAt": "2026-05-19T..."
+}
+```
+
+If port 7523 is busy, the server falls back to port 0 (OS-assigned random port) and
+records the actual port in `server.json`. You can override the default port with the
+`APRA_FLEET_PORT` environment variable.
+
+**Multiple clients, one server.** When a second LLM client starts, it reads
+`server.json`, detects the running server, and connects to it. All clients share the
+same fleet instance -- no restart needed. When you close all clients, the server
+keeps running (as a singleton service on your machine). It shuts down on explicit
+exit (`apra-fleet --shutdown` tool) or on system reboot.
+
+**Re-register with HTTP.** When you upgrade or re-install Fleet, run:
+```bash
+apra-fleet install  # Registers fleet with HTTP transport (default)
+```
+
+### Event Bus
+
+The event bus is an internal notification system. When a subsystem (like credential
+storage) completes an operation, it emits an event, and the HTTP server broadcasts
+the notification to all connected clients via SSE. This lets clients respond
+immediately to fleet events without polling.
+
+### Backward Compatibility: stdio Transport
+
+Existing fleets can continue using the stdio transport:
+
+```bash
+apra-fleet --transport stdio # Use legacy stdio transport
+apra-fleet --stdio            # Alias for --transport stdio
+```
+
+When you run `apra-fleet install --transport stdio`, the MCP config keeps the old
+command-based format (no HTTP URL). The server's behavior is identical to pre-HTTP
+versions: it reads JSON-RPC from stdin, writes responses to stdout, and communicates
+with one client at a time via the stdio pipe.
+
+If you want to stay on stdio for now, run:
+```bash
+apra-fleet install --transport stdio
+```
+
+If you later switch back to HTTP, re-run the default install:
+```bash
+apra-fleet install  # Switches to HTTP transport
+```
+
+## Service Mode
+
+Fleet keeps a singleton server running so all your LLM clients share one instance.
+Registering it as an OS service keeps it alive across terminal sessions -- the server
+survives terminal close and restarts automatically on login:
+
+- Windows: a per-user Scheduled Task (Task Scheduler, OnLogon trigger)
+- Linux: a systemd user unit (`systemctl --user`)
+- macOS: a LaunchAgent in `~/Library/LaunchAgents/`
+
+Four verbs manage the lifecycle directly:
+
+```
+apra-fleet start    # start the server (idempotent -- exits cleanly if already running)
+apra-fleet stop     # graceful shutdown: POST /shutdown, poll, force-kill fallback
+apra-fleet restart  # stop then start
+apra-fleet status   # state, PID, port, uptime, version, and OS service status
+```
+
+`install` and `uninstall` include service registration. Running
+`apra-fleet install` on a packaged binary with the HTTP transport (the default)
+registers and starts the OS service automatically -- no extra step.
+`apra-fleet uninstall` stops and deregisters the service before removing files.
+Service registration failures are non-fatal: a warning is printed and the install
+continues.
+
 ## The PM skill
 
 The **PM skill** is Fleet's reference workflow for **software development**
diff --git a/docs/architecture.md b/docs/architecture.md
index 32afcc55..6cd5f223 100644
--- a/docs/architecture.md
+++ b/docs/architecture.md
@@ -1,4 +1,4 @@
-<!-- llm-context: This document explains the internal architecture of apra-fleet — the MCP server, member registry, SSH transport, session management, and how tools are dispatched. Read this when a user asks how fleet works under the hood, or when debugging connectivity or session issues. -->
+<!-- llm-context: This document explains the internal architecture of apra-fleet - the MCP server, member registry, SSH transport, session management, and how tools are dispatched. Read this when a user asks how fleet works under the hood, or when debugging connectivity or session issues. -->
 <!-- keywords: MCP server, member registry, SSH, transport, session, tool dispatch, child process, local member, remote member, architecture -->
 <!-- see-also: ../README.md (getting started), tools-infrastructure.md (tool details), vocabulary.md (terminology) -->
 
@@ -6,20 +6,20 @@
 
 ## Why This Exists
 
-AI coding agents are powerful on a single machine. But real work spans many machines — a dev server, a staging box, a GPU trainer, a production host. Today, if you want Claude Code working across all of them, you SSH in manually, run prompts one at a time, and copy files by hand. There's no single pane of glass.
+AI coding agents are powerful on a single machine. But real work spans many machines - a dev server, a staging box, a GPU trainer, a production host. Today, if you want Claude Code working across all of them, you SSH in manually, run prompts one at a time, and copy files by hand. There's no single pane of glass.
 
-Apra Fleet gives one Claude instance the ability to orchestrate many. Register machines, push files, run prompts, monitor health — all through natural language from your terminal. One master, many members.
+Apra Fleet gives one Claude instance the ability to orchestrate many. Register machines, push files, run prompts, monitor health - all through natural language from your terminal. One master, many members.
 
 ## Conceptual Model
 
 The system has three layers of abstraction:
 
-**Fleet** → **Members** → **Sessions**
+**Fleet** -> **Members** -> **Sessions**
 
-A *fleet* is the collection of all registered machines. A *member* is one machine with a working directory — the unit you talk to. A *session* is a conversation thread on a member — Claude remembers context across prompts within a session, and you can reset it to start fresh.
+A *fleet* is the collection of all registered machines. A *member* is one machine with a working directory - the unit you talk to. A *session* is a conversation thread on a member - Claude remembers context across prompts within a session, and you can reset it to start fresh.
 
 Members come in two flavors:
-- **Remote members** communicate over SSH. They can be any machine you can reach — Linux VMs, macOS servers, Windows boxes.
+- **Remote members** communicate over SSH. They can be any machine you can reach - Linux VMs, macOS servers, Windows boxes.
 - **Local members** run on the same machine as the master, in a different folder. No SSH needed. Useful for isolating work into separate project directories without spinning up another machine.
 
 This distinction is hidden behind a **Strategy pattern**: every tool interacts with members through a uniform interface. The strategy implementation (remote via SSH, or local via child process) is selected at runtime based on member type. Tools never know or care which kind of member they're talking to.
@@ -27,32 +27,32 @@ This distinction is hidden behind a **Strategy pattern**: every tool interacts w
 ## How It Fits Together
 
 ```
-┌────────────────────────────────────────────────────┐
-│  Master Machine                                    │
-│                                                    │
-│  Claude Code CLI ◄──stdio──► Apra Fleet Server      │
-│                               │                    │
-│                    ┌──────────┴──────────┐         │
-│                    │  Member Strategy    │         │
-│                    │  (uniform interface)│         │
-│                    └──┬─────────────┬───┘         │
-│                       │             │              │
-│              Remote Strategy   Local Strategy      │
-│              (ssh2 + sftp)    (child_process + fs) │
-│                       │             │              │
-│                    SSH│        local exec           │
-└───────────────────────┼─────────────┼──────────────┘
-                        │             │
-           ┌────────────┘             └──► /other/project/
-           ▼                               (same machine)
-    ┌──────────────┐
-    │ Remote Member │
-    │ (any OS,      │
-    │  any provider)│
-    └──────────────┘
-```
-
-The MCP server speaks **stdio** — the standard transport for Claude Code MCP servers. Claude sends JSON-RPC tool calls, the server executes them, returns results. No HTTP, no ports to open.
++------------------------------------------------------+
+|  Master Machine                                    |
+|                                                    |
+|  Claude Code CLI <--stdio--> Apra Fleet Server     |
+|                               |                    |
+|                    +---------+---------+           |
+|                    |  Member Strategy    |         |
+|                    |  (uniform interface)|         |
+|                    +--+------------+---+           |
+|                       |             |              |
+|              Remote Strategy   Local Strategy      |
+|              (ssh2 + sftp)    (child_process + fs) |
+|                       |             |              |
+|                    SSH|        local exec           |
++-------------------+------------+--+---------------+
+                    |             |
+           +--------+             +---> /other/project/
+           |                          (same machine)
+    +------+----------+
+    | Remote Member  |
+    | (any OS,       |
+    |  any provider) |
+    +----------------+
+```
+
+The MCP server speaks **stdio** - the standard transport for Claude Code MCP servers. Claude sends JSON-RPC tool calls, the server executes them, returns results. No HTTP, no ports to open.
 
 ## Layers
 
@@ -70,6 +70,161 @@ The codebase follows a strict layering:
 
 Each layer only depends on the layers below it. Tools never import other tools. Services don't know about the MCP protocol.
 
+## Transport Layer
+
+Fleet supports two MCP transports: HTTP+SSE (default) and stdio (legacy).
+
+### HTTP+SSE Transport (Default)
+
+The HTTP transport runs as a **singleton service** on your machine. A single fleet
+server listens on port 7523 and multiple LLM clients connect concurrently. Each
+client gets its own session with a dedicated `McpServer` instance inside the fleet
+process, so tool calls and state are isolated per client.
+
+```
+    Client 1 (Claude Code)          Client 2 (Gemini)
+              |                             |
+              +-------------+---+----------+
+                            |
+                 HTTP + SSE  |
+                             |
+                     +-------+-------+
+                     | Singleton    |
+                     | Fleet Server  |
+                     | (port 7523)   |
+                     +-------+-------+
+                             |
+                 +-----------|----------+
+                 |           |          |
+              McpServer   McpServer  Tool Registry
+              (Session 1) (Session 2) (shared)
+                 |           |
+                 +------+----+
+                        |
+                  Event Bus (notifications)
+```
+
+**Per-session McpServer model:** When a client connects, the fleet creates a new
+`McpServer` instance for that session. This isolates tool call state, session storage,
+and concurrent requests. Multiple clients can call the same tool simultaneously
+without interfering with each other.
+
+**Event bus:** The fleet's internal event bus (`FleetEventMap`) carries notifications
+from subsystems (e.g., `credential:stored` when out-of-band auth completes) to all
+connected clients via SSE `notifications/message`. This is the publish-subscribe
+mechanism for server-initiated events.
+
+**Singleton lifecycle:** The server starts on-demand the first time an LLM client
+connects. Subsequent clients reuse the running server. The server keeps running until
+explicitly shut down (via `shutdown_server` tool, SIGINT, SIGTERM, or system reboot).
+This is intentional - the singleton is a long-lived service, not a per-request
+process. Restarting it has a cost (tool re-registration, SSH connection repool,
+stall detector restart).
+
+**server.json discovery:** When the server starts, it writes `~/.apra-fleet/server.json`
+with `{ pid, port, url, version, startedAt }`. Clients discover the running instance
+by reading this file and verifying the process is alive and the port responds to
+`/health` endpoint. The double-check (process.kill(pid, 0) + HTTP health request)
+detects stale entries and cleans them up.
+
+**Localhost-only binding:** The fleet server binds to `127.0.0.1` only, never
+`0.0.0.0`. This ensures only local processes can connect -- no network exposure.
+
+### Stdio Transport (Legacy)
+
+When `--transport stdio` is used, the fleet runs in the legacy mode: one MCP server
+process per client connection. The server reads JSON-RPC from stdin, writes responses
+to stdout, and terminates when the client disconnects. No HTTP, no singleton, no
+event bus. Tools work identically; the transport layer differs.
+
+### Event Flow Subsystem -> Notification
+
+When an event is emitted on the event bus:
+
+1. **Subsystem** (e.g., `auth-socket.ts`) calls `fleetEvents.emit('credential:stored', { name: ... })`
+2. **Event Bus** (`event-bus.ts`) delivers the event to all registered subscribers
+3. **HTTP Transport** (`http-transport.ts`) receives the event in its subscriber callback
+4. **Per-session McpServer** sends a `notifications/message` to each connected client over SSE
+5. **Client** receives the notification in its SSE stream handler
+
+This is the publish-subscribe pattern: producers emit to the bus, subscribers (the
+HTTP transport) are notified, and the transport broadcasts to all session clients.
+
+## Service Manager
+
+The `ServiceManager` component registers and controls the fleet server as an OS
+background service. It uses an adapter pattern so the CLI verbs (`start`, `stop`,
+`restart`, `status`) and the `install`/`uninstall` commands work identically on every
+platform.
+
+### Interface
+
+`src/services/service-manager/types.ts` defines the contract:
+
+```
+interface ServiceManager {
+  register(binaryPath, args, logPath): Promise<void>
+  unregister(): Promise<void>
+  start(): Promise<void>
+  stop(): Promise<void>
+  query(): Promise<ServiceStatus>
+  isInstalled(): Promise<boolean>
+}
+
+interface ServiceStatus {
+  installed: boolean
+  running: boolean
+  pid?: number
+  enabled?: boolean
+}
+```
+
+Service name constants are also in `types.ts`: `WINDOWS_TASK_NAME`,
+`LINUX_UNIT_NAME`, `MACOS_PLIST_LABEL`.
+
+### Platform Adapters
+
+```
+src/services/service-manager/
+  types.ts    - ServiceManager interface, ServiceStatus, service name constants
+  index.ts    - getServiceManager() factory, gracefulStopByServerJson(), NoopServiceManager
+  windows.ts  - WindowsServiceManager  (schtasks per-user Scheduled Task)
+  linux.ts    - LinuxServiceManager    (systemd --user unit)
+  macos.ts    - MacOSServiceManager    (launchd LaunchAgent plist)
+```
+
+- **WindowsServiceManager**: writes a wrapper `.bat` file and creates a per-user
+  Scheduled Task with an `OnLogon` trigger via `schtasks /create`. `start`, `stop`,
+  and `query` use `schtasks /run`, `/end`, and `/query`.
+- **LinuxServiceManager**: writes a systemd user unit file, then runs `daemon-reload`,
+  `enable`, and `loginctl enable-linger`. `start`, `stop`, and `query` use
+  `systemctl --user`.
+- **MacOSServiceManager**: writes a plist to `~/Library/LaunchAgents/` and bootstraps
+  it with `launchctl bootstrap`. `KeepAlive.SuccessfulExit=false` prevents launchd
+  from restarting on a clean exit. `start`, `stop`, and `query` use `launchctl`.
+
+### Factory
+
+`getServiceManager()` in `index.ts` selects the right adapter at runtime via a
+dynamic `import()` keyed on `process.platform`:
+
+```
+win32   -> WindowsServiceManager
+linux   -> LinuxServiceManager
+darwin  -> MacOSServiceManager
+other   -> NoopServiceManager  (warns once; all methods are safe no-ops)
+```
+
+`NoopServiceManager` ensures the CLI verbs work on unsupported platforms without
+crashing -- they simply have no effect.
+
+### Graceful Stop
+
+`gracefulStopByServerJson()` (exported from `index.ts`) reads
+`~/.apra-fleet/server.json`, POSTs to the `/shutdown` endpoint, then polls the
+process at 500 ms intervals for up to 5 s. If the process does not exit in time,
+it falls back to `taskkill /F` on Windows or `SIGTERM` on Unix.
+
 ## Provider Abstraction
 
 Fleet supports five LLM providers: Claude Code, Google Antigravity CLI (agy), OpenAI Codex CLI, GitHub Copilot CLI, and Gemini CLI. Members can mix providers within a single fleet.
@@ -79,18 +234,18 @@ Fleet supports five LLM providers: Claude Code, Google Antigravity CLI (agy), Op
 Each member has an optional `llmProvider` field (`'claude' | 'agy' | 'codex' | 'copilot' | 'gemini'`). When absent, it defaults to `'claude'` for backwards compatibility. Every tool that interacts with the member's LLM CLI resolves the provider via `getProvider(agent.llmProvider)` and delegates CLI-specific concerns to the `ProviderAdapter` interface.
 
 ```
-┌──────────┐     getProvider()     ┌─────────────────┐
-│  Tool    │ ───────────────────►  │ ProviderAdapter  │
-│ (generic)│                       │  (per-provider)  │
-└──────────┘                       └────────┬─────────┘
-                                            │ supplies:
-                                     cliCommand()
-                                     buildPromptCommand()
-                                     parseResponse()
-                                     classifyError()
-                                     authEnvVar
-                                     processName
-                                     ...
++----------+     getProvider()     +----------------+
+| Tool     | --------+---------->  | ProviderAdapter |
+| (generic)|                       | (per-provider)  |
++----------+                       +--------+--------+
+                                          | supplies:
+                                   cliCommand()
+                                   buildPromptCommand()
+                                   parseResponse()
+                                   classifyError()
+                                   authEnvVar
+                                   processName
+                                   ...
 ```
 
 The `OsCommands` layer sits below this: it handles OS-specific shell wrapping (PATH prepend, PowerShell syntax, base64 decode) and delegates CLI-specific parts (binary name, flags, JSON format) to the provider.
@@ -110,7 +265,7 @@ src/providers/
 
 ### Mix-and-Match Fleet
 
-A fleet can have members on different providers simultaneously. The PM dispatches work to members by name — it doesn't need to know which LLM backend each member uses. The fleet server resolves the correct CLI commands per member at runtime.
+A fleet can have members on different providers simultaneously. The PM dispatches work to members by name - it doesn't need to know which LLM backend each member uses. The fleet server resolves the correct CLI commands per member at runtime.
 
 ```
 PM (orchestrator, Claude)
@@ -136,11 +291,11 @@ See `docs/provider-matrix.md` for the full comparison table.
 
 ### Strategy Pattern for Member Types
 
-Rather than scattering `if (agent.agentType === 'local')` checks across every tool, the local/remote distinction lives in a single place: the strategy factory. Tools call `getStrategy(agent).execCommand(...)` and get back the same result shape regardless of how it was executed. Adding a third member type (e.g., Docker containers, cloud VMs with API-based access) means writing one new strategy class — no tool changes.
+Rather than scattering `if (agent.agentType === 'local')` checks across every tool, the local/remote distinction lives in a single place: the strategy factory. Tools call `getStrategy(agent).execCommand(...)` and get back the same result shape regardless of how it was executed. Adding a third member type (e.g., Docker containers, cloud VMs with API-based access) means writing one new strategy class - no tool changes.
 
 ### Passwords Encrypted at Rest
 
-SSH passwords are encrypted with AES-256-GCM before being written to the registry file. The encryption key is derived from the machine's identity (hostname + OS username), so the registry file is meaningless if copied to another machine. This isn't meant to stop a determined attacker with root access — it prevents accidental plaintext exposure in backups, screenshots, or config file shares.
+SSH passwords are encrypted with AES-256-GCM before being written to the registry file. The encryption key is derived from the machine's identity (hostname + OS username), so the registry file is meaningless if copied to another machine. This isn't meant to stop a determined attacker with root access - it prevents accidental plaintext exposure in backups, screenshots, or config file shares.
 
 ### Connection Pooling with Idle Timeout
 
@@ -148,15 +303,15 @@ SSH connections are expensive to establish (TCP + key exchange + auth). The serv
 
 ### Base64 Prompt Encoding
 
-Prompts sent to remote members are base64-encoded before being passed through SSH. This sidesteps the shell escaping nightmare of nested quoting across SSH → bash → claude CLI, across different operating systems. The remote member decodes before passing to Claude.
+Prompts sent to remote members are base64-encoded before being passed through SSH. This sidesteps the shell escaping nightmare of nested quoting across SSH -> bash -> claude CLI, across different operating systems. The remote member decodes before passing to Claude.
 
 ### Session Persistence
 
-Each member stores an optional `sessionId` — a Claude conversation thread ID. When `resume=true` (the default), subsequent prompts continue the same conversation, so the remote Claude has full context of prior exchanges. Resetting a session is an explicit action, not an accident.
+Each member stores an optional `sessionId` - a Claude conversation thread ID. When `resume=true` (the default), subsequent prompts continue the same conversation, so the remote Claude has full context of prior exchanges. Resetting a session is an explicit action, not an accident.
 
 ### File-Based Registry
 
-All fleet state lives in `~/.apra-fleet/data/registry.json` — a single JSON file in the user's home directory. It's deliberately not in the project directory (won't be git-committed accidentally) and not in a database (no server to run, no migrations). For a fleet of dozens of members, JSON is more than sufficient.
+All fleet state lives in `~/.apra-fleet/data/registry.json` - a single JSON file in the user's home directory. It's deliberately not in the project directory (won't be git-committed accidentally) and not in a database (no server to run, no migrations). For a fleet of dozens of members, JSON is more than sufficient.
 
 ### Duplicate Folder Prevention
 
@@ -166,18 +321,18 @@ Two members cannot share the same working directory on the same device. For remo
 
 The tools break into natural groups. Each group has detailed documentation:
 
-**[Lifecycle](tools-lifecycle.md)** — `register_member`, `list_members`, `update_member`, `remove_member`, `shutdown_server`
+**[Lifecycle](tools-lifecycle.md)** - `register_member`, `list_members`, `update_member`, `remove_member`, `shutdown_server`
 Manage the fleet roster and server lifecycle. Registration validates connectivity, detects the OS, and checks that Claude CLI is available. Removal includes best-effort cleanup of auth credentials on the member.
 
-**[Work](tools-work.md)** — `send_files`, `execute_prompt`, `execute_command`, `reset_session`
+**[Work](tools-work.md)** - `send_files`, `execute_prompt`, `execute_command`, `reset_session`
 The core workflow. Push files to a member, run prompts against it, run shell commands directly, manage conversation sessions.
 
-**[Infrastructure](tools-infrastructure.md)** — `provision_llm_auth`, `setup_ssh_key`, `update_llm_cli`
+**[Infrastructure](tools-infrastructure.md)** - `provision_llm_auth`, `setup_ssh_key`, `update_llm_cli`
 One-time setup and maintenance. Provision auth (copy OAuth credentials or deploy API key for any provider), migrate from password to key auth, update the LLM CLI on members.
 
-**[Observability](tools-observability.md)** — `fleet_status`, `member_detail`
+**[Observability](tools-observability.md)** - `fleet_status`, `member_detail`
 Two-layer monitoring. `fleet_status` gives a quick summary table across all members with fleet-aware busy detection (distinguishes between Claude processes serving this member vs unrelated Claude activity). `member_detail` drills into one member with connectivity, CLI version, session state, and system resource metrics.
 
 ## Cross-Platform Support
 
-Members can run Windows, macOS, or Linux. The `platform.ts` utility generates the right shell commands for each OS — different commands for checking processes, reading memory, setting environment variables. The OS is auto-detected during registration (`uname -s` on Unix, `cmd /c ver` on Windows) and stored in the member record so subsequent tool calls don't need to re-detect.
+Members can run Windows, macOS, or Linux. The `platform.ts` utility generates the right shell commands for each OS - different commands for checking processes, reading memory, setting environment variables. The OS is auto-detected during registration (`uname -s` on Unix, `cmd /c ver` on Windows) and stored in the member record so subsequent tool calls don't need to re-detect.
diff --git a/feedback.md b/feedback.md
new file mode 100644
index 00000000..1c445cfb
--- /dev/null
+++ b/feedback.md
@@ -0,0 +1,657 @@
+# OS Service Lifecycle -- Plan Review
+
+**Reviewer:** rbnvk
+**Date:** 2026-05-19 12:38:29-0400
+**Verdict:** CHANGES NEEDED
+
+> See the recent git history of this file to understand the context of this review.
+
+---
+
+## 1. Template Checklist
+
+### 1.1 Does every task have clear "done" criteria?
+
+**PASS.** Every task (T1--T15) has an explicit "Done when" block with testable conditions.
+T6 and T10 (test tasks) specify coverage targets and the requirement that `npm test` stays
+green. T14 and T15 (docs) specify ASCII-only enforcement. No ambiguity.
+
+### 1.2 High cohesion within each task, low coupling between tasks?
+
+**PASS.** Each task has a single concern: T1 is the shutdown endpoint + constants, T2 is
+the interface + factory, T3--T5 are one adapter each, T7 bundles start+stop (these share
+the same server.json/PID lifecycle and are natural counterparts), T9 is status, T11 is
+install extension, T12 is uninstall extension. Clean boundaries.
+
+### 1.3 Are key abstractions and shared interfaces in the earliest tasks?
+
+**PASS.** The ServiceManager interface (T2) and service constants (T1) land in Phase 1
+before any consumer task. The /shutdown endpoint (T1) is also correctly front-loaded since
+it is consumed by adapters (T3 Windows stop, T5 macOS stop) and the stop CLI verb (T7).
+
+### 1.4 Is the riskiest assumption validated in Task 1?
+
+**PASS.** Phase 1 front-loads the two riskiest assumptions: (a) per-user service management
+without elevation across all three OSes (Tasks 3--5), and (b) cross-platform graceful stop
+via the /shutdown endpoint (Task 1). The plan explicitly acknowledges this sequencing in
+the Phase 1 preamble: "If schtasks/systemctl/launchctl cannot be called without elevation,
+this phase fails immediately."
+
+T1 is the shutdown endpoint + constants (cheap), which is the foundation but not itself the
+risky assumption. The risky per-user-no-elevation validation happens in T3--T5. This is
+acceptable because the interface (T2) needs to exist before the adapters can be written,
+and T1 provides the /shutdown endpoint that T3 and T5 depend on for their stop
+implementations.
+
+### 1.5 Later tasks reuse early abstractions (DRY)?
+
+**PASS.** T7 (start/stop CLI) calls `getServiceManager()` from T2. T9 (status) calls
+`serviceManager.query()`. T11 (install) calls `serviceManager.register()` +
+`serviceManager.start()`. T12 (uninstall) calls `serviceManager.unregister()`. The adapter
+pattern is consistently reused throughout.
+
+### 1.6 Phase boundaries at cohesion boundaries?
+
+**PASS.** Phase 1 (adapter layer) is self-contained -- produces a testable service manager
+with mocked tests. Phase 2 (CLI verbs) builds on Phase 1 and produces functional commands.
+Phase 3 (install/uninstall integration) wires everything together. Phase 4 (docs) is
+standalone. Each phase is reviewable and testable independently.
+
+### 1.7 Are tiers monotonically non-decreasing within each phase?
+
+**FAIL (HIGH-1).** Phase 2 tiers are: T7 cheap, T8 cheap, T9 standard, T10 standard.
+That is monotonically non-decreasing -- fine. Phase 1 tiers: T1 cheap, T2 standard,
+T3 standard, T4 standard, T5 standard, T6 standard -- fine. Phase 3: T11 standard,
+T12 standard, T13 standard -- fine. Phase 4: T14 cheap, T15 cheap -- fine.
+
+Actually, on closer inspection this is all compliant. **Changing to PASS.** Retracted.
+
+**PASS.** All phases have monotonically non-decreasing tiers.
+
+### 1.8 Each task completable in one session?
+
+**PASS.** All tasks are scoped to a single file or a small set of related files. The
+largest tasks (T3--T5, one adapter each) are well-bounded: a single class implementing a
+known interface with 5--6 methods, each method being a shell command wrapper. T7 bundles
+start+stop but these are thin CLI modules (~60 lines each). Reasonable for one session.
+
+### 1.9 Dependencies satisfied in order?
+
+**PASS.** T1 has no dependencies. T2 depends on T1 (types file). T3--T5 depend on T2
+(interface). T6 depends on T3--T5. T7 depends on Phase 1. T8 depends on T7. T9 depends
+on Phase 1. T10 depends on T7--T9. T11 depends on Phase 1. T12 depends on T11. T13
+depends on T11--T12. T14--T15 have no code dependencies. All valid.
+
+### 1.10 Any vague tasks that two developers would interpret differently?
+
+**FAIL (HIGH-2).** Task 7 (start command) says: "for dev mode, use process.execPath (node)
+with args [dist/index.js, --transport, http]" but does not specify how to determine the
+path to dist/index.js in dev mode. In install.ts, the existing code uses `findProjectRoot()`
+to locate the project root. T7 needs to specify whether to use the same mechanism or
+hardcode a relative path. Two developers would make different choices here.
+
+Additionally, T7's stop command says "POST /shutdown to the URL" but the /shutdown endpoint
+is defined in T1 as being added to http-transport.ts. The stop command must also handle
+the case where the server is running but the /shutdown endpoint is not yet deployed (e.g.,
+an older version of the binary is running from a previous install). The plan does not
+address this version-skew scenario. **NOTE** (not blocking): the fallback kill path
+covers this, but it should be explicitly called out.
+
+### 1.11 Any hidden dependencies between tasks?
+
+**PASS.** No hidden dependencies found. T7's stop command depends on the /shutdown endpoint
+from T1, which is correctly listed as a Phase 1 dependency. T11's service registration
+depends on knowing the binary path, which is already computed in install.ts.
+
+### 1.12 Does the plan include a risk register?
+
+**PASS.** The risk register covers 10 risks with impact and mitigation. It addresses:
+schtasks /end hard kill, loginctl linger requiring root, non-systemd Linux, Windows batch
+job rights, macOS API versions, binary path on update, backward compat, concurrent start
+race, /shutdown security, and stale server.json. This is thorough.
+
+### 1.13 Does the plan align with requirements.md intent?
+
+**FAIL (HIGH-3).** The Notes section states "Base branch: main" but requirements.md
+explicitly says "Base Branch: feat/mcp-sse-transport" and "Commit directly onto that
+branch; no new branch." This is a direct contradiction with the requirements. The
+implementation branch line is correct (feat/mcp-sse-transport) but the base branch line
+is wrong. This must be fixed to avoid confusion -- a doer might create the PR against
+main instead of extending PR #273.
+
+---
+
+## 2. Risk Checklist (from prep)
+
+### Risk 1: Per-user service registration with NO elevation
+
+**PASS.** The plan explicitly addresses all three OSes:
+- **Windows:** `schtasks /create ... /rl limited` (no elevation). Risk register notes
+  "Log on as a batch job" right restriction on domain-joined machines with mitigation.
+- **Linux:** `systemctl --user` (no elevation). loginctl enable-linger attempted with
+  non-fatal warning. Non-systemd detection throws actionable error.
+- **macOS:** `launchctl bootstrap gui/<uid>` (no elevation, LaunchAgent not LaunchDaemon).
+
+All three are well-specified. The dual-path (service vs. direct) fallback is defined.
+
+### Risk 2: Graceful shutdown on all platforms
+
+**PASS.** The plan introduces a POST /shutdown endpoint (T1) that triggers the existing
+SIGINT handler chain. This is the correct solution for Windows where schtasks /end does
+TerminateProcess. The risk register explicitly calls out "Never use schtasks /end for
+graceful stop." Stop always goes through HTTP /shutdown on all OSes, with a force-kill
+fallback after 5s timeout. This ensures server.json and lock cleanup.
+
+The plan correctly configures service managers to NOT restart after clean exit:
+- systemd: Restart=on-failure (exit 0 = no restart)
+- launchd: KeepAlive.SuccessfulExit=false (exit 0 = no restart)
+- Windows: schtasks at-logon trigger only (no restart semantics)
+
+### Risk 3: Interplay with server.json and singleton lock
+
+**PASS.** The plan reuses the existing checkRunningInstance() (validates pid + /health) for
+the start command's idempotency check. The stop command cleans up stale server.json and
+lock file after the server exits. The risk register notes that the existing stale-file
+cleanup mechanism is sufficient. The existing claimStartupLock prevents concurrent starts.
+
+### Risk 4: start with vs. without a service unit installed
+
+**PASS.** The Verb x OS Matrix explicitly defines both columns ("Service Installed" vs.
+"No Service Installed") for the start verb. When no service is installed, the binary is
+spawned detached with stdout/stderr redirected to LOG_FILE_PATH. T7 spells out both paths
+with the `isInstalled()` check determining which path to take.
+
+### Risk 5: Idempotency of every verb x every OS
+
+**PASS.** The plan states idempotency for each verb:
+- start: checkRunningInstance() first, exit 0 if running.
+- stop: if not running, report and exit 0. Clean up stale files.
+- install: schtasks /create uses /f (force/overwrite). systemctl enable is idempotent.
+  launchctl bootstrap may need bootout first -- see HIGH-4 below.
+- uninstall: each step tolerates "not found" errors.
+
+**FAIL (HIGH-4).** The install verb for macOS calls `launchctl bootstrap gui/<uid>
+<plist-path>` but does NOT call `launchctl bootout` first. If the service is already
+loaded (e.g., user runs `install` twice), `launchctl bootstrap` will fail with "service
+already loaded." The plan must specify that install either (a) calls bootout before
+bootstrap, or (b) catches the "already loaded" error and proceeds. This is a real
+idempotency gap. Windows schtasks uses /f which handles re-registration. Linux systemctl
+enable is inherently idempotent. Only macOS bootstrap has this issue.
+
+### Risk 6: No regression to existing install/uninstall MCP-config behaviour
+
+**PASS.** T11 explicitly states: "after the existing final step (Beads tracker install +
+permissions + install-config.json), add a new step." The existing install steps are
+unchanged. T12 states service removal is "prepended" to the existing uninstall steps.
+The risk register includes "backward compat" and notes service registration is "purely
+additive."
+
+### Risk 7: Log file redirection
+
+**PASS.** LOG_FILE_PATH is defined in T1 as `~/.apra-fleet/data/fleet.log`. Per-OS
+mechanisms are specified:
+- Windows: wrapper.bat handles redirection (schtasks cannot redirect natively).
+- Linux: StandardOutput=append:<logPath>, StandardError=append:<logPath>.
+- macOS: StandardOutPath=logPath, StandardErrorPath=logPath.
+- Direct spawn (no service): stdout/stderr redirect to LOG_FILE_PATH.
+
+**NOTE:** No log rotation strategy is mentioned. The log file will grow unboundedly. This
+is not blocking for this sprint but should be tracked as a follow-up.
+
+### Risk 8: Binary path in service units
+
+**PASS.** T11 specifies `serviceManager.register(binaryPath, ...)` where binaryPath is
+the installed binary from install.ts (BIN_DIR + binary name). The risk register notes
+"install command re-registers the service unit (updates binary path)" and "update command
+calls install --force, which also re-registers."
+
+### Risk 9: status command richness with/without a unit installed
+
+**PASS.** T9 specifies the full output format with all required fields (pid, port, url,
+version, uptime, sessions, service state). The "Service" line is "always shown regardless
+of server state" and covers installed/enabled/disabled/not-installed states. When server
+is stopped, pid/port/url/uptime/sessions are omitted. This matches requirements.
+
+### Risk 10: Port fallback interaction
+
+**PASS.** The stop command reads the URL from server.json (which contains the actual port
+the server bound to, whether 7523 or a fallback port). The status command also reads
+server.json. This correctly handles the port fallback case without assuming the default
+port.
+
+---
+
+## 3. Verb x OS Matrix Completeness
+
+**PASS.** The plan includes an explicit Verb x OS Matrix section with tables for start,
+stop, restart, status, install, and uninstall. Each cell specifies the exact OS command
+or behavior. No "and similarly for X" is used. Each OS is explicitly covered for every
+verb. This directly satisfies the requirements.md mandate: "The plan MUST explicitly walk
+through all three OSes for every verb."
+
+---
+
+## 4. Acceptance Criteria Mapping
+
+Mapping each acceptance criterion from requirements.md to plan tasks:
+
+1. "install registers a per-user service and server is running immediately" -> T11. **PASS.**
+2. "Server comes back after reboot/re-login on all three OSes" -> T3 (at-logon trigger),
+   T4 (WantedBy=default.target + linger), T5 (RunAtLoad=true). **PASS.**
+3. "start/stop/restart/status work idempotently on all OSes" -> T7, T8, T9. **PASS.**
+4. "status reports pid/port/url/version/uptime/sessions and service-unit state" -> T9.
+   **PASS.**
+5. "uninstall stops server and removes service unit and MCP config" -> T12. **PASS.**
+6. "No elevation/admin/root or UAC" -> All adapter tasks (T3--T5) use per-user commands.
+   **PASS.**
+7. "Tests cover verb logic and per-OS adapter" -> T6, T10, T13. **PASS.**
+8. "Docs updated" -> T14, T15. **PASS.**
+
+All acceptance criteria map to at least one task. No gaps.
+
+---
+
+## 5. VERIFY Checkpoint Placement
+
+**PASS.** VERIFY checkpoints are placed at the end of each phase:
+- After Phase 1 (T6): run tests, confirm compile, no regressions.
+- After Phase 2 (T10): run tests, verify --help, no regressions.
+- After Phase 3 (T13): run tests, confirm install/uninstall lifecycle, no regressions.
+- After Phase 4 (T15): confirm ASCII-only, docs accurate.
+
+Each checkpoint specifies what to verify and asks for a report. Correct placement.
+
+---
+
+## 6. Additional Findings
+
+### HIGH-5: Linux adapter stop() uses systemctl --user stop, not /shutdown
+
+Task 4 (Linux adapter) defines stop() as: "`systemctl --user stop apra-fleet` (sends
+SIGTERM, handled gracefully by existing handler)." However, the Verb x OS Matrix for
+`stop` says all OSes use "Read server.json -> POST /shutdown -> wait -> fallback."
+Meanwhile, Task 7 (stop CLI verb) always uses the HTTP /shutdown approach regardless of
+OS.
+
+There is a contradiction: the Linux adapter's `stop()` method uses `systemctl --user stop`
+(which sends SIGTERM), but the CLI stop verb bypasses the adapter entirely and uses HTTP
+/shutdown. This means `serviceManager.stop()` on Linux differs from the CLI stop behavior.
+
+This matters because T12 (uninstall) calls the graceful /shutdown approach, but T11
+(install) calls `serviceManager.start()` which could later be stopped by either path.
+
+The plan needs to clarify: does the CLI `stop` verb call `serviceManager.stop()` or does
+it always go through HTTP /shutdown directly? If the latter, what is
+`serviceManager.stop()` used for? If the adapter's stop() is never called by any CLI verb,
+it is dead code. If it IS called, then the Linux adapter's use of `systemctl --user stop`
+is fine (SIGTERM is handled gracefully), but the Windows adapter's stop() also uses HTTP
+/shutdown, creating an inconsistency in the adapter interface contract.
+
+**Resolution needed:** Either (a) make all adapters' stop() use HTTP /shutdown for
+consistency and have the CLI call `serviceManager.stop()`, or (b) have the CLI always
+bypass the adapter for stop and document that `serviceManager.stop()` is an internal
+method for the unregister flow only, or (c) clarify the exact call path in T7 and T12.
+
+---
+
+## Summary
+
+The plan is well-structured with clear task boundaries, proper dependency ordering,
+front-loaded risk validation, and a thorough Verb x OS Matrix. The ServiceManager adapter
+pattern is sound and the /shutdown endpoint elegantly solves the cross-platform graceful
+stop problem. All acceptance criteria map to tasks.
+
+**Three blocking items must be resolved:**
+
+- **HIGH-2:** T7 (start/stop CLI) is underspecified for dev-mode binary path resolution.
+  Clarify how dist/index.js is located.
+- **HIGH-3:** Notes section says "Base branch: main" -- contradicts requirements.md which
+  mandates feat/mcp-sse-transport. Fix the Notes.
+- **HIGH-4:** macOS install idempotency gap -- `launchctl bootstrap` will fail if already
+  loaded. Must bootout first or handle the error.
+- **HIGH-5:** Contradictory stop() semantics between the Linux adapter (systemctl stop),
+  the CLI stop verb (HTTP /shutdown), and the Verb x OS Matrix. Clarify the call path.
+
+**Non-blocking notes:**
+- No log rotation strategy (track as follow-up).
+- Version-skew scenario for /shutdown endpoint not explicitly addressed (fallback kill
+  covers it, but should be noted).
+
+---
+---
+
+# OS Service Lifecycle -- Plan Re-Review
+
+**Reviewer:** rbnvk
+**Date:** 2026-05-19 12:50:00-0400
+**Verdict:** APPROVED
+
+> See the recent git history of this file to understand the context of this review.
+> Prior review: 2026-05-19 12:38:29-0400 -- CHANGES NEEDED with 4 HIGH findings.
+> Doer revised PLAN.md in commit 7f712fb to address all findings.
+
+---
+
+## Prior HIGH Findings -- Resolution Verification
+
+### HIGH-2: Task 7 dev-mode binary path resolution
+
+**RESOLVED.** T7 now explicitly specifies: "In dev mode (non-SEA), the command is
+`process.execPath` (the Node.js binary) with args
+`[path.join(findProjectRoot(), 'dist', 'index.js'), '--transport', 'http']` -- using the
+same `findProjectRoot()` function from src/cli/install.ts that walks up from __dirname
+looking for version.json. Import `findProjectRoot` from install.ts (it is already exported)
+or extract it to a shared util."
+
+This is unambiguous -- two developers would make the same choice. The version-skew concern
+from the NOTE is also now addressed: T7 explicitly documents the fallback: "if an older
+binary without the /shutdown endpoint is running, the POST will fail (404 or connection
+error). The fallback force-kill path handles this correctly."
+
+### HIGH-3: Base branch contradiction
+
+**RESOLVED.** Notes section now reads: "Base branch: feat/mcp-sse-transport (extends
+PR #273 -- no new branch)" and "Implementation branch: feat/mcp-sse-transport (commit
+directly onto this branch)." This matches requirements.md exactly. No remaining references
+to "main" as base branch anywhere in PLAN.md.
+
+### HIGH-4: macOS install idempotency gap
+
+**RESOLVED.** T5 register() now includes: "Before loading, call `launchctl bootout
+gui/<uid>/com.apra-fleet.server` and tolerate 'not loaded' / 'no such process' errors --
+this makes register() idempotent." The Verb x OS Matrix install row for macOS also
+reflects this: "launchctl bootout ... (tolerate 'not loaded' error). Then launchctl
+bootstrap ..." Both the adapter task and the matrix are consistent.
+
+### HIGH-5: Contradictory stop() semantics
+
+**RESOLVED.** The Design Summary now includes a dedicated "Stop call path (unified)"
+paragraph that clarifies the architecture:
+
+1. The CLI `stop` verb bypasses the adapter entirely and calls POST /shutdown directly
+   (since stopping the process is service-agnostic).
+2. All three adapters' stop() methods use the same POST /shutdown mechanism (not
+   systemctl stop or OS-specific commands) for cross-platform consistency.
+3. serviceManager.stop() exists for use within unregister() and for interface
+   completeness, but the CLI never routes through it.
+
+T4 (Linux adapter) stop() now reads: "Read server.json for URL. POST /shutdown. Wait up
+to 5s for process exit (poll pid). Fallback: kill -TERM <pid>. This matches the Windows
+and macOS adapters." The prior contradiction with `systemctl --user stop` is fully
+eliminated. All three adapters share the same contract.
+
+---
+
+## Structural Re-Verification
+
+### Task slicing / ordering / dependencies
+
+**PASS.** No changes to task boundaries or ordering. The revisions were surgical --
+clarifications within T4, T5, and T7 without altering the phase structure.
+
+### Tier assignments and monotonicity
+
+**PASS.** No tier changes. Phase 1: cheap -> standard (5x). Phase 2: cheap (2x) ->
+standard (2x). Phase 3: standard (3x). Phase 4: cheap (2x). All monotonically
+non-decreasing within each phase.
+
+### VERIFY checkpoint placement
+
+**PASS.** All four VERIFY blocks remain at phase boundaries. No changes.
+
+### Acceptance criteria mapping
+
+**PASS.** All 8 acceptance criteria from requirements.md still map to tasks. The
+revisions did not remove or alter any task's scope. Verified:
+
+1. install -> service running immediately: T11. Covered.
+2. Reboot/re-login persistence: T3 (at-logon), T4 (linger), T5 (RunAtLoad). Covered.
+3. Verb idempotency on all OSes: T7, T8, T9. Covered.
+4. status richness: T9. Covered.
+5. uninstall cleanup: T12. Covered.
+6. No elevation: T3--T5 per-user commands. Covered.
+7. Test coverage: T6, T10, T13. Covered.
+8. Docs: T14, T15. Covered.
+
+### Deferred Items section
+
+**PASS.** New "Deferred Items" section exists and tracks: (1) log rotation -- noted as
+append-only with no rotation for this sprint, with follow-up approaches listed
+(size-based rotation, OS-native logrotate/newsyslog); (2) TLS/auth on HTTP endpoint.
+Both are correctly scoped out.
+
+### New problems introduced by revision
+
+**NONE FOUND.** The revisions are clean clarifications that do not introduce new
+ambiguity, contradictions, or gaps. The Verb x OS Matrix, risk register, and task
+descriptions remain internally consistent after the changes.
+
+---
+
+## Summary
+
+All four HIGH findings from the initial review are fully resolved. The plan is
+well-structured with clear task boundaries, proper dependency ordering, front-loaded risk
+validation, a complete Verb x OS Matrix, and a unified stop call path. The Deferred Items
+section tracks log rotation and TLS/auth as follow-ups. All acceptance criteria map to
+tasks. No new issues introduced by the revision.
+
+**Verdict: APPROVED.** The plan is ready for implementation.
+
+---
+---
+
+# Phase 1 (Platform Service Foundation) -- Code Review
+
+**Reviewer:** rbnvk
+**Date:** 2026-05-19 18:15:00-0400
+**Verdict:** APPROVED
+
+> Commits reviewed: 9963198 (T2), 98115b9 (T3), 93da1fa (T4), 1be25ed (T5), 490ead1 (T6), 224cd11 (T6.5)
+> Build: PASS (tsc clean). Tests: 1372 passed, 6 skipped. ASCII: clean (all reviewed files).
+
+---
+
+## 1. Platform Command Correctness
+
+### Windows (src/services/service-manager/windows.ts)
+
+**PASS.** `schtasks /create /tn ApraFleet /tr <wrapper> /sc onlogon /rl limited /f` is
+correct: `/sc onlogon` triggers at user login, `/rl limited` runs without elevation, `/f`
+forces overwrite for idempotency. `/run` starts the task. `/delete /f` removes it without
+confirmation. `/query /fo csv /nh` returns machine-parseable output. All flags verified
+against schtasks documentation.
+
+Stop path correctly uses `gracefulStopByServerJson` with a `taskkill /F /PID` fallback,
+avoiding `schtasks /end` (which does TerminateProcess). Matches the plan's unified stop
+semantics.
+
+### Linux (src/services/service-manager/linux.ts)
+
+**PASS.** All `systemctl` calls use `--user` flag throughout. `daemon-reload` after writing
+the unit file. `enable` to set up WantedBy symlink. `loginctl enable-linger` attempted
+with `console.warn` on failure (non-fatal, as specified in plan). The `checkSystemd()`
+guard validates `/run/user/<uid>/systemd` exists before any systemd operation.
+
+Unit file content is correct: `Type=simple`, `Restart=on-failure` (no restart on clean
+exit), `StandardOutput=append:<path>`, `StandardError=append:<path>`,
+`WantedBy=default.target`.
+
+### macOS (src/services/service-manager/macos.ts)
+
+**PASS.** `launchctl bootstrap gui/<uid> <plist>` for registration, `launchctl bootout
+gui/<uid>/<label>` for removal. Register calls bootout first (tolerate error) then
+bootstrap -- idempotent as required by HIGH-4 resolution. `launchctl kickstart` for
+explicit start. `launchctl print` for status query with pid extraction via
+`/\bpid\s*=\s*(\d+)/` regex.
+
+Plist content: `RunAtLoad=true`, `KeepAlive.SuccessfulExit=false` (no restart on clean
+exit). `StandardOutPath` and `StandardErrorPath` set. Label matches constant.
+
+Domain helper `gui/<uid>` correctly uses `process.getuid()` with fallback to `'501'`
+(default macOS UID).
+
+---
+
+## 2. Error Handling
+
+**PASS.** Each adapter method fails gracefully:
+
+- **Windows:** `unregister()` catches schtasks delete failure (task-not-found). `query()`
+  catches schtasks query failure and returns `{ installed: false, running: false }`.
+  `isInstalled()` catches and returns false. `stop()` fallback taskkill is try/caught.
+- **Linux:** `unregister()` wraps disable, stop, unlink, and daemon-reload each in
+  individual try/catch. `query()` catches is-active and is-enabled failures independently.
+  `loginctl enable-linger` failure is a warning, not an error.
+- **macOS:** `unregister()` catches bootout failure (service not loaded). `register()`
+  catches bootout-before-bootstrap failure. `query()` catches launchctl print failure and
+  returns `{ installed: true, running: false }`.
+
+The `gracefulStopByServerJson()` function in index.ts handles missing/unreadable
+server.json, missing pid/url, dead process, and timeout with fallback kill. Solid.
+
+---
+
+## 3. Security -- Per-User Scope
+
+**PASS.** No `sudo`, `runas`, `admin`, or elevation anywhere in the codebase:
+
+- **Windows:** `/rl limited` -- explicit non-elevated. No `/ru SYSTEM`.
+- **Linux:** `systemctl --user` throughout. Unit file in `~/.config/systemd/user/` (user
+  directory, not `/etc/systemd/system/`). `loginctl enable-linger` targets current
+  username, not root.
+- **macOS:** Plist in `~/Library/LaunchAgents/` (per-user, not `/Library/LaunchDaemons/`).
+  Domain is `gui/<uid>`, not `system/`.
+- **Factory:** `getServiceManager()` returns `NoopServiceManager` on unsupported platforms
+  with a warning -- no attempt to use privileged fallbacks.
+
+---
+
+## 4. Test Coverage
+
+**PASS.** 40 tests across all three adapters covering:
+
+- **Happy paths:** register writes correct content, calls correct commands; start calls
+  correct command; stop invokes gracefulStopByServerJson; query parses output correctly;
+  isInstalled returns true/false.
+- **Error paths:** Windows unregister tolerates task-not-found. Windows query handles
+  task-not-found. Windows isInstalled handles query failure. Linux register warns on
+  loginctl failure. Linux unregister is idempotent when unit not installed. Linux query
+  handles missing unit file. Linux non-systemd detection throws on register, start, stop.
+  macOS register tolerates bootout error on first registration. macOS unregister tolerates
+  bootout error when not loaded. macOS query handles launchctl print failure and no-pid
+  output.
+- **Windows stop fallback:** Test captures the fallback function and verifies it calls
+  taskkill with the correct PID.
+- **Mocking strategy:** `node:child_process`, `node:fs`, `node:os`, and
+  `gracefulStopByServerJson` are all mocked. Tests verify command arguments precisely
+  (e.g., exact schtasks flags, exact systemctl args).
+
+---
+
+## 5. T6.5 Logging -- Malformed Body Safety
+
+**PASS.** (src/services/http-transport.ts:130-139)
+
+The initialize body is cast to a structural type with all-optional fields:
+```
+body?.params?.clientInfo ?? {}
+body?.params?.capabilities ?? {}
+```
+
+If the body is `{ method: "initialize" }` with no `params`, `clientInfo` defaults to `{}`,
+`clientCaps` defaults to `{}`, `capKeys` becomes `''` (falls back to `'none'` in the log
+string), `hasChannel` becomes `false`. No crash path. If `params` exists but `capabilities`
+is a non-object primitive, `Object.keys()` would throw -- but the MCP SDK would reject
+such a body before it reaches this code, and the outer try/catch on `parseBody` handles
+truly malformed JSON.
+
+The `/shutdown` endpoint addition (lines 103-111) uses `setTimeout` + `process.emit('SIGINT')`
+which allows the response to flush before triggering shutdown. Clean.
+
+---
+
+## 6. ASCII-Only Compliance
+
+**PASS.** Grep for non-ASCII bytes across all 7 reviewed files returned zero matches. All
+string literals use ASCII characters only. The plist XML uses standard ASCII entities.
+The systemd unit file uses ASCII. Comments are ASCII.
+
+---
+
+## Findings (non-blocking)
+
+### MEDIUM-1: Windows bat wrapper does not quote individual args
+
+**File:** `src/services/service-manager/windows.ts:14`
+**Rating:** MEDIUM
+
+The bat line is:
+```
+`"${binaryPath}" ${args.join(' ')} >> "${logPath}" 2>&1`
+```
+
+`binaryPath` and `logPath` are quoted, but individual args are not. If any arg contains
+a space (e.g., a path), the bat file breaks. Current callers always pass simple flags
+(`['--transport', 'http']`), so this is not blocking. Suggest quoting each arg:
+```
+const quotedArgs = args.map(a => `"${a}"`).join(' ');
+```
+
+### MEDIUM-2: Linux unregister() gates on checkSystemd() before gracefulStopByServerJson()
+
+**File:** `src/services/service-manager/linux.ts:51`
+**Rating:** MEDIUM
+
+`unregister()` calls `checkSystemd()` first (line 51). If systemd was somehow removed but
+the process is still running, the graceful stop at line 52 never executes. The graceful
+stop is systemd-independent (reads server.json, POSTs /shutdown). Consider reordering:
+call `gracefulStopByServerJson()` first, then `checkSystemd()` before the systemctl
+commands that actually need it. The individual systemctl calls are already try/caught.
+
+### MEDIUM-3: macOS plist XML does not escape special characters
+
+**File:** `src/services/service-manager/macos.ts:22`
+**Rating:** MEDIUM
+
+`buildPlist()` interpolates `binaryPath`, `args`, and `logPath` directly into XML `<string>`
+elements without escaping `&`, `<`, `>`. If any path contains these characters, the plist
+will be malformed XML. Unlikely for real paths but a correctness gap. Consider adding:
+```
+function xmlEscape(s: string): string {
+  return s.replace(/&/g, '&amp;').replace(/</g, '&lt;').replace(/>/g, '&gt;');
+}
+```
+
+### LOW-1: Windows query CSV parsing
+
+**File:** `src/services/service-manager/windows.ts:50-51`
+**Rating:** LOW
+
+Splitting on `","` and indexing `cols[2]` works for standard schtasks CSV output but could
+be fragile across Windows versions/locales that add extra columns. The fallback to empty
+string and the catch-all try/catch make this safe in practice. Just noting it.
+
+### LOW-2: NoopServiceManager silently swallows operations
+
+**File:** `src/services/service-manager/index.ts:58-65`
+**Rating:** LOW
+
+The `NoopServiceManager` for unsupported platforms resolves all methods silently. The
+factory already emits a `console.warn`, so callers know the platform is unsupported. This
+is the correct behavior -- just noting it for completeness.
+
+---
+
+## Summary
+
+Phase 1 is well-implemented. Platform commands are correct across all three OSes. Error
+handling is thorough with graceful degradation. Security posture is clean -- no elevation
+anywhere. Test coverage is strong at 40 tests including error paths. T6.5 logging is safe
+against malformed bodies. All files are ASCII-only.
+
+Three MEDIUM findings noted for hardening (Windows arg quoting, Linux unregister ordering,
+macOS XML escaping) -- none are blocking. These can be addressed in a follow-up or during
+Phase 2 implementation.
+
+**Verdict: APPROVED.** Phase 1 is ready. Proceed to Phase 2.
diff --git a/llms-full.txt b/llms-full.txt
index 2fed5956..2c41f88c 100644
--- a/llms-full.txt
+++ b/llms-full.txt
@@ -236,6 +236,105 @@ reviewer  Opus 4.7        final review
 Provider strengths, role recommendations, and gotchas:
 [docs/provider-guide.md](docs/provider-guide.md).
 
+## Transport
+
+Fleet runs as a singleton service on your machine. When you start it, the server
+listens on port 7523 by default and multiple LLM clients (Claude Code, Gemini,
+Copilot, Codex) connect concurrently to the same fleet instance.
+
+### HTTP+SSE Transport (default)
+
+By default, fleet uses the **HTTP+SSE transport** -- clients connect over HTTP and
+receive server-push notifications over Server-Sent Events (SSE).
+
+```bash
+apra-fleet                  # Start HTTP server (default)
+apra-fleet --transport http # Explicitly use HTTP
+```
+
+When the server starts, it writes a `server.json` file to `~/.apra-fleet/` containing:
+```json
+{
+  "pid": 12345,
+  "port": 7523,
+  "url": "http://localhost:7523/mcp",
+  "version": "x.y.z",
+  "startedAt": "2026-05-19T..."
+}
+```
+
+If port 7523 is busy, the server falls back to port 0 (OS-assigned random port) and
+records the actual port in `server.json`. You can override the default port with the
+`APRA_FLEET_PORT` environment variable.
+
+**Multiple clients, one server.** When a second LLM client starts, it reads
+`server.json`, detects the running server, and connects to it. All clients share the
+same fleet instance -- no restart needed. When you close all clients, the server
+keeps running (as a singleton service on your machine). It shuts down on explicit
+exit (`apra-fleet --shutdown` tool) or on system reboot.
+
+**Re-register with HTTP.** When you upgrade or re-install Fleet, run:
+```bash
+apra-fleet install  # Registers fleet with HTTP transport (default)
+```
+
+### Event Bus
+
+The event bus is an internal notification system. When a subsystem (like credential
+storage) completes an operation, it emits an event, and the HTTP server broadcasts
+the notification to all connected clients via SSE. This lets clients respond
+immediately to fleet events without polling.
+
+### Backward Compatibility: stdio Transport
+
+Existing fleets can continue using the stdio transport:
+
+```bash
+apra-fleet --transport stdio # Use legacy stdio transport
+apra-fleet --stdio            # Alias for --transport stdio
+```
+
+When you run `apra-fleet install --transport stdio`, the MCP config keeps the old
+command-based format (no HTTP URL). The server's behavior is identical to pre-HTTP
+versions: it reads JSON-RPC from stdin, writes responses to stdout, and communicates
+with one client at a time via the stdio pipe.
+
+If you want to stay on stdio for now, run:
+```bash
+apra-fleet install --transport stdio
+```
+
+If you later switch back to HTTP, re-run the default install:
+```bash
+apra-fleet install  # Switches to HTTP transport
+```
+
+## Service Mode
+
+Fleet keeps a singleton server running so all your LLM clients share one instance.
+Registering it as an OS service keeps it alive across terminal sessions -- the server
+survives terminal close and restarts automatically on login:
+
+- Windows: a per-user Scheduled Task (Task Scheduler, OnLogon trigger)
+- Linux: a systemd user unit (`systemctl --user`)
+- macOS: a LaunchAgent in `~/Library/LaunchAgents/`
+
+Four verbs manage the lifecycle directly:
+
+```
+apra-fleet start    # start the server (idempotent -- exits cleanly if already running)
+apra-fleet stop     # graceful shutdown: POST /shutdown, poll, force-kill fallback
+apra-fleet restart  # stop then start
+apra-fleet status   # state, PID, port, uptime, version, and OS service status
+```
+
+`install` and `uninstall` include service registration. Running
+`apra-fleet install` on a packaged binary with the HTTP transport (the default)
+registers and starts the OS service automatically -- no extra step.
+`apra-fleet uninstall` stops and deregisters the service before removing files.
+Service registration failures are non-fatal: a warning is printed and the install
+continues.
+
 ## The PM skill
 
 The **PM skill** is Fleet's reference workflow for **software development**
@@ -374,7 +473,7 @@ Members with different **providers** are interchangeable from the PM's perspecti
   </doc>
 
   <doc title="Architecture" desc="How the fleet hub, MCP server, and members interact at a system level.">
-<!-- llm-context: This document explains the internal architecture of apra-fleet — the MCP server, member registry, SSH transport, session management, and how tools are dispatched. Read this when a user asks how fleet works under the hood, or when debugging connectivity or session issues. -->
+<!-- llm-context: This document explains the internal architecture of apra-fleet - the MCP server, member registry, SSH transport, session management, and how tools are dispatched. Read this when a user asks how fleet works under the hood, or when debugging connectivity or session issues. -->
 <!-- keywords: MCP server, member registry, SSH, transport, session, tool dispatch, child process, local member, remote member, architecture -->
 <!-- see-also: ../README.md (getting started), tools-infrastructure.md (tool details), vocabulary.md (terminology) -->
 
@@ -382,20 +481,20 @@ Members with different **providers** are interchangeable from the PM's perspecti
 
 ## Why This Exists
 
-AI coding agents are powerful on a single machine. But real work spans many machines — a dev server, a staging box, a GPU trainer, a production host. Today, if you want Claude Code working across all of them, you SSH in manually, run prompts one at a time, and copy files by hand. There's no single pane of glass.
+AI coding agents are powerful on a single machine. But real work spans many machines - a dev server, a staging box, a GPU trainer, a production host. Today, if you want Claude Code working across all of them, you SSH in manually, run prompts one at a time, and copy files by hand. There's no single pane of glass.
 
-Apra Fleet gives one Claude instance the ability to orchestrate many. Register machines, push files, run prompts, monitor health — all through natural language from your terminal. One master, many members.
+Apra Fleet gives one Claude instance the ability to orchestrate many. Register machines, push files, run prompts, monitor health - all through natural language from your terminal. One master, many members.
 
 ## Conceptual Model
 
 The system has three layers of abstraction:
 
-**Fleet** → **Members** → **Sessions**
+**Fleet** -> **Members** -> **Sessions**
 
-A *fleet* is the collection of all registered machines. A *member* is one machine with a working directory — the unit you talk to. A *session* is a conversation thread on a member — Claude remembers context across prompts within a session, and you can reset it to start fresh.
+A *fleet* is the collection of all registered machines. A *member* is one machine with a working directory - the unit you talk to. A *session* is a conversation thread on a member - Claude remembers context across prompts within a session, and you can reset it to start fresh.
 
 Members come in two flavors:
-- **Remote members** communicate over SSH. They can be any machine you can reach — Linux VMs, macOS servers, Windows boxes.
+- **Remote members** communicate over SSH. They can be any machine you can reach - Linux VMs, macOS servers, Windows boxes.
 - **Local members** run on the same machine as the master, in a different folder. No SSH needed. Useful for isolating work into separate project directories without spinning up another machine.
 
 This distinction is hidden behind a **Strategy pattern**: every tool interacts with members through a uniform interface. The strategy implementation (remote via SSH, or local via child process) is selected at runtime based on member type. Tools never know or care which kind of member they're talking to.
@@ -403,32 +502,32 @@ This distinction is hidden behind a **Strategy pattern**: every tool interacts w
 ## How It Fits Together
 
 ```
-┌────────────────────────────────────────────────────┐
-│  Master Machine                                    │
-│                                                    │
-│  Claude Code CLI ◄──stdio──► Apra Fleet Server      │
-│                               │                    │
-│                    ┌──────────┴──────────┐         │
-│                    │  Member Strategy    │         │
-│                    │  (uniform interface)│         │
-│                    └──┬─────────────┬───┘         │
-│                       │             │              │
-│              Remote Strategy   Local Strategy      │
-│              (ssh2 + sftp)    (child_process + fs) │
-│                       │             │              │
-│                    SSH│        local exec           │
-└───────────────────────┼─────────────┼──────────────┘
-                        │             │
-           ┌────────────┘             └──► /other/project/
-           ▼                               (same machine)
-    ┌──────────────┐
-    │ Remote Member │
-    │ (any OS,      │
-    │  any provider)│
-    └──────────────┘
-```
-
-The MCP server speaks **stdio** — the standard transport for Claude Code MCP servers. Claude sends JSON-RPC tool calls, the server executes them, returns results. No HTTP, no ports to open.
++------------------------------------------------------+
+|  Master Machine                                    |
+|                                                    |
+|  Claude Code CLI <--stdio--> Apra Fleet Server     |
+|                               |                    |
+|                    +---------+---------+           |
+|                    |  Member Strategy    |         |
+|                    |  (uniform interface)|         |
+|                    +--+------------+---+           |
+|                       |             |              |
+|              Remote Strategy   Local Strategy      |
+|              (ssh2 + sftp)    (child_process + fs) |
+|                       |             |              |
+|                    SSH|        local exec           |
++-------------------+------------+--+---------------+
+                    |             |
+           +--------+             +---> /other/project/
+           |                          (same machine)
+    +------+----------+
+    | Remote Member  |
+    | (any OS,       |
+    |  any provider) |
+    +----------------+
+```
+
+The MCP server speaks **stdio** - the standard transport for Claude Code MCP servers. Claude sends JSON-RPC tool calls, the server executes them, returns results. No HTTP, no ports to open.
 
 ## Layers
 
@@ -446,6 +545,161 @@ The codebase follows a strict layering:
 
 Each layer only depends on the layers below it. Tools never import other tools. Services don't know about the MCP protocol.
 
+## Transport Layer
+
+Fleet supports two MCP transports: HTTP+SSE (default) and stdio (legacy).
+
+### HTTP+SSE Transport (Default)
+
+The HTTP transport runs as a **singleton service** on your machine. A single fleet
+server listens on port 7523 and multiple LLM clients connect concurrently. Each
+client gets its own session with a dedicated `McpServer` instance inside the fleet
+process, so tool calls and state are isolated per client.
+
+```
+    Client 1 (Claude Code)          Client 2 (Gemini)
+              |                             |
+              +-------------+---+----------+
+                            |
+                 HTTP + SSE  |
+                             |
+                     +-------+-------+
+                     | Singleton    |
+                     | Fleet Server  |
+                     | (port 7523)   |
+                     +-------+-------+
+                             |
+                 +-----------|----------+
+                 |           |          |
+              McpServer   McpServer  Tool Registry
+              (Session 1) (Session 2) (shared)
+                 |           |
+                 +------+----+
+                        |
+                  Event Bus (notifications)
+```
+
+**Per-session McpServer model:** When a client connects, the fleet creates a new
+`McpServer` instance for that session. This isolates tool call state, session storage,
+and concurrent requests. Multiple clients can call the same tool simultaneously
+without interfering with each other.
+
+**Event bus:** The fleet's internal event bus (`FleetEventMap`) carries notifications
+from subsystems (e.g., `credential:stored` when out-of-band auth completes) to all
+connected clients via SSE `notifications/message`. This is the publish-subscribe
+mechanism for server-initiated events.
+
+**Singleton lifecycle:** The server starts on-demand the first time an LLM client
+connects. Subsequent clients reuse the running server. The server keeps running until
+explicitly shut down (via `shutdown_server` tool, SIGINT, SIGTERM, or system reboot).
+This is intentional - the singleton is a long-lived service, not a per-request
+process. Restarting it has a cost (tool re-registration, SSH connection repool,
+stall detector restart).
+
+**server.json discovery:** When the server starts, it writes `~/.apra-fleet/server.json`
+with `{ pid, port, url, version, startedAt }`. Clients discover the running instance
+by reading this file and verifying the process is alive and the port responds to
+`/health` endpoint. The double-check (process.kill(pid, 0) + HTTP health request)
+detects stale entries and cleans them up.
+
+**Localhost-only binding:** The fleet server binds to `127.0.0.1` only, never
+`0.0.0.0`. This ensures only local processes can connect -- no network exposure.
+
+### Stdio Transport (Legacy)
+
+When `--transport stdio` is used, the fleet runs in the legacy mode: one MCP server
+process per client connection. The server reads JSON-RPC from stdin, writes responses
+to stdout, and terminates when the client disconnects. No HTTP, no singleton, no
+event bus. Tools work identically; the transport layer differs.
+
+### Event Flow Subsystem -> Notification
+
+When an event is emitted on the event bus:
+
+1. **Subsystem** (e.g., `auth-socket.ts`) calls `fleetEvents.emit('credential:stored', { name: ... })`
+2. **Event Bus** (`event-bus.ts`) delivers the event to all registered subscribers
+3. **HTTP Transport** (`http-transport.ts`) receives the event in its subscriber callback
+4. **Per-session McpServer** sends a `notifications/message` to each connected client over SSE
+5. **Client** receives the notification in its SSE stream handler
+
+This is the publish-subscribe pattern: producers emit to the bus, subscribers (the
+HTTP transport) are notified, and the transport broadcasts to all session clients.
+
+## Service Manager
+
+The `ServiceManager` component registers and controls the fleet server as an OS
+background service. It uses an adapter pattern so the CLI verbs (`start`, `stop`,
+`restart`, `status`) and the `install`/`uninstall` commands work identically on every
+platform.
+
+### Interface
+
+`src/services/service-manager/types.ts` defines the contract:
+
+```
+interface ServiceManager {
+  register(binaryPath, args, logPath): Promise<void>
+  unregister(): Promise<void>
+  start(): Promise<void>
+  stop(): Promise<void>
+  query(): Promise<ServiceStatus>
+  isInstalled(): Promise<boolean>
+}
+
+interface ServiceStatus {
+  installed: boolean
+  running: boolean
+  pid?: number
+  enabled?: boolean
+}
+```
+
+Service name constants are also in `types.ts`: `WINDOWS_TASK_NAME`,
+`LINUX_UNIT_NAME`, `MACOS_PLIST_LABEL`.
+
+### Platform Adapters
+
+```
+src/services/service-manager/
+  types.ts    - ServiceManager interface, ServiceStatus, service name constants
+  index.ts    - getServiceManager() factory, gracefulStopByServerJson(), NoopServiceManager
+  windows.ts  - WindowsServiceManager  (schtasks per-user Scheduled Task)
+  linux.ts    - LinuxServiceManager    (systemd --user unit)
+  macos.ts    - MacOSServiceManager    (launchd LaunchAgent plist)
+```
+
+- **WindowsServiceManager**: writes a wrapper `.bat` file and creates a per-user
+  Scheduled Task with an `OnLogon` trigger via `schtasks /create`. `start`, `stop`,
+  and `query` use `schtasks /run`, `/end`, and `/query`.
+- **LinuxServiceManager**: writes a systemd user unit file, then runs `daemon-reload`,
+  `enable`, and `loginctl enable-linger`. `start`, `stop`, and `query` use
+  `systemctl --user`.
+- **MacOSServiceManager**: writes a plist to `~/Library/LaunchAgents/` and bootstraps
+  it with `launchctl bootstrap`. `KeepAlive.SuccessfulExit=false` prevents launchd
+  from restarting on a clean exit. `start`, `stop`, and `query` use `launchctl`.
+
+### Factory
+
+`getServiceManager()` in `index.ts` selects the right adapter at runtime via a
+dynamic `import()` keyed on `process.platform`:
+
+```
+win32   -> WindowsServiceManager
+linux   -> LinuxServiceManager
+darwin  -> MacOSServiceManager
+other   -> NoopServiceManager  (warns once; all methods are safe no-ops)
+```
+
+`NoopServiceManager` ensures the CLI verbs work on unsupported platforms without
+crashing -- they simply have no effect.
+
+### Graceful Stop
+
+`gracefulStopByServerJson()` (exported from `index.ts`) reads
+`~/.apra-fleet/server.json`, POSTs to the `/shutdown` endpoint, then polls the
+process at 500 ms intervals for up to 5 s. If the process does not exit in time,
+it falls back to `taskkill /F` on Windows or `SIGTERM` on Unix.
+
 ## Provider Abstraction
 
 Fleet supports five LLM providers: Claude Code, Google Antigravity CLI (agy), OpenAI Codex CLI, GitHub Copilot CLI, and Gemini CLI. Members can mix providers within a single fleet.
@@ -455,18 +709,18 @@ Fleet supports five LLM providers: Claude Code, Google Antigravity CLI (agy), Op
 Each member has an optional `llmProvider` field (`'claude' | 'agy' | 'codex' | 'copilot' | 'gemini'`). When absent, it defaults to `'claude'` for backwards compatibility. Every tool that interacts with the member's LLM CLI resolves the provider via `getProvider(agent.llmProvider)` and delegates CLI-specific concerns to the `ProviderAdapter` interface.
 
 ```
-┌──────────┐     getProvider()     ┌─────────────────┐
-│  Tool    │ ───────────────────►  │ ProviderAdapter  │
-│ (generic)│                       │  (per-provider)  │
-└──────────┘                       └────────┬─────────┘
-                                            │ supplies:
-                                     cliCommand()
-                                     buildPromptCommand()
-                                     parseResponse()
-                                     classifyError()
-                                     authEnvVar
-                                     processName
-                                     ...
++----------+     getProvider()     +----------------+
+| Tool     | --------+---------->  | ProviderAdapter |
+| (generic)|                       | (per-provider)  |
++----------+                       +--------+--------+
+                                          | supplies:
+                                   cliCommand()
+                                   buildPromptCommand()
+                                   parseResponse()
+                                   classifyError()
+                                   authEnvVar
+                                   processName
+                                   ...
 ```
 
 The `OsCommands` layer sits below this: it handles OS-specific shell wrapping (PATH prepend, PowerShell syntax, base64 decode) and delegates CLI-specific parts (binary name, flags, JSON format) to the provider.
@@ -486,7 +740,7 @@ src/providers/
 
 ### Mix-and-Match Fleet
 
-A fleet can have members on different providers simultaneously. The PM dispatches work to members by name — it doesn't need to know which LLM backend each member uses. The fleet server resolves the correct CLI commands per member at runtime.
+A fleet can have members on different providers simultaneously. The PM dispatches work to members by name - it doesn't need to know which LLM backend each member uses. The fleet server resolves the correct CLI commands per member at runtime.
 
 ```
 PM (orchestrator, Claude)
@@ -512,11 +766,11 @@ See `docs/provider-matrix.md` for the full comparison table.
 
 ### Strategy Pattern for Member Types
 
-Rather than scattering `if (agent.agentType === 'local')` checks across every tool, the local/remote distinction lives in a single place: the strategy factory. Tools call `getStrategy(agent).execCommand(...)` and get back the same result shape regardless of how it was executed. Adding a third member type (e.g., Docker containers, cloud VMs with API-based access) means writing one new strategy class — no tool changes.
+Rather than scattering `if (agent.agentType === 'local')` checks across every tool, the local/remote distinction lives in a single place: the strategy factory. Tools call `getStrategy(agent).execCommand(...)` and get back the same result shape regardless of how it was executed. Adding a third member type (e.g., Docker containers, cloud VMs with API-based access) means writing one new strategy class - no tool changes.
 
 ### Passwords Encrypted at Rest
 
-SSH passwords are encrypted with AES-256-GCM before being written to the registry file. The encryption key is derived from the machine's identity (hostname + OS username), so the registry file is meaningless if copied to another machine. This isn't meant to stop a determined attacker with root access — it prevents accidental plaintext exposure in backups, screenshots, or config file shares.
+SSH passwords are encrypted with AES-256-GCM before being written to the registry file. The encryption key is derived from the machine's identity (hostname + OS username), so the registry file is meaningless if copied to another machine. This isn't meant to stop a determined attacker with root access - it prevents accidental plaintext exposure in backups, screenshots, or config file shares.
 
 ### Connection Pooling with Idle Timeout
 
@@ -524,15 +778,15 @@ SSH connections are expensive to establish (TCP + key exchange + auth). The serv
 
 ### Base64 Prompt Encoding
 
-Prompts sent to remote members are base64-encoded before being passed through SSH. This sidesteps the shell escaping nightmare of nested quoting across SSH → bash → claude CLI, across different operating systems. The remote member decodes before passing to Claude.
+Prompts sent to remote members are base64-encoded before being passed through SSH. This sidesteps the shell escaping nightmare of nested quoting across SSH -> bash -> claude CLI, across different operating systems. The remote member decodes before passing to Claude.
 
 ### Session Persistence
 
-Each member stores an optional `sessionId` — a Claude conversation thread ID. When `resume=true` (the default), subsequent prompts continue the same conversation, so the remote Claude has full context of prior exchanges. Resetting a session is an explicit action, not an accident.
+Each member stores an optional `sessionId` - a Claude conversation thread ID. When `resume=true` (the default), subsequent prompts continue the same conversation, so the remote Claude has full context of prior exchanges. Resetting a session is an explicit action, not an accident.
 
 ### File-Based Registry
 
-All fleet state lives in `~/.apra-fleet/data/registry.json` — a single JSON file in the user's home directory. It's deliberately not in the project directory (won't be git-committed accidentally) and not in a database (no server to run, no migrations). For a fleet of dozens of members, JSON is more than sufficient.
+All fleet state lives in `~/.apra-fleet/data/registry.json` - a single JSON file in the user's home directory. It's deliberately not in the project directory (won't be git-committed accidentally) and not in a database (no server to run, no migrations). For a fleet of dozens of members, JSON is more than sufficient.
 
 ### Duplicate Folder Prevention
 
@@ -542,21 +796,21 @@ Two members cannot share the same working directory on the same device. For remo
 
 The tools break into natural groups. Each group has detailed documentation:
 
-**[Lifecycle](tools-lifecycle.md)** — `register_member`, `list_members`, `update_member`, `remove_member`, `shutdown_server`
+**[Lifecycle](tools-lifecycle.md)** - `register_member`, `list_members`, `update_member`, `remove_member`, `shutdown_server`
 Manage the fleet roster and server lifecycle. Registration validates connectivity, detects the OS, and checks that Claude CLI is available. Removal includes best-effort cleanup of auth credentials on the member.
 
-**[Work](tools-work.md)** — `send_files`, `execute_prompt`, `execute_command`, `reset_session`
+**[Work](tools-work.md)** - `send_files`, `execute_prompt`, `execute_command`, `reset_session`
 The core workflow. Push files to a member, run prompts against it, run shell commands directly, manage conversation sessions.
 
-**[Infrastructure](tools-infrastructure.md)** — `provision_llm_auth`, `setup_ssh_key`, `update_llm_cli`
+**[Infrastructure](tools-infrastructure.md)** - `provision_llm_auth`, `setup_ssh_key`, `update_llm_cli`
 One-time setup and maintenance. Provision auth (copy OAuth credentials or deploy API key for any provider), migrate from password to key auth, update the LLM CLI on members.
 
-**[Observability](tools-observability.md)** — `fleet_status`, `member_detail`
+**[Observability](tools-observability.md)** - `fleet_status`, `member_detail`
 Two-layer monitoring. `fleet_status` gives a quick summary table across all members with fleet-aware busy detection (distinguishes between Claude processes serving this member vs unrelated Claude activity). `member_detail` drills into one member with connectivity, CLI version, session state, and system resource metrics.
 
 ## Cross-Platform Support
 
-Members can run Windows, macOS, or Linux. The `platform.ts` utility generates the right shell commands for each OS — different commands for checking processes, reading memory, setting environment variables. The OS is auto-detected during registration (`uname -s` on Unix, `cmd /c ver` on Windows) and stored in the member record so subsequent tool calls don't need to re-detect.
+Members can run Windows, macOS, or Linux. The `platform.ts` utility generates the right shell commands for each OS - different commands for checking processes, reading memory, setting environment variables. The OS is auto-detected during registration (`uname -s` on Unix, `cmd /c ver` on Windows) and stored in the member record so subsequent tool calls don't need to re-detect.
   </doc>
 
   <doc title="Install" desc="Installation, uninstallation, and the --llm/--skill flags.">
diff --git a/progress.json b/progress.json
new file mode 100644
index 00000000..fb45488c
--- /dev/null
+++ b/progress.json
@@ -0,0 +1,31 @@
+{
+  "_schema": {
+    "type": "work | verify",
+    "status": "pending | completed | blocked"
+  },
+  "project": "apra-fleet-svc",
+  "plan_file": "PLAN.md",
+  "created": "2026-05-19",
+  "tasks": [
+    { "id": 1,  "step": "T1: Shutdown endpoint + service constants", "type": "work", "status": "completed", "tier": "cheap", "commit": "ef84f92", "notes": "POST /shutdown endpoint added to http-transport.ts; LOG_FILE_PATH constant added to paths.ts; service name constants created in service-manager/types.ts" },
+    { "id": 2,  "step": "T2: ServiceManager interface + factory", "type": "work", "status": "completed", "tier": "standard", "commit": "pending-pm-commit", "notes": "ServiceManager interface + ServiceStatus in types.ts; getServiceManager() factory + gracefulStopByServerJson + NoopServiceManager in index.ts. Fleet-dev was mid-task when Opus limit hit; PM committed the partial work to unblock build." },
+    { "id": 3,  "step": "T3: Windows Scheduled Task adapter", "type": "work", "status": "completed", "tier": "standard", "commit": "98115b9", "notes": "WindowsServiceManager: register writes wrapper.bat + schtasks /create; unregister deletes task + bat; start/stop/query/isInstalled via schtasks; gracefulStopByServerJson with taskkill fallback." },
+    { "id": 4,  "step": "T4: Linux systemd user unit adapter", "type": "work", "status": "completed", "tier": "standard", "commit": "93da1fa", "notes": "LinuxServiceManager: register writes systemd user unit + daemon-reload + enable + linger; unregister gracefully stops then disables; start/stop/query/isInstalled via systemctl --user; non-systemd detection throws clear error." },
+    { "id": 5,  "step": "T5: macOS launchd LaunchAgent adapter", "type": "work", "status": "completed", "tier": "standard", "commit": "1be25ed", "notes": "MacOSServiceManager: register writes plist + bootout (idempotent) + bootstrap; unregister bootout + remove plist; start/stop/query/isInstalled via launchctl; plist has KeepAlive.SuccessfulExit=false." },
+    { "id": 6,  "step": "T6: Service manager unit tests", "type": "work", "status": "completed", "tier": "standard", "commit": "490ead1", "notes": "40 tests covering all adapter methods and error paths. Fixed path-separator bug on Windows (backslash vs forward slash in existsSync mock) and macOS bootout mock order. All 40 pass." },
+    { "id": 7,  "step": "T6.5: MCP session capability logging (78g)", "type": "work", "status": "completed", "tier": "cheap", "commit": "224cd11", "notes": "Extracts clientInfo + capabilities from initialize body; logLine on onsessioninitialized (sid, name/version, caps, channel flag) and onsessionclosed (sid). logLine imported from utils/log-helpers.js." },
+    { "id": 8,  "step": "VERIFY: Platform Service Foundation", "type": "verify", "status": "completed", "commit": "a5fd844", "notes": "npm run build: clean. npm test: 85 files, 1365 passed, 13 skipped, 0 failed." },
+    { "id": 9,  "step": "T7: start and stop commands", "type": "work", "status": "completed", "tier": "cheap", "commit": "28d2732", "notes": "runStart: checkRunningInstance (idempotent), service manager if installed, else spawn detached to LOG_FILE_PATH; runStop: postShutdown, poll 5s, taskkill/SIGTERM fallback, cleanup server.json+lock. Both wired into index.ts dispatch." },
+    { "id": 10, "step": "T8: restart command", "type": "work", "status": "completed", "tier": "cheap", "commit": "f9dc2a0", "notes": "runRestart: calls runStop then runStart. Wired into index.ts dispatch." },
+    { "id": 11, "step": "T9: status command", "type": "work", "status": "completed", "tier": "standard", "commit": "653f265", "notes": "runStatus: checkRunningInstance + GET /health + getServiceManager().query(); formats State/PID/Port/URL/Version/Uptime/Sessions/Service; stopped state omits live fields. Wired into index.ts dispatch." },
+    { "id": 12, "step": "T10: CLI verb tests + --help update", "type": "work", "status": "completed", "tier": "standard", "commit": "37a28b6", "notes": "18 vitest tests: runStart (5), runStop (4), runRestart (2), runStatus (7). Spy-based fs/http mocks (vi.spyOn + restoreAllMocks) to avoid module-level factory leak in fileParallelism:false mode. --help updated with start/stop/restart/status verbs." },
+    { "id": 13, "step": "VERIFY: CLI Verbs", "type": "verify", "status": "completed", "commit": "37a28b6", "notes": "npm run build: clean. npm test: 86 files, 1383 passed, 13 skipped, 0 failed. All 18 new CLI verb tests green. Fixed factory-mock leakage in cli-verbs.test.ts (replaced vi.mock('node:fs',factory) with vi.spyOn to prevent registry.ts pollution in sequential test mode)." },
+    { "id": 14, "step": "T11: Extend install to register + start service", "type": "work", "status": "completed", "tier": "standard", "commit": "655bc5e", "notes": "Added getServiceManager + LOG_FILE_PATH imports. serviceStep = isSea() && http. baseSteps/totalSteps split; beads shows [baseSteps/totalSteps]. register+start at end; warns on failure. Service line in Done output." },
+    { "id": 15, "step": "T12: Extend uninstall to stop + remove service", "type": "work", "status": "completed", "tier": "standard", "commit": "655bc5e", "notes": "Replaced killApraFleet with svcMgr.stop() in --force path. Always calls svcMgr.unregister() (error swallowed) before file removal. isApraFleetRunning guard preserved for backward compat." },
+    { "id": 16, "step": "T13: Install/uninstall service integration tests", "type": "work", "status": "completed", "tier": "standard", "commit": "a4ec5e1", "notes": "13 tests: install register+start (SEA+HTTP), skip for stdio/dev-mode, step numbering, warn on failure; uninstall stop+unregister order, dry-run skip, idempotent error swallow, no stop when server not running." },
+    { "id": 17, "step": "VERIFY: Install/Uninstall Integration", "type": "verify", "status": "completed", "commit": "a4ec5e1", "notes": "npm run build: clean. npm test: 87 files, 1396 passed, 13 skipped, 0 failed. 13 new install-service tests + all prior tests green." },
+    { "id": 18, "step": "T14: Update README with service model + verbs", "type": "work", "status": "completed", "tier": "cheap", "commit": "b5038ae", "notes": "Added 'Service Mode' section between Transport and PM skill sections: explains singleton + OS service per platform, four verbs (start/stop/restart/status), and install/uninstall service registration." },
+    { "id": 19, "step": "T15: Update architecture docs with service manager", "type": "work", "status": "completed", "tier": "cheap", "commit": "12a313a", "notes": "Added 'Service Manager' section after Transport Layer: interface definition, platform adapters (Windows schtasks, Linux systemd --user, macOS launchctl), factory (getServiceManager), NoopServiceManager fallback, gracefulStopByServerJson." },
+    { "id": 20, "step": "VERIFY: Documentation", "type": "verify", "status": "completed", "commit": "", "notes": "npm run build: clean. npm test: 87 files, 1396 passed, 13 skipped, 0 failed. All prior tests green." }
+  ]
+}
diff --git a/requirements.md b/requirements.md
new file mode 100644
index 00000000..bfd9421e
--- /dev/null
+++ b/requirements.md
@@ -0,0 +1,98 @@
+# Requirements -- apra-fleet OS Service Lifecycle
+
+## Source
+Follow-up to apra-fleet#258 / PR #273 (HTTP+SSE transport). Closes the live-test gap
+filed as Beads apra-fleet-projects-jxj: the HTTP-transport install configures MCP clients
+but nothing starts or registers the singleton server, so every install fails on first
+connect (-32000) and again after every reboot.
+
+## Base Branch
+`feat/mcp-sse-transport` -- this work EXTENDS PR #273 (user decision 2026-05-19). Commit
+directly onto that branch; no new branch. PR #273 stays open until this lands too, so
+#273 ships a complete, self-installing HTTP transport.
+
+## Goal
+Make `apra-fleet` behave like a normal OS service: a small set of regular verbs to
+install, start, stop, restart, check, and uninstall the singleton HTTP+SSE MCP server,
+working uniformly on Windows, Linux, and macOS without requiring admin/root.
+
+## Key Decisions (user, 2026-05-19)
+1. **Land on PR #273** -- extend the existing branch, not a separate PR.
+2. **Top-level verbs** -- `apra-fleet start | stop | restart | status` are top-level
+   commands. Service registration/removal folds into the EXISTING `apra-fleet install`
+   and `apra-fleet uninstall` (no separate `service` subcommand group).
+3. **Per-user scope, no elevation** -- the service is registered and runs as the current
+   user. No admin/root, no UAC prompt. One service per logged-in user.
+
+## Scope
+
+### Verbs
+- `apra-fleet start` -- start the singleton HTTP server if not already running. Idempotent
+  (if already running, report that and exit 0). Goes through the OS service manager when a
+  service unit is installed; otherwise starts the process directly.
+- `apra-fleet stop` -- stop the running server gracefully (SIGTERM/equivalent -> server
+  cleans up server.json, lock file, sockets). Idempotent.
+- `apra-fleet restart` -- stop then start.
+- `apra-fleet status` -- report running/stopped, pid, port, url, version, uptime, active
+  session count (query GET /health), and whether the service unit is installed. Must work
+  whether or not the service unit is installed.
+- `apra-fleet install` -- EXTENDED: after installing the binary and writing MCP client
+  config (existing behaviour), also register the per-user service unit and start it, so
+  the server is live immediately and on every login. ASCII-only output.
+- `apra-fleet uninstall` -- EXTENDED: stop the server, remove the service unit, and remove
+  the MCP client config. Fully reverses `install`, leaving no orphaned unit or config.
+
+### Per-OS service mechanism (per-user, no elevation)
+- **Windows:** per-user Scheduled Task (schtasks) with an at-logon trigger, startable and
+  stoppable on demand. No Windows Service / SCM (that needs admin). Built-in tooling only.
+- **Linux:** systemd user unit at `~/.config/systemd/user/apra-fleet.service`, managed via
+  `systemctl --user`. Plan must address start-on-boot-without-login (loginctl
+  enable-linger) and a graceful fallback/clear error if systemd user mode is unavailable.
+- **macOS:** launchd LaunchAgent plist at `~/Library/LaunchAgents/<label>.plist`, managed
+  via `launchctl` (bootstrap/bootout/kickstart), RunAtLoad for start-on-login.
+
+The plan MUST explicitly walk through all three OSes for every verb -- no
+"and similarly for X". Each verb x each OS is a defined behaviour.
+
+## Cross-cutting requirements
+- The service invokes the INSTALLED binary at its stable path (e.g.
+  `~/.apra-fleet/bin/apra-fleet.exe`), never a transient build path.
+- The service runs the server in HTTP transport mode (the singleton on port 7523 /
+  fallback). Interplays correctly with the existing server.json, the singleton startup
+  lock, and `--transport`.
+- Service stdout/stderr is redirected to a log file under the fleet data dir.
+- `start`/`stop`/`install`/`uninstall` are all idempotent -- safe to run twice.
+- Graceful stop: the server's existing SIGINT/SIGTERM handlers must fire so server.json
+  and the lock file are cleaned up.
+- `status` is useful even with no service unit installed (read server.json + GET /health).
+- All command output is ASCII-only (pre-commit hook); cross-platform with no platform or
+  provider assumptions in shared code.
+
+## Out of Scope
+- System-wide service (Windows SCM service, systemd system unit, launchd LaunchDaemon) --
+  excluded by the per-user decision; would require elevation.
+- TLS / auth on the HTTP endpoint; remote (non-localhost) serving -- separate follow-up.
+- Auto-update / self-update of the running service.
+
+## Constraints
+- Per-user only -- no command in this sprint may require admin/root or trigger UAC.
+- Cross-platform: Windows / Linux / macOS. Built-in OS tooling preferred (schtasks,
+  systemctl --user, launchctl) over new npm dependencies.
+- ASCII-only in all committed files (pre-commit hook).
+- No regression to the existing `install` MCP-config behaviour or the stdio transport.
+- Extends PR #273 -- all existing #258 acceptance criteria must still hold.
+
+## Acceptance Criteria
+- [ ] `apra-fleet install` (default) registers a per-user service and the server is
+      running immediately afterwards -- a fresh MCP client connects with no manual step.
+- [ ] The server comes back automatically after a reboot / re-login, on all three OSes.
+- [ ] `apra-fleet start | stop | restart | status` work as described, idempotently, on
+      Windows, Linux, and macOS.
+- [ ] `apra-fleet status` reports pid/port/url/version/uptime/sessions and service-unit
+      state, and works whether or not the unit is installed.
+- [ ] `apra-fleet uninstall` stops the server and removes the service unit and MCP config
+      with nothing orphaned.
+- [ ] No elevation/admin/root or UAC prompt is required by any verb.
+- [ ] Tests cover the verb logic and the per-OS service-manager adapter; full existing
+      suite stays green; pre-commit ASCII hook passes.
+- [ ] Docs (README + docs/architecture.md) updated for the service model and verbs.
diff --git a/src/cli/install.ts b/src/cli/install.ts
index bf2c1ff7..c40b6b69 100644
--- a/src/cli/install.ts
+++ b/src/cli/install.ts
@@ -4,6 +4,8 @@ import os from 'node:os';
 import { execSync, execFileSync } from 'node:child_process';
 import { serverVersion } from '../version.js';
 import type { LlmProvider } from '../types.js';
+import { DEFAULT_PORT, LOG_FILE_PATH } from '../paths.js';
+import { getServiceManager } from '../services/service-manager/index.js';
 import {
   BIN_DIR,
   HOOKS_DIR,
@@ -298,10 +300,14 @@ function mergeCopilotConfig(paths: ProviderInstallConfig, mcpConfig: any): void
 function mergeCodexConfig(paths: ProviderInstallConfig, mcpConfig: any): void {
   const settings = readConfig(paths);
   settings.mcp_servers = settings.mcp_servers || {};
-  settings.mcp_servers['apra-fleet'] = {
-    command: mcpConfig.command.replace(/\\/g, '/'),
-    args: mcpConfig.args.map((a: string) => a.replace(/\\/g, '/')),
-  };
+  if (mcpConfig.url) {
+    settings.mcp_servers['apra-fleet'] = { url: mcpConfig.url };
+  } else {
+    settings.mcp_servers['apra-fleet'] = {
+      command: mcpConfig.command.replace(/\\/g, '/'),
+      args: mcpConfig.args.map((a: string) => a.replace(/\\/g, '/')),
+    };
+  }
 
   writeConfig(paths, settings);
 }
@@ -379,6 +385,8 @@ Usage:
   apra-fleet install --no-skill        Same as --skill none
   apra-fleet install --force           Stop a running server before installing
   apra-fleet install --llm <provider>  Target LLM provider: claude (default), gemini, codex, copilot, agy
+  apra-fleet install --transport http  Register MCP server with HTTP transport (default)
+  apra-fleet install --transport stdio Register MCP server with stdio transport (legacy)
   apra-fleet install --help            Show this help
 
 Options:
@@ -386,6 +394,8 @@ Options:
                           Defaults to claude. Note: --llm gemini shows a warning about sequential
                           dispatch — Gemini does not support background agents, so fleet operations
                           run sequentially rather than in parallel.
+  --transport <mode>      MCP transport to use: http (default) or stdio. HTTP uses the singleton
+                          fleet server at http://localhost:7523/mcp. stdio runs fleet as a subprocess.
   --skill <mode>          Which skills to install: all (default), fleet, pm, or none.
   --no-skill              Alias for --skill none.
   --force                 Stop a running apra-fleet server before installing (SEA mode only).`);
@@ -446,9 +456,34 @@ Options:
   // Parse --force flag
   const force = args.includes('--force');
 
+  // Parse --transport flag (default: http)
+  type TransportMode = 'http' | 'stdio';
+  let transport: TransportMode = 'http';
+  const transportEqualArg = args.find(a => a.startsWith('--transport='));
+  if (transportEqualArg) {
+    const val = transportEqualArg.split('=')[1];
+    if (val === 'http' || val === 'stdio') {
+      transport = val;
+    } else {
+      console.error(`Error: --transport value must be one of: http, stdio (got "${val}")`);
+      process.exit(1);
+    }
+  } else {
+    const transportIdx = args.indexOf('--transport');
+    if (transportIdx >= 0 && transportIdx < args.length - 1) {
+      const val = args[transportIdx + 1];
+      if (val === 'http' || val === 'stdio') {
+        transport = val;
+      } else {
+        console.error(`Error: --transport value must be one of: http, stdio (got "${val}")`);
+        process.exit(1);
+      }
+    }
+  }
+
   // Reject unknown flags to catch typos early
-  const knownFlagPrefixes = ['--llm=', '--skill='];
-  const knownFlagExact = new Set(['--llm', '--skill', '--no-skill', '--force', '--help', '-h']);
+  const knownFlagPrefixes = ['--llm=', '--skill=', '--transport='];
+  const knownFlagExact = new Set(['--llm', '--skill', '--no-skill', '--force', '--transport', '--help', '-h']);
   for (const a of args) {
     if (knownFlagExact.has(a)) continue;
     if (knownFlagPrefixes.some(p => a.startsWith(p))) continue;
@@ -459,7 +494,9 @@ Options:
 
   const installFleet = skillMode === 'fleet' || skillMode === 'pm' || skillMode === 'all';
   const installPm = skillMode === 'pm' || skillMode === 'all';
-  const totalSteps = (installFleet && installPm) ? 8 : installFleet ? 7 : installPm ? 8 : 6;
+  const serviceStep = isSea() && transport === 'http';
+  const baseSteps = (installFleet && installPm) ? 8 : installFleet ? 7 : installPm ? 8 : 6;
+  const totalSteps = baseSteps + (serviceStep ? 1 : 0);
 
   if (llm === 'gemini' && (installFleet || installPm)) {
     console.warn(`\n⚠ Note: Gemini does not support background agents. If you plan to use Gemini as the\n  PM/orchestrator, fleet operations will run sequentially (no parallel dispatch).\n  For best orchestration performance, consider using Claude. See docs for details.\n`);
@@ -545,27 +582,47 @@ ${killHint}
   // --- Step 5: Register MCP server ---
   console.log(`  [5/${totalSteps}] Registering MCP server...`);
 
-  const mcpConfig = isSea() 
-    ? { command: binaryPath, args: [] }
-    : { command: 'node', args: [path.join(findProjectRoot(), 'dist', 'index.js')] };
+  const fleetPort = DEFAULT_PORT;
+  const fleetUrl = `http://localhost:${fleetPort}/mcp`;
 
-  if (llm === 'claude') {
-    try {
-      run('claude mcp remove apra-fleet --scope user', { stdio: 'ignore' });
-    } catch { /* not registered */ }
-    
-    const cmd = mcpConfig.command === 'node' 
-      ? `claude mcp add --scope user apra-fleet -- node "${mcpConfig.args[0]}"`
-      : `claude mcp add --scope user apra-fleet -- "${mcpConfig.command}"`;
-    run(cmd);
-  } else if (llm === 'gemini') {
-    mergeGeminiConfig(paths, mcpConfig);
-  } else if (llm === 'codex') {
-    mergeCodexConfig(paths, mcpConfig);
-  } else if (llm === 'copilot') {
-    mergeCopilotConfig(paths, mcpConfig);
-  } else if (llm === 'agy') {
-    mergeAgyConfig(paths, mcpConfig);
+  if (transport === 'http') {
+    if (llm === 'claude') {
+      try {
+        run('claude mcp remove apra-fleet --scope user', { stdio: 'ignore' });
+      } catch { /* not registered */ }
+      run(`claude mcp add --scope user --transport http apra-fleet ${fleetUrl}`);
+    } else if (llm === 'gemini') {
+      mergeGeminiConfig(paths, { httpUrl: fleetUrl });
+    } else if (llm === 'codex') {
+      mergeCodexConfig(paths, { url: fleetUrl });
+    } else if (llm === 'copilot') {
+      mergeCopilotConfig(paths, { url: fleetUrl, type: 'http' });
+    } else if (llm === 'agy') {
+      mergeAgyConfig(paths, { url: fleetUrl });
+    }
+  } else {
+    const mcpConfig = isSea()
+      ? { command: binaryPath, args: [] }
+      : { command: 'node', args: [path.join(findProjectRoot(), 'dist', 'index.js')] };
+
+    if (llm === 'claude') {
+      try {
+        run('claude mcp remove apra-fleet --scope user', { stdio: 'ignore' });
+      } catch { /* not registered */ }
+
+      const cmd = mcpConfig.command === 'node'
+        ? `claude mcp add --scope user apra-fleet -- node "${mcpConfig.args[0]}"`
+        : `claude mcp add --scope user apra-fleet -- "${mcpConfig.command}"`;
+      run(cmd);
+    } else if (llm === 'gemini') {
+      mergeGeminiConfig(paths, mcpConfig);
+    } else if (llm === 'codex') {
+      mergeCodexConfig(paths, mcpConfig);
+    } else if (llm === 'copilot') {
+      mergeCopilotConfig(paths, mcpConfig);
+    } else if (llm === 'agy') {
+      mergeAgyConfig(paths, mcpConfig);
+    }
   }
 
   // --- Step 6: Install fleet skill (optional) ---
@@ -612,7 +669,7 @@ ${killHint}
   // --- Step 8: Install Beads task tracker ---
   // shell:true required on Windows — npm global packages install as .cmd wrappers
   // that cannot be directly spawned by Node without a shell
-  console.log(`  [${totalSteps}/${totalSteps}] Installing Beads task tracker...`);
+  console.log(`  [${baseSteps}/${totalSteps}] Installing Beads task tracker...`);
   try {
     // Check if already installed
     try {
@@ -633,6 +690,25 @@ ${killHint}
   // Write install-config.json (merge provider entry)
   writeInstallConfig(llm, skillMode);
 
+  // --- Step N: Register and start service (SEA + HTTP mode only) ---
+  let serviceRegistered = false;
+  if (serviceStep) {
+    console.log(`  [${totalSteps}/${totalSteps}] Registering and starting service...`);
+    const svcMgr = await getServiceManager();
+    try {
+      await svcMgr.register(binaryPath, ['--transport', 'http'], LOG_FILE_PATH);
+      try {
+        await svcMgr.start();
+        serviceRegistered = true;
+      } catch (startErr) {
+        try { await svcMgr.unregister(); } catch {}
+        throw startErr;
+      }
+    } catch (err) {
+      console.warn(`    Service registration skipped: ${(err as Error).message}`);
+    }
+  }
+
   // --- Done ---
   let beadsVersion = 'installed';
   try {
@@ -645,13 +721,14 @@ ${killHint}
   const clientName = llm === 'claude' ? 'Claude Code' : paths.name;
   const instructions = llm === 'claude' ? 'Run /mcp in Claude Code to load the server.' : `Restart ${paths.name} to load the server.`;
   const forceNote = force ? `\nRestart ${clientName} to reload the MCP server.` : '';
+  const serviceLine = serviceStep ? `\n  Service:     ${serviceRegistered ? 'registered and running' : 'registration skipped'}` : '';
   console.log(`
 Apra Fleet ${serverVersion} installed successfully for ${paths.name}.
   Binary:      ${BIN_DIR}
   Hooks:       ${HOOKS_DIR}
   Scripts:     ${SCRIPTS_DIR}
   Settings:    ${paths.settingsFile}${installFleet ? `\n  Fleet Skill: ${paths.fleetSkillsDir}` : ''}${installPm ? `\n  PM Skill:    ${paths.skillsDir}` : ''}
-  Beads:       ${beadsVersion}
+  Beads:       ${beadsVersion}${serviceLine}
 
 ${instructions}${forceNote}
 `);
diff --git a/src/cli/restart.ts b/src/cli/restart.ts
new file mode 100644
index 00000000..e8fb1556
--- /dev/null
+++ b/src/cli/restart.ts
@@ -0,0 +1,7 @@
+import { runStop } from './stop.js';
+import { runStart } from './start.js';
+
+export async function runRestart(args: string[]): Promise<void> {
+  await runStop(args);
+  await runStart(args);
+}
diff --git a/src/cli/start.ts b/src/cli/start.ts
new file mode 100644
index 00000000..0e192cfd
--- /dev/null
+++ b/src/cli/start.ts
@@ -0,0 +1,75 @@
+import fs from 'node:fs';
+import path from 'node:path';
+import { spawn } from 'node:child_process';
+import { fileURLToPath } from 'url';
+import { dirname } from 'path';
+import { checkRunningInstance } from '../services/singleton.js';
+import { getServiceManager } from '../services/service-manager/index.js';
+import { LOG_FILE_PATH, FLEET_DIR } from '../paths.js';
+import { BIN_DIR } from './config.js';
+
+const __filename = fileURLToPath(import.meta.url);
+const __dirname = dirname(__filename);
+
+function isSea(): boolean {
+  try {
+    const sea = require('node:sea');
+    return sea.isSea();
+  } catch {
+    return false;
+  }
+}
+
+function findProjectRoot(): string {
+  let dir = __dirname;
+  for (let i = 0; i < 5; i++) {
+    if (fs.existsSync(path.join(dir, 'version.json'))) return dir;
+    dir = path.dirname(dir);
+  }
+  throw new Error('Cannot find project root (version.json not found)');
+}
+
+export async function runStart(_args: string[]): Promise<void> {
+  const instance = await checkRunningInstance();
+  if (instance.running) {
+    console.log(`Server already running at ${instance.url} pid=${instance.pid}`);
+    return;
+  }
+
+  const svcMgr = await getServiceManager();
+  const installed = await svcMgr.isInstalled();
+
+  if (installed) {
+    await svcMgr.start();
+    console.log('Server starting via service manager...');
+  } else {
+    let cmd: string;
+    let spawnArgs: string[];
+    if (isSea()) {
+      const ext = process.platform === 'win32' ? '.exe' : '';
+      cmd = path.join(BIN_DIR, `apra-fleet${ext}`);
+      spawnArgs = ['--transport', 'http'];
+    } else {
+      cmd = process.execPath;
+      spawnArgs = [path.join(findProjectRoot(), 'dist', 'index.js'), '--transport', 'http'];
+    }
+    fs.mkdirSync(FLEET_DIR, { recursive: true });
+    const logFd = fs.openSync(LOG_FILE_PATH, 'a');
+    const child = spawn(cmd, spawnArgs, {
+      detached: true,
+      stdio: ['ignore', logFd, logFd],
+    });
+    child.unref();
+    fs.closeSync(logFd);
+    console.log('Server starting...');
+  }
+
+  await new Promise<void>(resolve => setTimeout(resolve, 2000));
+  const result = await checkRunningInstance();
+  if (result.running) {
+    console.log(`Server started at ${result.url} pid=${result.pid}`);
+  } else {
+    console.error(`Server did not start in time. Check logs at: ${LOG_FILE_PATH}`);
+    process.exit(1);
+  }
+}
diff --git a/src/cli/status.ts b/src/cli/status.ts
new file mode 100644
index 00000000..d91e02f9
--- /dev/null
+++ b/src/cli/status.ts
@@ -0,0 +1,86 @@
+import fs from 'node:fs';
+import http from 'node:http';
+import { checkRunningInstance } from '../services/singleton.js';
+import { getServiceManager } from '../services/service-manager/index.js';
+import type { ServiceStatus } from '../services/service-manager/types.js';
+import { SERVER_INFO_PATH } from '../paths.js';
+
+interface HealthResponse {
+  version?: string;
+  uptime?: number;
+  sessions?: number;
+}
+
+function getHealth(url: string): Promise<HealthResponse | null> {
+  const healthUrl = url.replace(/\/mcp$/, '/health');
+  const parsed = new URL(healthUrl);
+  return new Promise((resolve) => {
+    const req = http.get(
+      { hostname: parsed.hostname, port: Number(parsed.port), path: parsed.pathname, timeout: 3000 },
+      (res) => {
+        const chunks: Buffer[] = [];
+        res.on('data', (c: Buffer) => chunks.push(c));
+        res.on('end', () => {
+          try { resolve(JSON.parse(Buffer.concat(chunks).toString('utf8'))); }
+          catch { resolve(null); }
+        });
+      },
+    );
+    req.on('error', () => resolve(null));
+    req.on('timeout', () => { req.destroy(); resolve(null); });
+  });
+}
+
+function formatUptime(seconds: number): string {
+  const h = Math.floor(seconds / 3600);
+  const m = Math.floor((seconds % 3600) / 60);
+  const s = seconds % 60;
+  const parts: string[] = [];
+  if (h > 0) parts.push(`${h}h`);
+  if (m > 0) parts.push(`${m}m`);
+  parts.push(`${s}s`);
+  return parts.join(' ');
+}
+
+function readServerInfo(): { pid?: number; port?: number; url?: string } {
+  try {
+    return JSON.parse(fs.readFileSync(SERVER_INFO_PATH, 'utf8'));
+  } catch {
+    return {};
+  }
+}
+
+export async function runStatus(_args: string[]): Promise<void> {
+  const instance = await checkRunningInstance();
+  const svcMgr = await getServiceManager();
+  const svcStatus: ServiceStatus = await svcMgr.query().catch(() => ({ installed: false, running: false }));
+
+  let serviceLabel: string;
+  if (!svcStatus.installed) {
+    serviceLabel = 'not installed';
+  } else if (svcStatus.enabled) {
+    serviceLabel = 'installed (enabled)';
+  } else {
+    serviceLabel = 'installed (disabled)';
+  }
+
+  if (!instance.running) {
+    console.log('apra-fleet status');
+    console.log(`  State:    stopped`);
+    console.log(`  Service:  ${serviceLabel}`);
+    return;
+  }
+
+  const info = readServerInfo();
+  const health = await getHealth(instance.url);
+
+  console.log('apra-fleet status');
+  console.log(`  State:    running`);
+  if (info.pid) console.log(`  PID:      ${info.pid}`);
+  if (info.port) console.log(`  Port:     ${info.port}`);
+  console.log(`  URL:      ${instance.url}`);
+  if (health?.version) console.log(`  Version:  ${health.version}`);
+  if (health?.uptime !== undefined) console.log(`  Uptime:   ${formatUptime(health.uptime)}`);
+  if (health?.sessions !== undefined) console.log(`  Sessions: ${health.sessions}`);
+  console.log(`  Service:  ${serviceLabel}`);
+}
diff --git a/src/cli/stop.ts b/src/cli/stop.ts
new file mode 100644
index 00000000..1c272bea
--- /dev/null
+++ b/src/cli/stop.ts
@@ -0,0 +1,44 @@
+import fs from 'node:fs';
+import path from 'node:path';
+import { execFileSync } from 'node:child_process';
+import { checkRunningInstance } from '../services/singleton.js';
+import { SERVER_INFO_PATH, FLEET_DIR } from '../paths.js';
+import { getServiceManager } from '../services/service-manager/index.js';
+import { isPidAlive, postShutdown } from '../utils/process-utils.js';
+
+export async function runStop(_args: string[]): Promise<void> {
+  const svcMgr = await getServiceManager();
+  if (await svcMgr.isInstalled()) {
+    await svcMgr.stop();
+    console.log('Server stopped.');
+    return;
+  }
+
+  const instance = await checkRunningInstance();
+  if (!instance.running) {
+    console.log('Server is not running.');
+    return;
+  }
+
+  const { pid, url } = instance;
+  await postShutdown(url);
+
+  const deadline = Date.now() + 5000;
+  while (isPidAlive(pid) && Date.now() < deadline) {
+    await new Promise<void>(resolve => setTimeout(resolve, 500));
+  }
+
+  if (isPidAlive(pid)) {
+    if (process.platform === 'win32') {
+      try { execFileSync('taskkill', ['/F', '/PID', String(pid)]); } catch {}
+    } else {
+      try { process.kill(pid, 'SIGKILL'); } catch {}
+    }
+  }
+
+  const lockPath = path.join(FLEET_DIR, 'server.lock');
+  try { fs.unlinkSync(SERVER_INFO_PATH); } catch {}
+  try { fs.unlinkSync(lockPath); } catch {}
+
+  console.log('Server stopped.');
+}
diff --git a/src/cli/uninstall.ts b/src/cli/uninstall.ts
index fecf6bf6..5d90ac13 100644
--- a/src/cli/uninstall.ts
+++ b/src/cli/uninstall.ts
@@ -5,7 +5,8 @@ import { execSync } from 'node:child_process';
 import * as readlinePromises from 'node:readline/promises';
 import { serverVersion } from '../version.js';
 import type { LlmProvider } from '../types.js';
-import { isApraFleetRunning, killApraFleet } from './install.js';
+import { isApraFleetRunning } from './install.js';
+import { getServiceManager } from '../services/service-manager/index.js';
 import {
   BIN_DIR,
   HOOKS_DIR,
@@ -224,13 +225,16 @@ Options:
 
   console.log(`\nUninstalling Apra Fleet ${serverVersion}...${dryRun ? ' (DRY RUN)' : ''}\n`);
 
+  const svcMgr = await getServiceManager();
+
   if (isApraFleetRunning()) {
     if (dryRun && force) {
       console.log('  Note: apra-fleet server is currently running (would be stopped by --force).');
     } else if (force) {
-      killApraFleet();
-      await new Promise(resolve => setTimeout(resolve, 500));
-      console.log('  Stopped running server.');
+      if (!dryRun) {
+        try { await svcMgr.stop(); } catch {}
+        console.log('  Stopped running server.');
+      }
     } else {
       console.error('Error: apra-fleet server is currently running.\n\n  Run with --force to stop it automatically:\n    apra-fleet uninstall --force\n');
       process.exit(1);
@@ -238,6 +242,11 @@ Options:
     }
   }
 
+  // Remove service unit (idempotent -- tolerates "not installed")
+  if (!dryRun) {
+    try { await svcMgr.unregister(); } catch {}
+  }
+
   const installConfig = readInstallConfig();
   const recordedProviders = Object.keys(installConfig.providers) as LlmProvider[];
   const isFallback = recordedProviders.length === 0;
diff --git a/src/index.ts b/src/index.ts
index 2b2a3cb5..ea7fe76a 100644
--- a/src/index.ts
+++ b/src/index.ts
@@ -1,5 +1,6 @@
 #!/usr/bin/env node
 
+import fs from 'node:fs';
 import { serverVersion } from './version.js';
 import { logLine, logError } from './utils/log-helpers.js';
 
@@ -15,13 +16,20 @@ if (arg === '--help' || arg === '-h') {
   console.log(`apra-fleet ${serverVersion}
 
 Usage:
-  apra-fleet                  Start MCP server (stdio)
+  apra-fleet                         Start MCP server (HTTP, default)
+  apra-fleet --transport http        Start MCP server (HTTP)
+  apra-fleet --transport stdio       Start MCP server (stdio)
+  apra-fleet --stdio                 Start MCP server (stdio, alias for --transport stdio)
+  apra-fleet start                    Start the fleet server
+  apra-fleet stop                     Stop the fleet server
+  apra-fleet restart                  Restart the fleet server
+  apra-fleet status                   Show server and service status
   apra-fleet update           Check for and install latest update
   apra-fleet update --check   Check for update
   apra-fleet install                   Install binary + hooks + statusline + MCP + fleet & PM skills (default)
   apra-fleet install --skill all       Same as bare install (all skills)
   apra-fleet install --skill fleet     Install fleet skill only
-  apra-fleet install --skill pm        Install PM skill (also installs fleet — PM depends on fleet)
+  apra-fleet install --skill pm        Install PM skill (also installs fleet -- PM depends on fleet)
   apra-fleet install --skill none      Skip skill installation
   apra-fleet install --no-skill        Same as --skill none
   apra-fleet uninstall                 Remove binary, hooks, and MCP registration
@@ -84,54 +92,65 @@ Usage:
       .then(m => m.runUpdate())
       .catch(err => { logError('cli', `Update failed: ${err.message}`); process.exit(1); });
   }
-} else if (arg === undefined || arg === '--stdio') {
-  // Default: start MCP server
-  startServer();
+} else if (arg === 'start') {
+  import('./cli/start.js')
+    .then(m => m.runStart(process.argv.slice(3)))
+    .catch(err => { logError('cli', `Start failed: ${err.message}`); process.exit(1); });
+} else if (arg === 'stop') {
+  import('./cli/stop.js')
+    .then(m => m.runStop(process.argv.slice(3)))
+    .catch(err => { logError('cli', `Stop failed: ${err.message}`); process.exit(1); });
+} else if (arg === 'restart') {
+  import('./cli/restart.js')
+    .then(m => m.runRestart(process.argv.slice(3)))
+    .catch(err => { logError('cli', `Restart failed: ${err.message}`); process.exit(1); });
+} else if (arg === 'status') {
+  import('./cli/status.js')
+    .then(m => m.runStatus(process.argv.slice(3)))
+    .catch(err => { logError('cli', `Status failed: ${err.message}`); process.exit(1); });
+} else if (arg === undefined || arg === '--stdio' || arg === '--transport') {
+  // Server startup: parse transport flag
+  const transport = resolveTransport(process.argv.slice(2));
+  if (transport === 'invalid') {
+    const val = process.argv[3];
+    console.error(`Error: invalid --transport value '${val}'. Use 'http' or 'stdio'.`);
+    process.exit(1);
+  }
+  if (transport === 'stdio') {
+    startStdioServer();
+  } else {
+    startHttpServer();
+  }
 } else {
   console.error(`Error: unknown option '${arg}'`);
   console.error(`\nRun 'apra-fleet --help' for usage.`);
   process.exit(1);
 }
 
-async function startServer() {
+function resolveTransport(args: string[]): 'http' | 'stdio' | 'invalid' {
+  if (args.length === 0) return 'http';
+  if (args[0] === '--stdio') return 'stdio';
+  if (args[0] === '--transport') {
+    const val = args[1];
+    if (val === 'http') return 'http';
+    if (val === 'stdio') return 'stdio';
+    return 'invalid';
+  }
+  return 'invalid';
+}
+
+async function startStdioServer() {
   const { McpServer } = await import('@modelcontextprotocol/sdk/server/mcp.js');
   const { StdioServerTransport } = await import('@modelcontextprotocol/sdk/server/stdio.js');
 
   // Load onboarding state once at server startup (in-memory singleton)
-  const { loadOnboardingState, resetSessionFlags, getFirstRunPreamble, isJsonResponse, isActiveTool, getOnboardingNudge, getWelcomeBackPreamble } = await import('./services/onboarding.js');
+  const { loadOnboardingState, resetSessionFlags } = await import('./services/onboarding.js');
   const { VERBATIM_INSTRUCTIONS } = await import('./onboarding/text.js');
   const { getAllAgents: getAgentsForStartup } = await import('./services/registry.js');
-  // Pass current member count so upgrade detection works: existing registry + no onboarding.json → skip banner
+  // Pass current member count so upgrade detection works: existing registry + no onboarding.json -> skip banner
   loadOnboardingState(getAgentsForStartup().length);
   resetSessionFlags();
 
-  // Tool schemas and handlers
-  const { registerMemberSchema, registerMember } = await import('./tools/register-member.js');
-  const { listMembersSchema, listMembers } = await import('./tools/list-members.js');
-  const { removeMemberSchema, removeMember } = await import('./tools/remove-member.js');
-  const { updateMemberSchema, updateMember } = await import('./tools/update-member.js');
-  const { sendFilesSchema, sendFiles } = await import('./tools/send-files.js');
-  const { receiveFilesSchema, receiveFiles } = await import('./tools/receive-files.js');
-  const { executePromptSchema, executePrompt } = await import('./tools/execute-prompt.js');
-  const { executeCommandSchema, executeCommand } = await import('./tools/execute-command.js');
-  const { provisionAuthSchema, provisionAuth } = await import('./tools/provision-auth.js');
-  const { setupSSHKeySchema, setupSSHKey } = await import('./tools/setup-ssh-key.js');
-  const { setupGitAppSchema, setupGitApp } = await import('./tools/setup-git-app.js');
-  const { provisionVcsAuthSchema, provisionVcsAuth } = await import('./tools/provision-vcs-auth.js');
-  const { revokeVcsAuthSchema, revokeVcsAuth } = await import('./tools/revoke-vcs-auth.js');
-  const { fleetStatusSchema, fleetStatus } = await import('./tools/check-status.js');
-  const { memberDetailSchema, memberDetail } = await import('./tools/member-detail.js');
-  const { updateAgentCliSchema, updateAgentCli } = await import('./tools/update-agent-cli.js');
-  const { shutdownServerSchema, shutdownServer } = await import('./tools/shutdown-server.js');
-  const { composePermissionsSchema, composePermissions } = await import('./tools/compose-permissions.js');
-  const { cloudControlSchema, cloudControl } = await import('./tools/cloud-control.js');
-  const { monitorTaskSchema, monitorTask } = await import('./tools/monitor-task.js');
-  const { stopPromptSchema, stopPrompt } = await import('./tools/stop-prompt.js');
-  const { versionSchema, version } = await import('./tools/version.js');
-  const { credentialStoreSetSchema, credentialStoreSet } = await import('./tools/credential-store-set.js');
-  const { credentialStoreListSchema, credentialStoreList } = await import('./tools/credential-store-list.js');
-  const { credentialStoreDeleteSchema, credentialStoreDelete } = await import('./tools/credential-store-delete.js');
-  const { credentialStoreUpdateSchema, credentialStoreUpdate } = await import('./tools/credential-store-update.js');
   const { closeAllConnections } = await import('./services/ssh.js');
   const { idleManager } = await import('./services/cloud/idle-manager.js');
   const { cleanupStaleTasks } = await import('./services/task-cleanup.js');
@@ -139,7 +158,7 @@ async function startServer() {
   const { purgeExpiredCredentials } = await import('./services/credential-store.js');
   const { getStallDetector } = await import('./services/stall/index.js');
 
-  // serverVersion is "v0.0.1_abc123" — strip 'v' prefix for semver-like version field
+  // serverVersion is "v0.0.1_abc123" -- strip 'v' prefix for semver-like version field
   const versionNum = serverVersion.startsWith('v') ? serverVersion.slice(1) : serverVersion;
 
   let capturedClientInfo: any = null;
@@ -161,108 +180,9 @@ async function startServer() {
     };
   }
 
-  // --- Onboarding helpers ---
-  // isActiveTool guards passive tools (version, shutdown_server) from consuming the banner.
-  // First-run banner bypasses the JSON check — passive guard is sufficient protection.
-  // Welcome-back and nudges still respect the JSON check.
-
-  async function sendOnboardingNotification(srv: typeof server, text: string): Promise<void> {
-    try {
-      await srv.server.sendLoggingMessage({
-        level: 'info',
-        logger: 'apra-fleet-onboarding',
-        data: text,
-      });
-    } catch (e: unknown) {
-      const msg = (e instanceof Error ? e.message : String(e));
-      if (!/logging|method not found|not supported/i.test(msg)) {
-        process.stderr.write(`[apra-fleet] onboarding notification failed: ${msg}\n`);
-      }
-    }
-  }
-
-  function sanitizeToolResult(s: string): string {
-    return s.replace(/<\/?apra-fleet-display[^>]*(?:>|$)/gi, '[tag-stripped]');
-  }
-
-  function getOnboardingPreamble(toolName: string, isJson: boolean): string | null {
-    if (!isActiveTool(toolName)) return null;
-    // First-run banner always shows regardless of response format
-    const banner = getFirstRunPreamble();
-    if (banner) return banner;
-    // Welcome-back still respects JSON check
-    if (isJson) return null;
-    return getWelcomeBackPreamble();
-  }
-
-  function wrapTool(toolName: string, handler: (input: any, extra?: any) => Promise<string>) {
-    return async (input: any, extra?: any) => {
-      const result = await handler(input, extra);
-      const isJson = isJsonResponse(result);
-      const preamble = getOnboardingPreamble(toolName, isJson);
-      const suffix = isJson ? null : getOnboardingNudge(toolName, input, result);
-
-      // Channel 1: out-of-band notifications (best effort, never throws)
-      if (preamble) void sendOnboardingNotification(server, preamble);
-      if (suffix)   void sendOnboardingNotification(server, suffix);
-
-      // Channel 2 + 3: content blocks with markers + audience annotation
-      const content: Array<{ type: 'text'; text: string; annotations?: { audience?: ('user' | 'assistant')[]; priority?: number } }> = [];
-      if (preamble) {
-        content.push({ type: 'text' as const, text: `<apra-fleet-display>\n${preamble}\n</apra-fleet-display>`, annotations: { audience: ['user'], priority: 1 } });
-      }
-      content.push({ type: 'text' as const, text: sanitizeToolResult(result) });
-      if (suffix) {
-        content.push({ type: 'text' as const, text: `<apra-fleet-display>\n${suffix}\n</apra-fleet-display>`, annotations: { audience: ['user'], priority: 0.8 } });
-      }
-      return { content };
-    };
-  }
-
-  // --- Core Member Management ---
-  server.tool('register_member', 'Add a machine to the fleet. Use member_type "local" for this machine or "remote" for a machine reachable over SSH. Choose the AI provider the member will use for prompts.', registerMemberSchema.shape, wrapTool('register_member', (input) => registerMember(input as any)));
-  server.tool('list_members', 'List all fleet members and their current status. Use format="json" for structured data.', listMembersSchema.shape, wrapTool('list_members', (input) => listMembers(input as any)));
-  server.tool('remove_member', 'Remove a member from the fleet.', removeMemberSchema.shape, wrapTool('remove_member', (input) => removeMember(input as any)));
-  server.tool('update_member', "Change a member's name, connection details, working directory, AI provider, or other settings.", updateMemberSchema.shape, wrapTool('update_member', (input) => updateMember(input as any)));
-
-  // --- File Operations ---
-  server.tool('send_files', 'Transfer local files to a member. Always batch multiple files into a single call — never invoke repeatedly for individual files.', sendFilesSchema.shape, wrapTool('send_files', (input, extra) => sendFiles(input as any, extra)));
-  server.tool('receive_files', 'Download files from a member to a local directory. Always batch multiple files into a single call — never invoke repeatedly for individual files.', receiveFilesSchema.shape, wrapTool('receive_files', (input, extra) => receiveFiles(input as any, extra)));
-
-  // --- Prompt Execution ---
-  server.tool('execute_prompt', 'IMP: Never call this tool directly. Always wrap in a background subagent: Agent(run_in_background=true). Run an AI prompt on a member. Supports session resume for multi-turn conversations.', executePromptSchema.shape, wrapTool('execute_prompt', (input, extra) => executePrompt(input as any, extra)));
-  server.tool('execute_command', 'IMP: Never call this tool directly. Always wrap in a background subagent: Agent(run_in_background=true). Run a shell command on a member. Use for quick tasks like installing packages, checking versions, or running scripts.', executeCommandSchema.shape, wrapTool('execute_command', (input, extra) => executeCommand(input as any, extra)));
-
-  // --- Authentication & SSH ---
-  server.tool('provision_llm_auth', "Authenticate a fleet member so it can run prompts. Copies your current login session to the member, or deploys an API key if provided. Run this before execute_prompt if the member reports no authentication.", provisionAuthSchema.shape, wrapTool('provision_llm_auth', (input) => provisionAuth(input as any)));
-  server.tool('setup_ssh_key', 'Generate an SSH key pair and migrate a member from password to key-based authentication.', setupSSHKeySchema.shape, wrapTool('setup_ssh_key', (input) => setupSSHKey(input as any)));
-  server.tool('setup_git_app', "One-time setup: register a GitHub App for git token minting. Requires a GitHub App ID, private key (.pem) file path, and installation ID. The app must already be created at github.com/organizations/{org}/settings/apps.", setupGitAppSchema.shape, wrapTool('setup_git_app', (input) => setupGitApp(input as any)));
-  server.tool('provision_vcs_auth', 'Set up git access credentials on a member. Supports GitHub, Bitbucket, and Azure DevOps. Tests connectivity after setup.', provisionVcsAuthSchema.shape, wrapTool('provision_vcs_auth', (input) => provisionVcsAuth(input as any)));
-  server.tool('revoke_vcs_auth', 'Remove VCS credentials from a member. Specify the provider (github, bitbucket, or azure-devops) to revoke.', revokeVcsAuthSchema.shape, wrapTool('revoke_vcs_auth', (input) => revokeVcsAuth(input as any)));
-
-  // --- Status & Monitoring ---
-  server.tool('fleet_status', 'Get status of all fleet members. Use json format for structured data.', fleetStatusSchema.shape, wrapTool('fleet_status', (input) => fleetStatus(input as any)));
-  server.tool('member_detail', 'Get detailed status for one member: connectivity, AI version, authentication, active session, resources, and git branch.', memberDetailSchema.shape, wrapTool('member_detail', (input) => memberDetail(input as any)));
-
-  // --- Maintenance ---
-  server.tool('update_llm_cli', "Update or install the AI provider CLI on members. Omit member to update all online members at once. Use install_if_missing to install on members that don't have it yet.", updateAgentCliSchema.shape, wrapTool('update_llm_cli', (input) => updateAgentCli(input as any)));
-  server.tool('shutdown_server', 'Gracefully shut down the MCP server. Run /mcp afterwards to start a fresh instance with the latest code.', shutdownServerSchema.shape, wrapTool('shutdown_server', () => shutdownServer()));
-  server.tool('version', 'Returns the installed apra-fleet server version', versionSchema.shape, wrapTool('version', () => version()));
-
-  // --- Permissions ---
-  server.tool('compose_permissions', 'Set up and deliver the right permissions to a member for their role. Automatically tailors permissions to the project type. Use grant to add specific permissions mid-sprint without a full recompose.', composePermissionsSchema.shape, wrapTool('compose_permissions', (input) => composePermissions(input as any)));
-
-  // --- Cloud Control ---
-  server.tool('cloud_control', 'Manually start, stop, or check status of a cloud fleet member. Start waits until the member is ready; stop is immediate.', cloudControlSchema.shape, wrapTool('cloud_control', (input) => cloudControl(input as any)));
-  server.tool('monitor_task', 'Check status of a long-running background task on a cloud member. Optionally stop the cloud instance automatically when the task completes.', monitorTaskSchema.shape, wrapTool('monitor_task', (input) => monitorTask(input as any)));
-
-  // --- Agent Lifecycle ---
-  server.tool('stop_prompt', 'Kill the active LLM process on a member. Always call TaskStop on the dispatching background agent after calling this.', stopPromptSchema.shape, wrapTool('stop_prompt', (input) => stopPrompt(input as any)));
-  // --- Credential Store ---
-  server.tool('credential_store_set', 'Collect a secret from the user out-of-band and store it. Returns a handle (sec://NAME) and scope. Use {{secure.NAME}} tokens in execute_command to inject the value.', credentialStoreSetSchema.shape, wrapTool('credential_store_set', (input) => credentialStoreSet(input as any)));
-  server.tool('credential_store_list', 'List all stored credentials (names and metadata only — no values).', credentialStoreListSchema.shape, wrapTool('credential_store_list', () => credentialStoreList()));
-  server.tool('credential_store_delete', 'Delete a named credential from the store (both session and persistent tiers).', credentialStoreDeleteSchema.shape, wrapTool('credential_store_delete', (input) => credentialStoreDelete(input as any)));
-  server.tool('credential_store_update', 'Update metadata (members, TTL, network policy) on an existing credential without re-entering the secret.', credentialStoreUpdateSchema.shape, wrapTool('credential_store_update', (input) => credentialStoreUpdate(input as any)));
+  // Register all tools
+  const { registerAllTools } = await import('./services/tool-registry.js');
+  await registerAllTools(server);
 
   // --- Start Server ---
   const transport = new StdioServerTransport();
@@ -275,7 +195,7 @@ async function startServer() {
   const clientStr = capturedClientInfo?.name ? ` client=${capturedClientInfo.name}` : '';
   const versionStr = capturedClientInfo?.version ? ` version=${capturedClientInfo.version}` : '';
   const pidStr = ` pid=${process.pid} ppid=${process.ppid}`;
-  logLine('startup', `apra-fleet ${serverVersion} started${clientStr}${versionStr}${pidStr} FLEET_DIR=${FLEET_DIR}`);
+  logLine('startup', `apra-fleet ${serverVersion} started transport=stdio${clientStr}${versionStr}${pidStr} FLEET_DIR=${FLEET_DIR}`);
 
   idleManager.start();
   void cleanupStaleTasks();
@@ -286,3 +206,82 @@ async function startServer() {
   process.on('SIGINT', () => { cleanupAuthSocket().then(() => { closeAllConnections(); stallDetector.stop(); process.exit(0); }); });
   process.on('SIGTERM', () => { cleanupAuthSocket().then(() => { closeAllConnections(); stallDetector.stop(); process.exit(0); }); });
 }
+
+async function startHttpServer() {
+  const { loadOnboardingState, resetSessionFlags } = await import('./services/onboarding.js');
+  const { getAllAgents: getAgentsForStartup } = await import('./services/registry.js');
+  // Pass current member count so upgrade detection works: existing registry + no onboarding.json -> skip banner
+  loadOnboardingState(getAgentsForStartup().length);
+  resetSessionFlags();
+
+  const { checkRunningInstance, claimStartupLock } = await import('./services/singleton.js');
+  const { createHttpTransport } = await import('./services/http-transport.js');
+  const { registerAllTools } = await import('./services/tool-registry.js');
+  const { FLEET_DIR, SERVER_INFO_PATH } = await import('./paths.js');
+  const { closeAllConnections } = await import('./services/ssh.js');
+  const { idleManager } = await import('./services/cloud/idle-manager.js');
+  const { cleanupStaleTasks } = await import('./services/task-cleanup.js');
+  const { checkForUpdate } = await import('./services/update-check.js');
+  const { purgeExpiredCredentials } = await import('./services/credential-store.js');
+  const { getStallDetector } = await import('./services/stall/index.js');
+  const { cleanupAuthSocket } = await import('./services/auth-socket.js');
+  const { setHttpHandle } = await import('./tools/shutdown-server.js');
+
+  // Detect already-running instance before starting
+  const instance = await checkRunningInstance();
+  if (instance.running) {
+    logLine('startup', `apra-fleet already running at ${instance.url} pid=${instance.pid} -- exiting`);
+    process.exit(0);
+  }
+
+  // Atomic startup lock to prevent concurrent double-start race
+  const lock = claimStartupLock();
+  if (!lock.acquired) {
+    logLine('startup', 'Another fleet instance is starting up -- exiting');
+    process.exit(0);
+  }
+
+  const handle = await createHttpTransport({ registerTools: registerAllTools });
+
+  // Write server.json so other processes can detect this instance
+  fs.mkdirSync(FLEET_DIR, { recursive: true });
+  fs.writeFileSync(
+    SERVER_INFO_PATH,
+    JSON.stringify({
+      pid: process.pid,
+      port: handle.port,
+      url: handle.url,
+      version: serverVersion,
+      startedAt: new Date().toISOString(),
+    }),
+  );
+
+  // Release startup lock now that server.json is written (server.json is the long-lived detection mechanism)
+  lock.release();
+
+  // Make HTTP handle available to shutdown_server tool
+  setHttpHandle(handle);
+
+  const stallDetector = getStallDetector();
+  stallDetector.start();
+
+  logLine('startup', `apra-fleet ${serverVersion} started transport=http port=${handle.port} pid=${process.pid} FLEET_DIR=${FLEET_DIR}`);
+
+  idleManager.start();
+  void cleanupStaleTasks();
+  purgeExpiredCredentials();
+  void checkForUpdate();
+
+  async function shutdown() {
+    try { lock.release(); } catch {}
+    try { fs.unlinkSync(SERVER_INFO_PATH); } catch {}
+    try { await handle.close(); } catch {}
+    try { await cleanupAuthSocket(); } catch {}
+    try { closeAllConnections(); } catch {}
+    try { stallDetector.stop(); } catch {}
+    process.exit(0);
+  }
+
+  process.on('SIGINT', () => void shutdown());
+  process.on('SIGTERM', () => void shutdown());
+}
diff --git a/src/paths.ts b/src/paths.ts
index 040363f0..dd8cba47 100644
--- a/src/paths.ts
+++ b/src/paths.ts
@@ -2,3 +2,9 @@ import path from 'node:path';
 import os from 'node:os';
 
 export const FLEET_DIR = process.env.APRA_FLEET_DATA_DIR ?? path.join(os.homedir(), '.apra-fleet', 'data');
+
+export const DEFAULT_PORT = parseInt(process.env.APRA_FLEET_PORT ?? '', 10) || 7523;
+
+export const SERVER_INFO_PATH = path.join(FLEET_DIR, 'server.json');
+
+export const LOG_FILE_PATH = path.join(FLEET_DIR, 'fleet.log');
diff --git a/src/services/auth-socket.ts b/src/services/auth-socket.ts
index c86124b5..9df1f54f 100644
--- a/src/services/auth-socket.ts
+++ b/src/services/auth-socket.ts
@@ -8,6 +8,7 @@ import { FLEET_DIR } from '../paths.js';
 import { encryptPassword } from '../utils/crypto.js';
 import { logError } from '../utils/log-helpers.js';
 import { OOB_TIMEOUT_MS } from '../utils/oob-timeout.js';
+import { fleetEvents } from './event-bus.js';
 
 const SOCKET_PATH = path.join(FLEET_DIR, 'auth.sock');
 const PENDING_TTL_MS = 10 * 60 * 1000; // 10 minutes
@@ -120,6 +121,7 @@ export async function ensureAuthSocket(): Promise<void> {
               clearTimeout(waiter.timer);
               passwordWaiters.delete(msg.member_name);
               waiter.resolve(pending.encryptedPassword);
+              fleetEvents.emit('credential:stored', { name: msg.member_name });
             }
           } else {
             conn.write(JSON.stringify({ type: 'ack', ok: false, error: 'Invalid message' }) + '\n');
diff --git a/src/services/event-bus.ts b/src/services/event-bus.ts
new file mode 100644
index 00000000..f4d4793f
--- /dev/null
+++ b/src/services/event-bus.ts
@@ -0,0 +1,43 @@
+import { EventEmitter } from 'node:events';
+
+export interface FleetEventMap {
+  'credential:stored': { name: string };
+  'task:completed': { taskId: string; status: string };
+  'member:status-changed': { memberId: string; status: string };
+  'stall:detected': { memberId: string; memberName: string };
+}
+
+class TypedEventBus extends EventEmitter {
+  emit<K extends keyof FleetEventMap>(
+    event: K,
+    payload: FleetEventMap[K]
+  ): boolean {
+    return super.emit(event as string, payload);
+  }
+
+  on<K extends keyof FleetEventMap>(
+    event: K,
+    listener: (payload: FleetEventMap[K]) => void
+  ): this {
+    super.on(event as string, listener);
+    return this;
+  }
+
+  off<K extends keyof FleetEventMap>(
+    event: K,
+    listener: (payload: FleetEventMap[K]) => void
+  ): this {
+    super.off(event as string, listener);
+    return this;
+  }
+
+  once<K extends keyof FleetEventMap>(
+    event: K,
+    listener: (payload: FleetEventMap[K]) => void
+  ): this {
+    super.once(event as string, listener);
+    return this;
+  }
+}
+
+export const fleetEvents = new TypedEventBus();
diff --git a/src/services/http-transport.ts b/src/services/http-transport.ts
new file mode 100644
index 00000000..c6b28b70
--- /dev/null
+++ b/src/services/http-transport.ts
@@ -0,0 +1,250 @@
+import http from 'node:http';
+import crypto from 'node:crypto';
+import { McpServer } from '@modelcontextprotocol/sdk/server/mcp.js';
+import { StreamableHTTPServerTransport } from '@modelcontextprotocol/sdk/server/streamableHttp.js';
+import { fleetEvents, FleetEventMap } from './event-bus.js';
+import { DEFAULT_PORT } from '../paths.js';
+import { serverVersion } from '../version.js';
+import { logLine } from '../utils/log-helpers.js';
+
+interface Session {
+  server: McpServer;
+  transport: StreamableHTTPServerTransport;
+}
+
+export interface HttpTransportOptions {
+  registerTools: (server: McpServer) => void | Promise<void>;
+  preferredPort?: number;
+}
+
+export interface HttpTransportHandle {
+  httpServer: http.Server;
+  port: number;
+  url: string;
+  sessions: Map<string, Session>;
+  close(): Promise<void>;
+}
+
+function parseBody(req: http.IncomingMessage): Promise<unknown> {
+  return new Promise((resolve, reject) => {
+    const chunks: Buffer[] = [];
+    req.on('data', (chunk: Buffer) => chunks.push(chunk));
+    req.on('end', () => {
+      try {
+        const text = Buffer.concat(chunks).toString('utf8');
+        resolve(text ? JSON.parse(text) : undefined);
+      } catch (err) {
+        reject(err);
+      }
+    });
+    req.on('error', reject);
+  });
+}
+
+function listenOnPort(server: http.Server, port: number, host: string): Promise<number> {
+  return new Promise((resolve, reject) => {
+    server.listen(port, host, () => {
+      const addr = server.address() as { port: number };
+      resolve(addr.port);
+    });
+    server.once('error', reject);
+  });
+}
+
+function isInitializeRequest(body: unknown): boolean {
+  if (!body) return false;
+  if (Array.isArray(body)) {
+    return body.some((msg: unknown) => (msg as { method?: string }).method === 'initialize');
+  }
+  return (body as { method?: string }).method === 'initialize';
+}
+
+export async function createHttpTransport(options: HttpTransportOptions): Promise<HttpTransportHandle> {
+  const { registerTools, preferredPort } = options;
+  const sessions = new Map<string, Session>();
+  const startedAt = Date.now();
+
+  // LOW-1: Track event listener references for cleanup in close()
+  const eventCleanups: Array<() => void> = [];
+
+  // LOW-3: Shared handler for GET and DELETE -- both just look up session and delegate
+  async function handleSessionRequest(req: http.IncomingMessage, res: http.ServerResponse): Promise<void> {
+    const sessionId = req.headers['mcp-session-id'] as string | undefined;
+    if (!sessionId) {
+      res.writeHead(400);
+      res.end('Missing mcp-session-id header');
+      return;
+    }
+    const session = sessions.get(sessionId);
+    if (!session) {
+      res.writeHead(404);
+      res.end('Session not found');
+      return;
+    }
+    await session.transport.handleRequest(req, res);
+  }
+
+  const httpServer = http.createServer(async (req, res) => {
+    const url = req.url ?? '/';
+
+    if (url === '/health' && req.method === 'GET') {
+      const body = JSON.stringify({
+        status: 'ok',
+        version: serverVersion,
+        pid: process.pid,
+        uptime: Math.floor((Date.now() - startedAt) / 1000),
+        sessions: sessions.size,
+      });
+      res.writeHead(200, { 'Content-Type': 'application/json' });
+      res.end(body);
+      return;
+    }
+
+    if (url === '/shutdown' && req.method === 'POST') {
+      const body = JSON.stringify({ status: 'shutting-down' });
+      res.writeHead(200, { 'Content-Type': 'application/json' });
+      res.end(body);
+      setTimeout(() => {
+        process.emit('SIGINT');
+      }, 100);
+      return;
+    }
+
+    if (url !== '/mcp') {
+      res.writeHead(404);
+      res.end();
+      return;
+    }
+
+    if (req.method === 'POST') {
+      let parsedBody: unknown;
+      try {
+        parsedBody = await parseBody(req);
+      } catch {
+        res.writeHead(400);
+        res.end('Bad request body');
+        return;
+      }
+
+      if (isInitializeRequest(parsedBody)) {
+        const body = parsedBody as {
+          params?: {
+            clientInfo?: { name?: string; version?: string };
+            capabilities?: Record<string, unknown>;
+          };
+        };
+        const clientInfo = body?.params?.clientInfo ?? {};
+        const clientCaps = body?.params?.capabilities ?? {};
+        const capKeys = Object.keys(clientCaps).join(',');
+        const hasChannel = !!(clientCaps.experimental as any)?.['claude/channel'];
+
+        const sessionServer = new McpServer(
+          { name: `apra fleet server ${serverVersion}`, version: serverVersion },
+          { capabilities: { logging: {}, experimental: { 'claude/channel': {} } } }
+        );
+        const sessionTransport = new StreamableHTTPServerTransport({
+          sessionIdGenerator: () => crypto.randomUUID(),
+          onsessioninitialized: (sid) => {
+            sessions.set(sid, { server: sessionServer, transport: sessionTransport });
+            logLine('session', `new sid=${sid} client=${clientInfo.name ?? 'unknown'}/${clientInfo.version ?? 'unknown'} caps=${capKeys || 'none'} channel=${hasChannel}`);
+          },
+          onsessionclosed: (sid) => {
+            logLine('session', `closed sid=${sid}`);
+            // LOW-2: Close the McpServer when its session closes
+            const s = sessions.get(sid);
+            if (s) {
+              (s.server as any).server?.close().catch(() => {});
+            }
+            sessions.delete(sid);
+          },
+        });
+        await registerTools(sessionServer);
+        await sessionServer.connect(sessionTransport);
+        await sessionTransport.handleRequest(req, res, parsedBody);
+        return;
+      }
+
+      const sessionId = req.headers['mcp-session-id'] as string | undefined;
+      if (!sessionId) {
+        res.writeHead(400);
+        res.end('Missing mcp-session-id header');
+        return;
+      }
+      const session = sessions.get(sessionId);
+      if (!session) {
+        res.writeHead(404);
+        res.end('Session not found');
+        return;
+      }
+      await session.transport.handleRequest(req, res, parsedBody);
+      return;
+    }
+
+    // LOW-3: GET and DELETE share the same session-lookup-and-delegate logic
+    if (req.method === 'GET' || req.method === 'DELETE') {
+      await handleSessionRequest(req, res);
+      return;
+    }
+
+    res.writeHead(405);
+    res.end('Method not allowed');
+  });
+
+  // Subscribe to fleet events and broadcast to all connected sessions
+  const fleetEventTypes: (keyof FleetEventMap)[] = [
+    'credential:stored',
+    'task:completed',
+    'member:status-changed',
+    'stall:detected',
+  ];
+
+  for (const eventType of fleetEventTypes) {
+    const handler = (payload: FleetEventMap[typeof eventType]) => {
+      const data = { event: eventType, ...(payload as object) };
+      for (const [, session] of sessions) {
+        session.server.sendLoggingMessage({
+          level: 'info',
+          logger: 'apra-fleet-events',
+          data,
+        }).catch(() => {});
+      }
+    };
+    fleetEvents.on(eventType, handler);
+    // LOW-1: Store cleanup so close() can unsubscribe
+    eventCleanups.push(() => fleetEvents.off(eventType, handler));
+  }
+
+  // Start listening: try preferred port, fall back to OS-assigned port
+  const targetPort = preferredPort ?? DEFAULT_PORT;
+  let port: number;
+  try {
+    port = await listenOnPort(httpServer, targetPort, '127.0.0.1');
+  } catch (err: unknown) {
+    if ((err as NodeJS.ErrnoException).code === 'EADDRINUSE') {
+      port = await listenOnPort(httpServer, 0, '127.0.0.1');
+    } else {
+      throw err;
+    }
+  }
+
+  const url = `http://127.0.0.1:${port}/mcp`;
+
+  return {
+    httpServer,
+    port,
+    url,
+    sessions,
+    close(): Promise<void> {
+      // LOW-1: Unsubscribe all fleet event listeners
+      for (const cleanup of eventCleanups) cleanup();
+      // LOW-2: Close all active session McpServers before shutting down
+      for (const [, session] of sessions) {
+        (session.server as any).server?.close().catch(() => {});
+      }
+      sessions.clear();
+      return new Promise((resolve, reject) => {
+        httpServer.close((err) => (err ? reject(err) : resolve()));
+      });
+    },
+  };
+}
diff --git a/src/services/service-manager/index.ts b/src/services/service-manager/index.ts
new file mode 100644
index 00000000..d4a3f5da
--- /dev/null
+++ b/src/services/service-manager/index.ts
@@ -0,0 +1,65 @@
+import fs from 'node:fs';
+import { SERVER_INFO_PATH } from '../../paths.js';
+import type { ServiceManager, ServiceStatus } from './types.js';
+import { isPidAlive, postShutdown } from '../../utils/process-utils.js';
+
+export type { ServiceManager, ServiceStatus };
+
+export async function gracefulStopByServerJson(fallbackKill?: (pid: number) => void): Promise<void> {
+  let info: { pid?: number; url?: string };
+  try {
+    info = JSON.parse(fs.readFileSync(SERVER_INFO_PATH, 'utf8'));
+  } catch {
+    return;
+  }
+  const { pid, url } = info;
+  if (!pid || !url) return;
+  if (!isPidAlive(pid)) return;
+
+  await postShutdown(url);
+
+  const deadline = Date.now() + 5000;
+  while (isPidAlive(pid) && Date.now() < deadline) {
+    await new Promise(resolve => setTimeout(resolve, 500));
+  }
+
+  if (isPidAlive(pid)) {
+    if (fallbackKill) {
+      fallbackKill(pid);
+    } else {
+      try { process.kill(pid, 'SIGTERM'); } catch {}
+    }
+  }
+
+  try { fs.unlinkSync(SERVER_INFO_PATH); } catch {}
+}
+
+class NoopServiceManager implements ServiceManager {
+  async register(_binaryPath: string, _args: string[], _logPath: string): Promise<void> {}
+  async unregister(): Promise<void> {}
+  async start(): Promise<void> {}
+  async stop(): Promise<void> {}
+  async query(): Promise<ServiceStatus> { return { installed: false, running: false }; }
+  async isInstalled(): Promise<boolean> { return false; }
+}
+
+export async function getServiceManager(): Promise<ServiceManager> {
+  switch (process.platform) {
+    case 'win32': {
+      const { WindowsServiceManager } = await import('./windows.js');
+      return new WindowsServiceManager();
+    }
+    case 'linux': {
+      const { LinuxServiceManager } = await import('./linux.js');
+      return new LinuxServiceManager();
+    }
+    case 'darwin': {
+      const { MacOSServiceManager } = await import('./macos.js');
+      return new MacOSServiceManager();
+    }
+    default: {
+      console.warn(`apra-fleet: service management is not supported on platform '${process.platform}'. Using no-op stub.`);
+      return new NoopServiceManager();
+    }
+  }
+}
diff --git a/src/services/service-manager/linux.ts b/src/services/service-manager/linux.ts
new file mode 100644
index 00000000..c95625d5
--- /dev/null
+++ b/src/services/service-manager/linux.ts
@@ -0,0 +1,94 @@
+import { execFileSync } from 'node:child_process';
+import fs from 'node:fs';
+import path from 'node:path';
+import os from 'node:os';
+import type { ServiceManager, ServiceStatus } from './types.js';
+import { LINUX_UNIT_NAME } from './types.js';
+import { gracefulStopByServerJson } from './index.js';
+
+const UNIT_DIR = path.join(os.homedir(), '.config', 'systemd', 'user');
+const UNIT_PATH = path.join(UNIT_DIR, LINUX_UNIT_NAME);
+const SERVICE_NAME = LINUX_UNIT_NAME.replace(/\.service$/, '');
+
+function checkSystemd(): void {
+  const uid = typeof process.getuid === 'function' ? process.getuid() : 1000;
+  const xdgRuntime = process.env.XDG_RUNTIME_DIR ?? `/run/user/${uid}`;
+  if (!fs.existsSync(path.join(xdgRuntime, 'systemd'))) {
+    throw new Error('systemd user mode is not available. Service management requires systemd.');
+  }
+}
+
+export class LinuxServiceManager implements ServiceManager {
+  async register(binaryPath: string, args: string[], logPath: string): Promise<void> {
+    checkSystemd();
+    const unit = [
+      '[Unit]',
+      'Description=Apra Fleet MCP Server',
+      '',
+      '[Service]',
+      'Type=simple',
+      `ExecStart=${binaryPath} ${args.join(' ')}`,
+      'Restart=on-failure',
+      `StandardOutput=append:${logPath}`,
+      `StandardError=append:${logPath}`,
+      '',
+      '[Install]',
+      'WantedBy=default.target',
+      '',
+    ].join('\n');
+    fs.mkdirSync(UNIT_DIR, { recursive: true });
+    fs.writeFileSync(UNIT_PATH, unit, 'utf8');
+    execFileSync('systemctl', ['--user', 'daemon-reload']);
+    execFileSync('systemctl', ['--user', 'enable', SERVICE_NAME]);
+    try {
+      execFileSync('loginctl', ['enable-linger', os.userInfo().username]);
+    } catch (err) {
+      console.warn(`apra-fleet: loginctl enable-linger failed (non-fatal): ${err}`);
+    }
+  }
+
+  async unregister(): Promise<void> {
+    await gracefulStopByServerJson();
+    checkSystemd();
+    try { execFileSync('systemctl', ['--user', 'disable', SERVICE_NAME]); } catch {}
+    try { execFileSync('systemctl', ['--user', 'stop', SERVICE_NAME]); } catch {}
+    try { fs.unlinkSync(UNIT_PATH); } catch {}
+    try { execFileSync('systemctl', ['--user', 'daemon-reload']); } catch {}
+  }
+
+  async start(): Promise<void> {
+    checkSystemd();
+    execFileSync('systemctl', ['--user', 'start', SERVICE_NAME]);
+  }
+
+  async stop(): Promise<void> {
+    checkSystemd();
+    await gracefulStopByServerJson();
+  }
+
+  async query(): Promise<ServiceStatus> {
+    checkSystemd();
+    if (!fs.existsSync(UNIT_PATH)) {
+      return { installed: false, running: false };
+    }
+    let running = false;
+    let enabled: boolean | undefined;
+    try {
+      const active = execFileSync(
+        'systemctl', ['--user', 'is-active', SERVICE_NAME], { encoding: 'utf8' },
+      ).trim();
+      running = active === 'active';
+    } catch {}
+    try {
+      const enabledOut = execFileSync(
+        'systemctl', ['--user', 'is-enabled', SERVICE_NAME], { encoding: 'utf8' },
+      ).trim();
+      enabled = enabledOut === 'enabled';
+    } catch {}
+    return { installed: true, running, enabled };
+  }
+
+  async isInstalled(): Promise<boolean> {
+    return fs.existsSync(UNIT_PATH);
+  }
+}
diff --git a/src/services/service-manager/macos.ts b/src/services/service-manager/macos.ts
new file mode 100644
index 00000000..06d70513
--- /dev/null
+++ b/src/services/service-manager/macos.ts
@@ -0,0 +1,98 @@
+import { execFileSync } from 'node:child_process';
+import fs from 'node:fs';
+import path from 'node:path';
+import os from 'node:os';
+import type { ServiceManager, ServiceStatus } from './types.js';
+import { MACOS_PLIST_LABEL } from './types.js';
+import { gracefulStopByServerJson } from './index.js';
+
+const PLIST_DIR = path.join(os.homedir(), 'Library', 'LaunchAgents');
+const PLIST_PATH = path.join(PLIST_DIR, `${MACOS_PLIST_LABEL}.plist`);
+
+function getUid(): string {
+  return typeof process.getuid === 'function' ? String(process.getuid()) : '501';
+}
+
+function domain(): string {
+  return `gui/${getUid()}`;
+}
+
+function xmlEscape(s: string): string {
+  return s.replace(/&/g, '&amp;').replace(/</g, '&lt;').replace(/>/g, '&gt;');
+}
+
+function buildPlist(binaryPath: string, args: string[], logPath: string): string {
+  const argElements = [binaryPath, ...args]
+    .map(a => `        <string>${xmlEscape(a)}</string>`)
+    .join('\n');
+  return [
+    '<?xml version="1.0" encoding="UTF-8"?>',
+    '<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">',
+    '<plist version="1.0">',
+    '<dict>',
+    '    <key>Label</key>',
+    `    <string>${MACOS_PLIST_LABEL}</string>`,
+    '    <key>ProgramArguments</key>',
+    '    <array>',
+    argElements,
+    '    </array>',
+    '    <key>RunAtLoad</key>',
+    '    <true/>',
+    '    <key>KeepAlive</key>',
+    '    <dict>',
+    '        <key>SuccessfulExit</key>',
+    '        <false/>',
+    '    </dict>',
+    `    <key>StandardOutPath</key>`,
+    `    <string>${xmlEscape(logPath)}</string>`,
+    `    <key>StandardErrorPath</key>`,
+    `    <string>${xmlEscape(logPath)}</string>`,
+    '</dict>',
+    '</plist>',
+    '',
+  ].join('\n');
+}
+
+export class MacOSServiceManager implements ServiceManager {
+  async register(binaryPath: string, args: string[], logPath: string): Promise<void> {
+    fs.mkdirSync(PLIST_DIR, { recursive: true });
+    fs.writeFileSync(PLIST_PATH, buildPlist(binaryPath, args, logPath), 'utf8');
+    // Bootout first to make register idempotent
+    try { execFileSync('launchctl', ['bootout', `${domain()}/${MACOS_PLIST_LABEL}`]); } catch {}
+    execFileSync('launchctl', ['bootstrap', domain(), PLIST_PATH]);
+  }
+
+  async unregister(): Promise<void> {
+    try { execFileSync('launchctl', ['bootout', `${domain()}/${MACOS_PLIST_LABEL}`]); } catch {}
+    try { fs.unlinkSync(PLIST_PATH); } catch {}
+  }
+
+  async start(): Promise<void> {
+    execFileSync('launchctl', ['kickstart', `${domain()}/${MACOS_PLIST_LABEL}`]);
+  }
+
+  async stop(): Promise<void> {
+    await gracefulStopByServerJson();
+  }
+
+  async query(): Promise<ServiceStatus> {
+    if (!fs.existsSync(PLIST_PATH)) {
+      return { installed: false, running: false };
+    }
+    try {
+      const out = execFileSync(
+        'launchctl', ['print', `${domain()}/${MACOS_PLIST_LABEL}`],
+        { encoding: 'utf8' },
+      );
+      const pidMatch = out.match(/\bpid\s*=\s*(\d+)/);
+      const pid = pidMatch ? parseInt(pidMatch[1], 10) : undefined;
+      return { installed: true, running: !!pid && pid > 0, pid };
+    } catch {
+      return { installed: true, running: false };
+    }
+  }
+
+  async isInstalled(): Promise<boolean> {
+    return fs.existsSync(PLIST_PATH);
+  }
+}
diff --git a/src/services/service-manager/types.ts b/src/services/service-manager/types.ts
new file mode 100644
index 00000000..5a7ee7c3
--- /dev/null
+++ b/src/services/service-manager/types.ts
@@ -0,0 +1,20 @@
+// Service name constants for each platform
+export const WINDOWS_TASK_NAME = 'ApraFleet';
+export const LINUX_UNIT_NAME = 'apra-fleet.service';
+export const MACOS_PLIST_LABEL = 'com.apra-fleet.server';
+
+export interface ServiceStatus {
+  installed: boolean;
+  running: boolean;
+  pid?: number;
+  enabled?: boolean;
+}
+
+export interface ServiceManager {
+  register(binaryPath: string, args: string[], logPath: string): Promise<void>;
+  unregister(): Promise<void>;
+  start(): Promise<void>;
+  stop(): Promise<void>;
+  query(): Promise<ServiceStatus>;
+  isInstalled(): Promise<boolean>;
+}
diff --git a/src/services/service-manager/windows.ts b/src/services/service-manager/windows.ts
new file mode 100644
index 00000000..d6889e26
--- /dev/null
+++ b/src/services/service-manager/windows.ts
@@ -0,0 +1,67 @@
+import { execFileSync } from 'node:child_process';
+import fs from 'node:fs';
+import path from 'node:path';
+import type { ServiceManager, ServiceStatus } from './types.js';
+import { WINDOWS_TASK_NAME } from './types.js';
+import { gracefulStopByServerJson } from './index.js';
+import { BIN_DIR } from '../../cli/config.js';
+
+const WRAPPER_PATH = path.join(BIN_DIR, 'apra-fleet-service.bat');
+
+export class WindowsServiceManager implements ServiceManager {
+  async register(binaryPath: string, args: string[], logPath: string): Promise<void> {
+    fs.mkdirSync(path.dirname(WRAPPER_PATH), { recursive: true });
+    const quotedArgs = args.map(a => `"${a}"`).join(' ');
+    const lines = ['@echo off', `"${binaryPath}" ${quotedArgs} >> "${logPath}" 2>&1`];
+    fs.writeFileSync(WRAPPER_PATH, lines.join('\r\n'), 'utf8');
+    execFileSync('schtasks', [
+      '/create', '/tn', WINDOWS_TASK_NAME,
+      '/tr', WRAPPER_PATH,
+      '/sc', 'onlogon', '/rl', 'limited', '/f',
+    ]);
+  }
+
+  async unregister(): Promise<void> {
+    try {
+      execFileSync('schtasks', ['/delete', '/tn', WINDOWS_TASK_NAME, '/f']);
+    } catch {
+      // Tolerate task-not-found
+    }
+    try { fs.unlinkSync(WRAPPER_PATH); } catch {}
+  }
+
+  async start(): Promise<void> {
+    execFileSync('schtasks', ['/run', '/tn', WINDOWS_TASK_NAME]);
+  }
+
+  async stop(): Promise<void> {
+    await gracefulStopByServerJson((pid) => {
+      try { execFileSync('taskkill', ['/F', '/PID', String(pid)]); } catch {}
+    });
+  }
+
+  async query(): Promise<ServiceStatus> {
+    try {
+      const out = execFileSync(
+        'schtasks', ['/query', '/tn', WINDOWS_TASK_NAME, '/fo', 'csv', '/nh'],
+        { encoding: 'utf8' },
+      );
+      // CSV line: "TaskName","Next Run Time","Status"
+      const line = out.trim().split(/\r?\n/)[0] ?? '';
+      const cols = line.split('","');
+      const status = (cols[2] ?? '').replace(/"/g, '').trim();
+      return { installed: true, running: status === 'Running' };
+    } catch {
+      return { installed: false, running: false };
+    }
+  }
+
+  async isInstalled(): Promise<boolean> {
+    try {
+      execFileSync('schtasks', ['/query', '/tn', WINDOWS_TASK_NAME]);
+      return true;
+    } catch {
+      return false;
+    }
+  }
+}
diff --git a/src/services/singleton.ts b/src/services/singleton.ts
new file mode 100644
index 00000000..73fce630
--- /dev/null
+++ b/src/services/singleton.ts
@@ -0,0 +1,108 @@
+import fs from 'node:fs';
+import http from 'node:http';
+import path from 'node:path';
+import os from 'node:os';
+import { isPidAlive } from '../utils/process-utils.js';
+
+// Paths are computed at call time (not module load) so tests can override APRA_FLEET_DATA_DIR
+function getFleetDir(): string {
+  return process.env.APRA_FLEET_DATA_DIR ?? path.join(os.homedir(), '.apra-fleet', 'data');
+}
+
+function getServerInfoPath(): string {
+  return path.join(getFleetDir(), 'server.json');
+}
+
+function getLockPath(): string {
+  return path.join(getFleetDir(), 'server.lock');
+}
+
+const STALE_LOCK_AGE_MS = 60_000;
+
+export interface RunningInstance {
+  running: true;
+  url: string;
+  pid: number;
+}
+
+export type InstanceCheckResult = RunningInstance | { running: false };
+
+export interface StartupLock {
+  acquired: boolean;
+  release: () => void;
+}
+
+function checkHealthEndpoint(url: string): Promise<boolean> {
+  const healthUrl = url.replace(/\/mcp$/, '/health');
+  return new Promise((resolve) => {
+    const req = http.get(healthUrl, { timeout: 2000 }, (res) => {
+      res.resume(); // drain response body
+      resolve(res.statusCode === 200);
+    });
+    req.on('error', () => resolve(false));
+    req.on('timeout', () => { req.destroy(); resolve(false); });
+  });
+}
+
+export async function checkRunningInstance(): Promise<InstanceCheckResult> {
+  const serverInfoPath = getServerInfoPath();
+  let info: { pid?: number; url?: string };
+  try {
+    const raw = fs.readFileSync(serverInfoPath, 'utf8');
+    info = JSON.parse(raw);
+  } catch {
+    return { running: false };
+  }
+
+  if (!info.pid || !info.url) return { running: false };
+
+  if (!isPidAlive(info.pid)) {
+    try { fs.unlinkSync(serverInfoPath); } catch {}
+    return { running: false };
+  }
+
+  const healthy = await checkHealthEndpoint(info.url);
+  if (!healthy) {
+    try { fs.unlinkSync(serverInfoPath); } catch {}
+    return { running: false };
+  }
+
+  return { running: true, url: info.url, pid: info.pid };
+}
+
+export function claimStartupLock(): StartupLock {
+  const fleetDir = getFleetDir();
+  const lockPath = getLockPath();
+
+  try { fs.mkdirSync(fleetDir, { recursive: true }); } catch {}
+
+  function tryAcquire(allowRetry: boolean): StartupLock {
+    try {
+      const fd = fs.openSync(lockPath, 'wx');
+      fs.writeSync(fd, String(process.pid));
+      fs.closeSync(fd);
+      return {
+        acquired: true,
+        release: () => { try { fs.unlinkSync(lockPath); } catch {} },
+      };
+    } catch (err: unknown) {
+      if ((err as NodeJS.ErrnoException).code !== 'EEXIST') throw err;
+
+      // Lock file exists -- check if it is stale (crashed process)
+      if (allowRetry) {
+        try {
+          const stat = fs.statSync(lockPath);
+          if (Date.now() - stat.mtimeMs > STALE_LOCK_AGE_MS) {
+            fs.unlinkSync(lockPath);
+            return tryAcquire(false);
+          }
+        } catch {
+          // stat failed -- lock may have been deleted between our check and now
+        }
+      }
+      return { acquired: false, release: () => {} };
+    }
+  }
+
+  return tryAcquire(true);
+}
diff --git a/src/services/task-cleanup.ts b/src/services/task-cleanup.ts
index 4a3ae2fc..2858b135 100644
--- a/src/services/task-cleanup.ts
+++ b/src/services/task-cleanup.ts
@@ -1,6 +1,7 @@
 import fs from 'node:fs';
 import path from 'node:path';
 import os from 'node:os';
+import { isPidAlive } from '../utils/process-utils.js';
 
 const FLEET_TASKS_DIR = path.join(os.homedir(), '.fleet-tasks');
 
@@ -12,15 +13,6 @@ function retentionHoursFailed(): number {
   return parseInt(process.env.FLEET_TASK_RETENTION_HOURS ?? '168', 10);
 }
 
-function isPidAlive(pid: number): boolean {
-  try {
-    process.kill(pid, 0);
-    return true;
-  } catch {
-    return false;
-  }
-}
-
 export async function cleanupStaleTasks(tasksDir = FLEET_TASKS_DIR): Promise<void> {
   if (!fs.existsSync(tasksDir)) return;
 
diff --git a/src/services/tool-registry.ts b/src/services/tool-registry.ts
new file mode 100644
index 00000000..832b9217
--- /dev/null
+++ b/src/services/tool-registry.ts
@@ -0,0 +1,130 @@
+import type { McpServer } from '@modelcontextprotocol/sdk/server/mcp.js';
+
+export async function registerAllTools(server: McpServer): Promise<void> {
+  // Load onboarding functions
+  const { getFirstRunPreamble, isJsonResponse, isActiveTool, getOnboardingNudge, getWelcomeBackPreamble } = await import('./onboarding.js');
+
+  // Tool schemas and handlers
+  const { registerMemberSchema, registerMember } = await import('../tools/register-member.js');
+  const { listMembersSchema, listMembers } = await import('../tools/list-members.js');
+  const { removeMemberSchema, removeMember } = await import('../tools/remove-member.js');
+  const { updateMemberSchema, updateMember } = await import('../tools/update-member.js');
+  const { sendFilesSchema, sendFiles } = await import('../tools/send-files.js');
+  const { receiveFilesSchema, receiveFiles } = await import('../tools/receive-files.js');
+  const { executePromptSchema, executePrompt } = await import('../tools/execute-prompt.js');
+  const { executeCommandSchema, executeCommand } = await import('../tools/execute-command.js');
+  const { provisionAuthSchema, provisionAuth } = await import('../tools/provision-auth.js');
+  const { setupSSHKeySchema, setupSSHKey } = await import('../tools/setup-ssh-key.js');
+  const { setupGitAppSchema, setupGitApp } = await import('../tools/setup-git-app.js');
+  const { provisionVcsAuthSchema, provisionVcsAuth } = await import('../tools/provision-vcs-auth.js');
+  const { revokeVcsAuthSchema, revokeVcsAuth } = await import('../tools/revoke-vcs-auth.js');
+  const { fleetStatusSchema, fleetStatus } = await import('../tools/check-status.js');
+  const { memberDetailSchema, memberDetail } = await import('../tools/member-detail.js');
+  const { updateAgentCliSchema, updateAgentCli } = await import('../tools/update-agent-cli.js');
+  const { shutdownServerSchema, shutdownServer } = await import('../tools/shutdown-server.js');
+  const { composePermissionsSchema, composePermissions } = await import('../tools/compose-permissions.js');
+  const { cloudControlSchema, cloudControl } = await import('../tools/cloud-control.js');
+  const { monitorTaskSchema, monitorTask } = await import('../tools/monitor-task.js');
+  const { stopPromptSchema, stopPrompt } = await import('../tools/stop-prompt.js');
+  const { versionSchema, version } = await import('../tools/version.js');
+  const { credentialStoreSetSchema, credentialStoreSet } = await import('../tools/credential-store-set.js');
+  const { credentialStoreListSchema, credentialStoreList } = await import('../tools/credential-store-list.js');
+  const { credentialStoreDeleteSchema, credentialStoreDelete } = await import('../tools/credential-store-delete.js');
+  const { credentialStoreUpdateSchema, credentialStoreUpdate } = await import('../tools/credential-store-update.js');
+
+  // Onboarding helpers
+  async function sendOnboardingNotification(srv: typeof server, text: string): Promise<void> {
+    try {
+      await srv.server.sendLoggingMessage({
+        level: 'info',
+        logger: 'apra-fleet-onboarding',
+        data: text,
+      });
+    } catch (e: unknown) {
+      const msg = (e instanceof Error ? e.message : String(e));
+      if (!/logging|method not found|not supported/i.test(msg)) {
+        process.stderr.write(`[apra-fleet] onboarding notification failed: ${msg}\n`);
+      }
+    }
+  }
+
+  function sanitizeToolResult(s: string): string {
+    return s.replace(/<\/?apra-fleet-display[^>]*(?:>|$)/gi, '[tag-stripped]');
+  }
+
+  function getOnboardingPreamble(toolName: string, isJson: boolean): string | null {
+    if (!isActiveTool(toolName)) return null;
+    const banner = getFirstRunPreamble();
+    if (banner) return banner;
+    if (isJson) return null;
+    return getWelcomeBackPreamble();
+  }
+
+  function wrapTool(toolName: string, handler: (input: any, extra?: any) => Promise<string>) {
+    return async (input: any, extra?: any) => {
+      const result = await handler(input, extra);
+      const isJson = isJsonResponse(result);
+      const preamble = getOnboardingPreamble(toolName, isJson);
+      const suffix = isJson ? null : getOnboardingNudge(toolName, input, result);
+
+      if (preamble) void sendOnboardingNotification(server, preamble);
+      if (suffix)   void sendOnboardingNotification(server, suffix);
+
+      const content: Array<{ type: 'text'; text: string; annotations?: { audience?: ('user' | 'assistant')[]; priority?: number } }> = [];
+      if (preamble) {
+        content.push({ type: 'text' as const, text: `<apra-fleet-display>\n${preamble}\n</apra-fleet-display>`, annotations: { audience: ['user'], priority: 1 } });
+      }
+      content.push({ type: 'text' as const, text: sanitizeToolResult(result) });
+      if (suffix) {
+        content.push({ type: 'text' as const, text: `<apra-fleet-display>\n${suffix}\n</apra-fleet-display>`, annotations: { audience: ['user'], priority: 0.8 } });
+      }
+      return { content };
+    };
+  }
+
+  // Core Member Management
+  server.tool('register_member', 'Add a machine to the fleet. Use member_type "local" for this machine or "remote" for a machine reachable over SSH. Choose the AI provider the member will use for prompts.', registerMemberSchema.shape, wrapTool('register_member', (input) => registerMember(input as any)));
+  server.tool('list_members', 'List all fleet members and their current status. Use format="json" for structured data.', listMembersSchema.shape, wrapTool('list_members', (input) => listMembers(input as any)));
+  server.tool('remove_member', 'Remove a member from the fleet.', removeMemberSchema.shape, wrapTool('remove_member', (input) => removeMember(input as any)));
+  server.tool('update_member', "Change a member's name, connection details, working directory, AI provider, or other settings.", updateMemberSchema.shape, wrapTool('update_member', (input) => updateMember(input as any)));
+
+  // File Operations
+  server.tool('send_files', 'Transfer local files to a member. Always batch multiple files into a single call — never invoke repeatedly for individual files.', sendFilesSchema.shape, wrapTool('send_files', (input, extra) => sendFiles(input as any, extra)));
+  server.tool('receive_files', 'Download files from a member to a local directory. Always batch multiple files into a single call — never invoke repeatedly for individual files.', receiveFilesSchema.shape, wrapTool('receive_files', (input, extra) => receiveFiles(input as any, extra)));
+
+  // Prompt Execution
+  server.tool('execute_prompt', 'IMP: Never call this tool directly. Always wrap in a background subagent: Agent(run_in_background=true). Run an AI prompt on a member. Supports session resume for multi-turn conversations.', executePromptSchema.shape, wrapTool('execute_prompt', (input, extra) => executePrompt(input as any, extra)));
+  server.tool('execute_command', 'IMP: Never call this tool directly. Always wrap in a background subagent: Agent(run_in_background=true). Run a shell command on a member. Use for quick tasks like installing packages, checking versions, or running scripts.', executeCommandSchema.shape, wrapTool('execute_command', (input, extra) => executeCommand(input as any, extra)));
+
+  // Authentication & SSH
+  server.tool('provision_llm_auth', "Authenticate a fleet member so it can run prompts. Copies your current login session to the member, or deploys an API key if provided. Run this before execute_prompt if the member reports no authentication.", provisionAuthSchema.shape, wrapTool('provision_llm_auth', (input) => provisionAuth(input as any)));
+  server.tool('setup_ssh_key', 'Generate an SSH key pair and migrate a member from password to key-based authentication.', setupSSHKeySchema.shape, wrapTool('setup_ssh_key', (input) => setupSSHKey(input as any)));
+  server.tool('setup_git_app', "One-time setup: register a GitHub App for git token minting. Requires a GitHub App ID, private key (.pem) file path, and installation ID. The app must already be created at github.com/organizations/{org}/settings/apps.", setupGitAppSchema.shape, wrapTool('setup_git_app', (input) => setupGitApp(input as any)));
+  server.tool('provision_vcs_auth', 'Set up git access credentials on a member. Supports GitHub, Bitbucket, and Azure DevOps. Tests connectivity after setup.', provisionVcsAuthSchema.shape, wrapTool('provision_vcs_auth', (input) => provisionVcsAuth(input as any)));
+  server.tool('revoke_vcs_auth', 'Remove VCS credentials from a member. Specify the provider (github, bitbucket, or azure-devops) to revoke.', revokeVcsAuthSchema.shape, wrapTool('revoke_vcs_auth', (input) => revokeVcsAuth(input as any)));
+
+  // Status & Monitoring
+  server.tool('fleet_status', 'Get status of all fleet members. Use json format for structured data.', fleetStatusSchema.shape, wrapTool('fleet_status', (input) => fleetStatus(input as any)));
+  server.tool('member_detail', 'Get detailed status for one member: connectivity, AI version, authentication, active session, resources, and git branch.', memberDetailSchema.shape, wrapTool('member_detail', (input) => memberDetail(input as any)));
+
+  // Maintenance
+  server.tool('update_llm_cli', "Update or install the AI provider CLI on members. Omit member to update all online members at once. Use install_if_missing to install on members that don't have it yet.", updateAgentCliSchema.shape, wrapTool('update_llm_cli', (input) => updateAgentCli(input as any)));
+  server.tool('shutdown_server', 'Gracefully shut down the MCP server. Run /mcp afterwards to start a fresh instance with the latest code.', shutdownServerSchema.shape, wrapTool('shutdown_server', () => shutdownServer()));
+  server.tool('version', 'Returns the installed apra-fleet server version', versionSchema.shape, wrapTool('version', () => version()));
+
+  // Permissions
+  server.tool('compose_permissions', 'Set up and deliver the right permissions to a member for their role. Automatically tailors permissions to the project type. Use grant to add specific permissions mid-sprint without a full recompose.', composePermissionsSchema.shape, wrapTool('compose_permissions', (input) => composePermissions(input as any)));
+
+  // Cloud Control
+  server.tool('cloud_control', 'Manually start, stop, or check status of a cloud fleet member. Start waits until the member is ready; stop is immediate.', cloudControlSchema.shape, wrapTool('cloud_control', (input) => cloudControl(input as any)));
+  server.tool('monitor_task', 'Check status of a long-running background task on a cloud member. Optionally stop the cloud instance automatically when the task completes.', monitorTaskSchema.shape, wrapTool('monitor_task', (input) => monitorTask(input as any)));
+
+  // Agent Lifecycle
+  server.tool('stop_prompt', 'Kill the active LLM process on a member. Always call TaskStop on the dispatching background agent after calling this.', stopPromptSchema.shape, wrapTool('stop_prompt', (input) => stopPrompt(input as any)));
+
+  // Credential Store
+  server.tool('credential_store_set', 'Collect a secret from the user out-of-band and store it. Returns a handle (sec://NAME) and scope. Use {{secure.NAME}} tokens in execute_command to inject the value.', credentialStoreSetSchema.shape, wrapTool('credential_store_set', (input) => credentialStoreSet(input as any)));
+  server.tool('credential_store_list', 'List all stored credentials (names and metadata only — no values).', credentialStoreListSchema.shape, wrapTool('credential_store_list', () => credentialStoreList()));
+  server.tool('credential_store_delete', 'Delete a named credential from the store (both session and persistent tiers).', credentialStoreDeleteSchema.shape, wrapTool('credential_store_delete', (input) => credentialStoreDelete(input as any)));
+  server.tool('credential_store_update', 'Update metadata (members, TTL, network policy) on an existing credential without re-entering the secret.', credentialStoreUpdateSchema.shape, wrapTool('credential_store_update', (input) => credentialStoreUpdate(input as any)));
+}
diff --git a/src/tools/shutdown-server.ts b/src/tools/shutdown-server.ts
index d6eab45a..7f4f59f1 100644
--- a/src/tools/shutdown-server.ts
+++ b/src/tools/shutdown-server.ts
@@ -1,9 +1,22 @@
 import { z } from 'zod';
+import fs from 'node:fs';
 import { closeAllConnections } from '../services/ssh.js';
+import type { HttpTransportHandle } from '../services/http-transport.js';
+import { SERVER_INFO_PATH } from '../paths.js';
 
 export const shutdownServerSchema = z.object({});
 
+let httpHandle: HttpTransportHandle | null = null;
+
+export function setHttpHandle(handle: HttpTransportHandle): void {
+  httpHandle = handle;
+}
+
 export async function shutdownServer(): Promise<string> {
+  if (httpHandle) {
+    try { fs.unlinkSync(SERVER_INFO_PATH); } catch {}
+    await httpHandle.close();
+  }
   closeAllConnections();
   setTimeout(() => process.exit(0), 100);
   return 'Server shutting down. Run /mcp to start a fresh instance.';
diff --git a/src/utils/process-utils.ts b/src/utils/process-utils.ts
new file mode 100644
index 00000000..5141a23f
--- /dev/null
+++ b/src/utils/process-utils.ts
@@ -0,0 +1,30 @@
+import http from 'node:http';
+
+export function isPidAlive(pid: number): boolean {
+  try {
+    process.kill(pid, 0);
+    return true;
+  } catch {
+    return false;
+  }
+}
+
+export function postShutdown(url: string): Promise<void> {
+  return new Promise((resolve) => {
+    const shutdownUrl = url.replace(/\/mcp$/, '/shutdown');
+    const parsed = new URL(shutdownUrl);
+    const req = http.request(
+      {
+        hostname: parsed.hostname,
+        port: Number(parsed.port),
+        path: parsed.pathname,
+        method: 'POST',
+        timeout: 3000,
+      },
+      (res) => { res.resume(); resolve(); },
+    );
+    req.on('error', () => resolve());
+    req.on('timeout', () => { req.destroy(); resolve(); });
+    req.end();
+  });
+}
diff --git a/tests/cli-verbs.test.ts b/tests/cli-verbs.test.ts
new file mode 100644
index 00000000..67feacf2
--- /dev/null
+++ b/tests/cli-verbs.test.ts
@@ -0,0 +1,327 @@
+import { describe, it, expect, vi, beforeEach, afterEach } from 'vitest';
+import fs from 'node:fs';
+import http from 'node:http';
+import { spawn } from 'node:child_process';
+
+// ---------------------------------------------------------------------------
+// Hoisted mock refs — local modules only (these are safe; factory mocks for
+// built-in node modules leak in fileParallelism:false mode, so we use spies)
+// ---------------------------------------------------------------------------
+const { mockCheckRunning, mockGetSvcMgr, mockSvcMgr } = vi.hoisted(() => {
+  const mockSvcMgr = {
+    isInstalled: vi.fn<() => Promise<boolean>>().mockResolvedValue(false),
+    start: vi.fn<() => Promise<void>>().mockResolvedValue(undefined),
+    stop: vi.fn<() => Promise<void>>().mockResolvedValue(undefined),
+    query: vi.fn<() => Promise<{ installed: boolean; running: boolean; enabled?: boolean }>>()
+      .mockResolvedValue({ installed: false, running: false }),
+    register: vi.fn<() => Promise<void>>().mockResolvedValue(undefined),
+    unregister: vi.fn<() => Promise<void>>().mockResolvedValue(undefined),
+  };
+  return {
+    mockCheckRunning: vi.fn<() => Promise<{ running: boolean; url?: string; pid?: number }>>()
+      .mockResolvedValue({ running: false }),
+    mockGetSvcMgr: vi.fn<() => Promise<typeof mockSvcMgr>>().mockResolvedValue(mockSvcMgr),
+    mockSvcMgr,
+  };
+});
+
+vi.mock('../src/services/singleton.js', () => ({
+  checkRunningInstance: mockCheckRunning,
+}));
+
+vi.mock('../src/services/service-manager/index.js', () => ({
+  getServiceManager: mockGetSvcMgr,
+}));
+
+// Auto-mock (no factory) so named imports get stubs — auto-mocks clean up
+// between files in sequential mode; factory mocks do not.
+vi.mock('node:child_process');
+
+// ---------------------------------------------------------------------------
+// Imports of subjects under test (after mocks so mocks apply)
+// ---------------------------------------------------------------------------
+import { runStart } from '../src/cli/start.js';
+import { runStop } from '../src/cli/stop.js';
+import { runRestart } from '../src/cli/restart.js';
+import { runStatus } from '../src/cli/status.js';
+
+// ---------------------------------------------------------------------------
+// Shared fixtures
+// ---------------------------------------------------------------------------
+const RUNNING = { running: true as const, url: 'http://127.0.0.1:7523/mcp', pid: 1234 };
+const STOPPED = { running: false as const };
+const SERVER_INFO = JSON.stringify({ pid: 1234, port: 7523, url: 'http://127.0.0.1:7523/mcp' });
+const HEALTH_BODY = JSON.stringify({ version: 'v0.1', uptime: 30, sessions: 1 });
+
+// ---------------------------------------------------------------------------
+// Per-test spy helpers (vi.spyOn restores cleanly in afterEach — no leakage)
+// ---------------------------------------------------------------------------
+function setupFsSpies() {
+  vi.spyOn(fs, 'mkdirSync').mockReturnValue(undefined as any);
+  vi.spyOn(fs, 'openSync').mockReturnValue(3 as any);
+  vi.spyOn(fs, 'closeSync').mockReturnValue(undefined);
+  vi.spyOn(fs, 'unlinkSync').mockReturnValue(undefined);
+  vi.spyOn(fs, 'existsSync').mockReturnValue(true); // lets findProjectRoot() succeed
+  vi.spyOn(fs, 'readFileSync').mockReturnValue(SERVER_INFO as any);
+}
+
+function setupHttpSpies() {
+  const mockReq = { on: vi.fn().mockReturnThis(), end: vi.fn(), destroy: vi.fn() };
+  vi.spyOn(http, 'request').mockImplementation(
+    (_opts: any, cb?: (res: any) => void) => {
+      cb?.({ resume: vi.fn() });
+      return mockReq as any;
+    },
+  );
+  vi.spyOn(http, 'get').mockImplementation(
+    (_opts: any, cb?: (res: any) => void) => {
+      cb?.({
+        on(ev: string, handler: (...a: any[]) => void) {
+          if (ev === 'data') handler(Buffer.from(HEALTH_BODY));
+          if (ev === 'end') handler();
+        },
+      });
+      return mockReq as any;
+    },
+  );
+}
+
+// ---------------------------------------------------------------------------
+// runStart
+// ---------------------------------------------------------------------------
+describe('runStart', () => {
+  let logSpy: ReturnType<typeof vi.spyOn>;
+  let errSpy: ReturnType<typeof vi.spyOn>;
+  let exitSpy: ReturnType<typeof vi.spyOn>;
+
+  beforeEach(() => {
+    vi.clearAllMocks();
+    setupFsSpies();
+    mockCheckRunning.mockResolvedValue(STOPPED);
+    mockSvcMgr.isInstalled.mockResolvedValue(false);
+    vi.mocked(spawn).mockReturnValue({ unref: vi.fn() } as any);
+    logSpy = vi.spyOn(console, 'log').mockImplementation(() => {});
+    errSpy = vi.spyOn(console, 'error').mockImplementation(() => {});
+    exitSpy = vi.spyOn(process, 'exit').mockImplementation((() => {}) as () => never);
+  });
+
+  afterEach(() => {
+    vi.restoreAllMocks();
+    vi.useRealTimers();
+  });
+
+  it('reports already running and skips service manager when server is up', async () => {
+    mockCheckRunning.mockResolvedValue(RUNNING);
+    await runStart([]);
+    expect(logSpy).toHaveBeenCalledWith(expect.stringContaining('already running'));
+    expect(mockGetSvcMgr).not.toHaveBeenCalled();
+  });
+
+  it('calls service manager start when unit is installed', async () => {
+    mockSvcMgr.isInstalled.mockResolvedValue(true);
+    mockCheckRunning.mockResolvedValueOnce(STOPPED).mockResolvedValueOnce(RUNNING);
+    vi.useFakeTimers();
+    const p = runStart([]);
+    await vi.advanceTimersByTimeAsync(2001);
+    await p;
+    expect(mockSvcMgr.start).toHaveBeenCalled();
+  });
+
+  it('spawns a detached process when no service unit is installed', async () => {
+    mockCheckRunning.mockResolvedValueOnce(STOPPED).mockResolvedValueOnce(RUNNING);
+    vi.useFakeTimers();
+    const p = runStart([]);
+    await vi.advanceTimersByTimeAsync(2001);
+    await p;
+    expect(vi.mocked(spawn)).toHaveBeenCalledWith(
+      expect.any(String),
+      expect.arrayContaining(['--transport', 'http']),
+      expect.objectContaining({ detached: true }),
+    );
+  });
+
+  it('logs success URL after server comes up', async () => {
+    mockCheckRunning.mockResolvedValueOnce(STOPPED).mockResolvedValueOnce(RUNNING);
+    vi.useFakeTimers();
+    const p = runStart([]);
+    await vi.advanceTimersByTimeAsync(2001);
+    await p;
+    expect(logSpy).toHaveBeenCalledWith(expect.stringContaining('Server started'));
+  });
+
+  it('exits with code 1 when server does not come up in time', async () => {
+    mockCheckRunning.mockResolvedValue(STOPPED);
+    vi.useFakeTimers();
+    const p = runStart([]);
+    await vi.advanceTimersByTimeAsync(2001);
+    await p;
+    expect(exitSpy).toHaveBeenCalledWith(1);
+  });
+});
+
+// ---------------------------------------------------------------------------
+// runStop
+// ---------------------------------------------------------------------------
+describe('runStop', () => {
+  let logSpy: ReturnType<typeof vi.spyOn>;
+  let killSpy: ReturnType<typeof vi.spyOn>;
+
+  beforeEach(() => {
+    vi.clearAllMocks();
+    setupFsSpies();
+    setupHttpSpies();
+    mockCheckRunning.mockResolvedValue(STOPPED);
+    logSpy = vi.spyOn(console, 'log').mockImplementation(() => {});
+    // Make isPidAlive return false immediately so the polling loop exits
+    killSpy = vi.spyOn(process, 'kill').mockImplementation((_pid, sig) => {
+      if (sig === 0) throw Object.assign(new Error('ESRCH'), { code: 'ESRCH' });
+      return true;
+    });
+  });
+
+  afterEach(() => {
+    vi.restoreAllMocks();
+  });
+
+  it('logs "not running" and skips /shutdown when server is stopped', async () => {
+    await runStop([]);
+    expect(logSpy).toHaveBeenCalledWith('Server is not running.');
+    expect(http.request).not.toHaveBeenCalled();
+  });
+
+  it('posts /shutdown when server is running', async () => {
+    mockCheckRunning.mockResolvedValue(RUNNING);
+    await runStop([]);
+    expect(http.request).toHaveBeenCalled();
+  });
+
+  it('reports "Server stopped." after shutdown', async () => {
+    mockCheckRunning.mockResolvedValue(RUNNING);
+    await runStop([]);
+    expect(logSpy).toHaveBeenCalledWith('Server stopped.');
+  });
+
+  it('cleans up server.json and lock file after stop', async () => {
+    mockCheckRunning.mockResolvedValue(RUNNING);
+    await runStop([]);
+    expect(fs.unlinkSync).toHaveBeenCalledTimes(2);
+  });
+});
+
+// ---------------------------------------------------------------------------
+// runRestart
+// ---------------------------------------------------------------------------
+describe('runRestart', () => {
+  beforeEach(() => {
+    vi.clearAllMocks();
+    setupFsSpies();
+    setupHttpSpies();
+    vi.mocked(spawn).mockReturnValue({ unref: vi.fn() } as any);
+    vi.spyOn(console, 'log').mockImplementation(() => {});
+    vi.spyOn(process, 'exit').mockImplementation((() => {}) as () => never);
+    vi.spyOn(process, 'kill').mockImplementation((_pid, sig) => {
+      if (sig === 0) throw Object.assign(new Error('ESRCH'), { code: 'ESRCH' });
+      return true;
+    });
+    mockSvcMgr.isInstalled.mockResolvedValue(false);
+  });
+
+  afterEach(() => {
+    vi.restoreAllMocks();
+    vi.useRealTimers();
+  });
+
+  it('stops then starts the server', async () => {
+    mockCheckRunning
+      .mockResolvedValueOnce(RUNNING)   // stop: running
+      .mockResolvedValueOnce(STOPPED)   // start: not running
+      .mockResolvedValueOnce(RUNNING);  // start: verify after 2s
+    vi.useFakeTimers();
+    const p = runRestart([]);
+    await vi.advanceTimersByTimeAsync(2001);
+    await p;
+    expect(http.request).toHaveBeenCalled();   // /shutdown was posted
+    expect(vi.mocked(spawn)).toHaveBeenCalled(); // process was spawned
+  });
+
+  it('is idempotent when server is already stopped before restart', async () => {
+    mockCheckRunning
+      .mockResolvedValueOnce(STOPPED)   // stop: not running (no-op)
+      .mockResolvedValueOnce(STOPPED)   // start: not running
+      .mockResolvedValueOnce(RUNNING);  // start: verify after 2s
+    vi.useFakeTimers();
+    const p = runRestart([]);
+    await vi.advanceTimersByTimeAsync(2001);
+    await p;
+    expect(vi.mocked(spawn)).toHaveBeenCalled();
+  });
+});
+
+// ---------------------------------------------------------------------------
+// runStatus
+// ---------------------------------------------------------------------------
+describe('runStatus', () => {
+  let logSpy: ReturnType<typeof vi.spyOn>;
+
+  beforeEach(() => {
+    vi.clearAllMocks();
+    setupFsSpies();
+    setupHttpSpies();
+    mockCheckRunning.mockResolvedValue(STOPPED);
+    mockSvcMgr.query.mockResolvedValue({ installed: false, running: false });
+    logSpy = vi.spyOn(console, 'log').mockImplementation(() => {});
+  });
+
+  afterEach(() => {
+    vi.restoreAllMocks();
+  });
+
+  function output(): string {
+    return logSpy.mock.calls.map(c => c.join(' ')).join('\n');
+  }
+
+  it('shows stopped state when server is not running', async () => {
+    await runStatus([]);
+    expect(output()).toContain('stopped');
+  });
+
+  it('shows "not installed" when no service unit exists', async () => {
+    await runStatus([]);
+    expect(output()).toContain('not installed');
+  });
+
+  it('shows "installed (enabled)" when service unit is enabled', async () => {
+    mockSvcMgr.query.mockResolvedValue({ installed: true, running: true, enabled: true });
+    await runStatus([]);
+    expect(output()).toContain('installed (enabled)');
+  });
+
+  it('shows "installed (disabled)" when service unit is disabled', async () => {
+    mockSvcMgr.query.mockResolvedValue({ installed: true, running: false, enabled: false });
+    await runStatus([]);
+    expect(output()).toContain('installed (disabled)');
+  });
+
+  it('shows running state with URL when server is up', async () => {
+    mockCheckRunning.mockResolvedValue(RUNNING);
+    await runStatus([]);
+    expect(output()).toContain('running');
+    expect(output()).toContain(RUNNING.url);
+  });
+
+  it('shows health info (version, uptime, sessions) from /health endpoint', async () => {
+    mockCheckRunning.mockResolvedValue(RUNNING);
+    await runStatus([]);
+    expect(output()).toContain('v0.1');
+    expect(output()).toContain('30s');
+    expect(output()).toContain('1');
+  });
+
+  it('omits live fields when server is stopped', async () => {
+    await runStatus([]);
+    const out = output();
+    expect(out).not.toContain('PID');
+    expect(out).not.toContain('Port');
+    expect(out).not.toContain('URL');
+  });
+});
diff --git a/tests/credential-event.test.ts b/tests/credential-event.test.ts
new file mode 100644
index 00000000..a44444f9
--- /dev/null
+++ b/tests/credential-event.test.ts
@@ -0,0 +1,132 @@
+import { describe, it, expect, beforeEach, afterEach, vi } from 'vitest';
+import net from 'node:net';
+import {
+  getSocketPath,
+  ensureAuthSocket,
+  createPendingAuth,
+  cleanupAuthSocket,
+  waitForPassword,
+} from '../src/services/auth-socket.js';
+import { fleetEvents } from '../src/services/event-bus.js';
+
+describe('credential-event', () => {
+  afterEach(async () => {
+    await cleanupAuthSocket();
+    fleetEvents.removeAllListeners();
+    vi.restoreAllMocks();
+  });
+
+  describe('credential:stored event emission', () => {
+    it('emits credential:stored event when OOB password is delivered', async () => {
+      await ensureAuthSocket();
+      createPendingAuth('web1');
+
+      const emitSpy = vi.spyOn(fleetEvents, 'emit');
+      const sockPath = getSocketPath();
+
+      // Start waiting for the password (creates a waiter)
+      const passwordPromise = waitForPassword('web1', 5000);
+
+      await new Promise<void>((resolve, reject) => {
+        const client = net.connect(sockPath, () => {
+          client.write(JSON.stringify({ type: 'auth', member_name: 'web1', password: 'secret123' }) + '\n');
+        });
+
+        let buffer = '';
+        client.on('data', (chunk) => {
+          buffer += chunk.toString();
+          const nl = buffer.indexOf('\n');
+          if (nl === -1) return;
+          const resp = JSON.parse(buffer.slice(0, nl));
+          expect(resp.ok).toBe(true);
+          client.end();
+          client.destroy();
+          resolve();
+        });
+        client.on('error', (err) => {
+          client.destroy();
+          reject(err);
+        });
+      });
+
+      // Wait for the password to be resolved
+      const pw = await passwordPromise;
+      expect(pw).toBeTruthy();
+
+      expect(emitSpy).toHaveBeenCalledWith('credential:stored', { name: 'web1' });
+    });
+
+    it('emits credential:stored with correct member name', async () => {
+      await ensureAuthSocket();
+      const memberName = 'prod-database';
+      createPendingAuth(memberName);
+
+      const emitSpy = vi.spyOn(fleetEvents, 'emit');
+      const sockPath = getSocketPath();
+
+      // Start waiting for the password (creates a waiter)
+      const passwordPromise = waitForPassword(memberName, 5000);
+
+      await new Promise<void>((resolve, reject) => {
+        const client = net.connect(sockPath, () => {
+          client.write(JSON.stringify({ type: 'auth', member_name: memberName, password: 'pw123' }) + '\n');
+        });
+
+        let buffer = '';
+        client.on('data', (chunk) => {
+          buffer += chunk.toString();
+          if (buffer.indexOf('\n') !== -1) {
+            client.end();
+            client.destroy();
+            resolve();
+          }
+        });
+        client.on('error', (err) => {
+          client.destroy();
+          reject(err);
+        });
+      });
+
+      // Wait for the password to be resolved
+      const pw = await passwordPromise;
+      expect(pw).toBeTruthy();
+
+      const calls = emitSpy.mock.calls.filter((call) => call[0] === 'credential:stored');
+      expect(calls).toHaveLength(1);
+      expect(calls[0][1]).toEqual({ name: memberName });
+    });
+
+    it('emits credential:stored only on successful password delivery', async () => {
+      await ensureAuthSocket();
+      createPendingAuth('web1');
+
+      const emitSpy = vi.spyOn(fleetEvents, 'emit');
+      const sockPath = getSocketPath();
+
+      // Send invalid message (no pending auth for 'unknown')
+      await new Promise<void>((resolve, reject) => {
+        const client = net.connect(sockPath, () => {
+          client.write(JSON.stringify({ type: 'auth', member_name: 'unknown', password: 'pw' }) + '\n');
+        });
+
+        let buffer = '';
+        client.on('data', (chunk) => {
+          buffer += chunk.toString();
+          if (buffer.indexOf('\n') !== -1) {
+            client.end();
+            client.destroy();
+            resolve();
+          }
+        });
+        client.on('error', (err) => {
+          client.destroy();
+          reject(err);
+        });
+      });
+
+      // Should not emit for invalid/failed delivery
+      const credentialCalls = emitSpy.mock.calls.filter((call) => call[0] === 'credential:stored');
+      expect(credentialCalls).toHaveLength(0);
+    });
+  });
+});
diff --git a/tests/event-bus.test.ts b/tests/event-bus.test.ts
new file mode 100644
index 00000000..a5d15793
--- /dev/null
+++ b/tests/event-bus.test.ts
@@ -0,0 +1,221 @@
+import { describe, it, expect, beforeEach } from 'vitest';
+import { fleetEvents, FleetEventMap } from '../src/services/event-bus.js';
+
+describe('event-bus: TypedEventBus', () => {
+  beforeEach(() => {
+    fleetEvents.removeAllListeners();
+  });
+
+  describe('emit and subscribe', () => {
+    it('delivers credentials:stored events to all subscribers', () => {
+      const results: { name: string }[] = [];
+
+      const handler = (payload: FleetEventMap['credential:stored']) => {
+        results.push(payload);
+      };
+
+      fleetEvents.on('credential:stored', handler);
+      fleetEvents.emit('credential:stored', { name: 'test-cred' });
+
+      expect(results).toHaveLength(1);
+      expect(results[0]).toEqual({ name: 'test-cred' });
+    });
+
+    it('delivers to multiple subscribers', () => {
+      const results1: { name: string }[] = [];
+      const results2: { name: string }[] = [];
+
+      const handler1 = (payload: FleetEventMap['credential:stored']) => {
+        results1.push(payload);
+      };
+      const handler2 = (payload: FleetEventMap['credential:stored']) => {
+        results2.push(payload);
+      };
+
+      fleetEvents.on('credential:stored', handler1);
+      fleetEvents.on('credential:stored', handler2);
+      fleetEvents.emit('credential:stored', { name: 'shared-cred' });
+
+      expect(results1).toHaveLength(1);
+      expect(results1[0]).toEqual({ name: 'shared-cred' });
+      expect(results2).toHaveLength(1);
+      expect(results2[0]).toEqual({ name: 'shared-cred' });
+    });
+
+    it('calls listeners multiple times for multiple emits', () => {
+      const results: { name: string }[] = [];
+
+      fleetEvents.on('credential:stored', (payload) => {
+        results.push(payload);
+      });
+
+      fleetEvents.emit('credential:stored', { name: 'cred1' });
+      fleetEvents.emit('credential:stored', { name: 'cred2' });
+      fleetEvents.emit('credential:stored', { name: 'cred3' });
+
+      expect(results).toHaveLength(3);
+      expect(results[0]).toEqual({ name: 'cred1' });
+      expect(results[1]).toEqual({ name: 'cred2' });
+      expect(results[2]).toEqual({ name: 'cred3' });
+    });
+  });
+
+  describe('unsubscribe (off)', () => {
+    it('prevents delivery to unsubscribed listeners', () => {
+      const results: { name: string }[] = [];
+
+      const handler = (payload: FleetEventMap['credential:stored']) => {
+        results.push(payload);
+      };
+
+      fleetEvents.on('credential:stored', handler);
+      fleetEvents.emit('credential:stored', { name: 'before-off' });
+
+      fleetEvents.off('credential:stored', handler);
+      fleetEvents.emit('credential:stored', { name: 'after-off' });
+
+      expect(results).toHaveLength(1);
+      expect(results[0]).toEqual({ name: 'before-off' });
+    });
+
+    it('does not affect other subscribers when one is removed', () => {
+      const results1: { name: string }[] = [];
+      const results2: { name: string }[] = [];
+
+      const handler1 = (payload: FleetEventMap['credential:stored']) => {
+        results1.push(payload);
+      };
+      const handler2 = (payload: FleetEventMap['credential:stored']) => {
+        results2.push(payload);
+      };
+
+      fleetEvents.on('credential:stored', handler1);
+      fleetEvents.on('credential:stored', handler2);
+      fleetEvents.emit('credential:stored', { name: 'shared1' });
+
+      fleetEvents.off('credential:stored', handler1);
+      fleetEvents.emit('credential:stored', { name: 'shared2' });
+
+      expect(results1).toHaveLength(1);
+      expect(results1[0]).toEqual({ name: 'shared1' });
+      expect(results2).toHaveLength(2);
+      expect(results2[0]).toEqual({ name: 'shared1' });
+      expect(results2[1]).toEqual({ name: 'shared2' });
+    });
+  });
+
+  describe('multiple event types', () => {
+    it('different event types are independent', () => {
+      const credentialResults: { name: string }[] = [];
+      const taskResults: { taskId: string; status: string }[] = [];
+
+      fleetEvents.on('credential:stored', (payload) => {
+        credentialResults.push(payload);
+      });
+      fleetEvents.on('task:completed', (payload) => {
+        taskResults.push(payload);
+      });
+
+      fleetEvents.emit('credential:stored', { name: 'cred' });
+      fleetEvents.emit('task:completed', { taskId: 'task1', status: 'done' });
+
+      expect(credentialResults).toHaveLength(1);
+      expect(credentialResults[0]).toEqual({ name: 'cred' });
+      expect(taskResults).toHaveLength(1);
+      expect(taskResults[0]).toEqual({ taskId: 'task1', status: 'done' });
+    });
+
+    it('emitting one event type does not trigger listeners of other types', () => {
+      const credentialResults: { name: string }[] = [];
+      const memberResults: { memberId: string; status: string }[] = [];
+
+      fleetEvents.on('credential:stored', (payload) => {
+        credentialResults.push(payload);
+      });
+      fleetEvents.on('member:status-changed', (payload) => {
+        memberResults.push(payload);
+      });
+
+      fleetEvents.emit('credential:stored', { name: 'cred' });
+
+      expect(credentialResults).toHaveLength(1);
+      expect(memberResults).toHaveLength(0);
+    });
+  });
+
+  describe('once: one-time listeners', () => {
+    it('once listener fires only once', () => {
+      const results: { name: string }[] = [];
+
+      fleetEvents.once('credential:stored', (payload) => {
+        results.push(payload);
+      });
+
+      fleetEvents.emit('credential:stored', { name: 'first' });
+      fleetEvents.emit('credential:stored', { name: 'second' });
+
+      expect(results).toHaveLength(1);
+      expect(results[0]).toEqual({ name: 'first' });
+    });
+  });
+
+  describe('typed payload correctness', () => {
+    it('task:completed payload has taskId and status', () => {
+      let receivedPayload: FleetEventMap['task:completed'] | null = null;
+
+      fleetEvents.on('task:completed', (payload) => {
+        receivedPayload = payload;
+      });
+
+      fleetEvents.emit('task:completed', {
+        taskId: 'task-123',
+        status: 'completed',
+      });
+
+      expect(receivedPayload).not.toBeNull();
+      expect(receivedPayload).toEqual({
+        taskId: 'task-123',
+        status: 'completed',
+      });
+    });
+
+    it('member:status-changed payload has memberId and status', () => {
+      let receivedPayload: FleetEventMap['member:status-changed'] | null =
+        null;
+
+      fleetEvents.on('member:status-changed', (payload) => {
+        receivedPayload = payload;
+      });
+
+      fleetEvents.emit('member:status-changed', {
+        memberId: 'member-456',
+        status: 'offline',
+      });
+
+      expect(receivedPayload).not.toBeNull();
+      expect(receivedPayload).toEqual({
+        memberId: 'member-456',
+        status: 'offline',
+      });
+    });
+
+    it('stall:detected payload has memberId and memberName', () => {
+      let receivedPayload: FleetEventMap['stall:detected'] | null = null;
+
+      fleetEvents.on('stall:detected', (payload) => {
+        receivedPayload = payload;
+      });
+
+      fleetEvents.emit('stall:detected', {
+        memberId: 'member-789',
+        memberName: 'test-member',
+      });
+
+      expect(receivedPayload).not.toBeNull();
+      expect(receivedPayload).toEqual({
+        memberId: 'member-789',
+        memberName: 'test-member',
+      });
+    });
+  });
+});
diff --git a/tests/http-transport.test.ts b/tests/http-transport.test.ts
new file mode 100644
index 00000000..e8cfd587
--- /dev/null
+++ b/tests/http-transport.test.ts
@@ -0,0 +1,177 @@
+import { describe, it, expect, afterEach, beforeEach } from 'vitest';
+import net from 'node:net';
+import { Client } from '@modelcontextprotocol/sdk/client/index.js';
+import { StreamableHTTPClientTransport } from '@modelcontextprotocol/sdk/client/streamableHttp.js';
+import { LoggingMessageNotificationSchema } from '@modelcontextprotocol/sdk/types.js';
+import { McpServer } from '@modelcontextprotocol/sdk/server/mcp.js';
+import { createHttpTransport, HttpTransportHandle } from '../src/services/http-transport.js';
+import { fleetEvents } from '../src/services/event-bus.js';
+
+function noop(_server: McpServer): void {
+  // no tools registered in these tests
+}
+
+function makeClient(port: number): Client {
+  return new Client({ name: 'test-client', version: '1.0.0' }, { capabilities: {} });
+}
+
+function makeTransport(port: number): StreamableHTTPClientTransport {
+  return new StreamableHTTPClientTransport(
+    new URL(`http://127.0.0.1:${port}/mcp`),
+    { reconnectionOptions: { maxRetries: 0, maxReconnectionDelay: 100, initialReconnectionDelay: 100, reconnectionDelayGrowFactor: 1 } }
+  );
+}
+
+const handles: HttpTransportHandle[] = [];
+const clients: Client[] = [];
+
+afterEach(async () => {
+  for (const client of clients.splice(0)) {
+    try { await client.close(); } catch { /* ignore */ }
+  }
+  fleetEvents.removeAllListeners();
+  for (const handle of handles.splice(0)) {
+    try { await handle.close(); } catch { /* ignore */ }
+  }
+});
+
+// ---------------------------------------------------------------------------
+// (a) Server binds to 127.0.0.1 only
+// ---------------------------------------------------------------------------
+describe('(a) server binds to 127.0.0.1', () => {
+  it('address is 127.0.0.1', async () => {
+    const handle = await createHttpTransport({ registerTools: noop, preferredPort: 0 });
+    handles.push(handle);
+    const addr = handle.httpServer.address() as net.AddressInfo;
+    expect(addr.address).toBe('127.0.0.1');
+    expect(addr.port).toBeGreaterThan(0);
+  });
+
+  it('url reflects 127.0.0.1', async () => {
+    const handle = await createHttpTransport({ registerTools: noop, preferredPort: 0 });
+    handles.push(handle);
+    expect(handle.url).toMatch(/^http:\/\/127\.0\.0\.1:\d+\/mcp$/);
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (b) Two clients connect concurrently with separate sessions
+// ---------------------------------------------------------------------------
+describe('(b) two concurrent clients get separate sessions', () => {
+  it('sessions map has two entries after both clients connect', async () => {
+    const handle = await createHttpTransport({ registerTools: noop, preferredPort: 0 });
+    handles.push(handle);
+
+    const c1 = makeClient(handle.port);
+    const c2 = makeClient(handle.port);
+    clients.push(c1, c2);
+
+    await Promise.all([
+      c1.connect(makeTransport(handle.port)),
+      c2.connect(makeTransport(handle.port)),
+    ]);
+
+    expect(handle.sessions.size).toBe(2);
+    const ids = [...handle.sessions.keys()];
+    expect(ids[0]).not.toBe(ids[1]);
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (c) Event bus emit reaches BOTH connected clients as logging notifications
+// ---------------------------------------------------------------------------
+describe('(c) event bus broadcasts to all sessions', () => {
+  it('credential:stored reaches both clients', async () => {
+    const handle = await createHttpTransport({ registerTools: noop, preferredPort: 0 });
+    handles.push(handle);
+
+    // Track GET /mcp requests (standalone SSE streams from clients)
+    let sseGetCount = 0;
+    handle.httpServer.on('request', (req) => {
+      if (req.method === 'GET' && req.url === '/mcp') sseGetCount++;
+    });
+
+    const c1 = makeClient(handle.port);
+    const c2 = makeClient(handle.port);
+    clients.push(c1, c2);
+
+    const received1: unknown[] = [];
+    const received2: unknown[] = [];
+
+    c1.setNotificationHandler(LoggingMessageNotificationSchema, (n) => {
+      received1.push(n.params.data);
+    });
+    c2.setNotificationHandler(LoggingMessageNotificationSchema, (n) => {
+      received2.push(n.params.data);
+    });
+
+    await Promise.all([
+      c1.connect(makeTransport(handle.port)),
+      c2.connect(makeTransport(handle.port)),
+    ]);
+
+    // Wait for both standalone GET SSE streams to be established
+    const deadline = Date.now() + 3000;
+    while (sseGetCount < 2 && Date.now() < deadline) {
+      await new Promise(resolve => setTimeout(resolve, 20));
+    }
+    expect(sseGetCount).toBeGreaterThanOrEqual(2);
+
+    fleetEvents.emit('credential:stored', { name: 'my-cred' });
+
+    // Allow notification to propagate
+    await new Promise(resolve => setTimeout(resolve, 300));
+
+    expect(received1).toHaveLength(1);
+    expect(received2).toHaveLength(1);
+    expect((received1[0] as { event: string }).event).toBe('credential:stored');
+    expect((received2[0] as { event: string }).event).toBe('credential:stored');
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (d) Client disconnect removes session from the map
+// ---------------------------------------------------------------------------
+describe('(d) disconnect removes session', () => {
+  it('session is removed when client terminates the session', async () => {
+    const handle = await createHttpTransport({ registerTools: noop, preferredPort: 0 });
+    handles.push(handle);
+
+    const c1 = makeClient(handle.port);
+    clients.push(c1);
+    const transport = makeTransport(handle.port);
+
+    await c1.connect(transport);
+    expect(handle.sessions.size).toBe(1);
+
+    // Terminate the session via DELETE
+    await transport.terminateSession();
+
+    // Allow cleanup to propagate
+    await new Promise(resolve => setTimeout(resolve, 100));
+
+    expect(handle.sessions.size).toBe(0);
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (e) Port fallback: when preferred port is busy, starts on random port
+// ---------------------------------------------------------------------------
+describe('(e) port fallback when preferred port is busy', () => {
+  it('starts on OS-assigned port when preferred port is in use', async () => {
+    // Occupy a port to force the fallback
+    const blocker = net.createServer();
+    await new Promise<void>(resolve => blocker.listen(0, '127.0.0.1', resolve));
+    const busyPort = (blocker.address() as net.AddressInfo).port;
+
+    try {
+      const handle = await createHttpTransport({ registerTools: noop, preferredPort: busyPort });
+      handles.push(handle);
+
+      expect(handle.port).not.toBe(busyPort);
+      expect(handle.port).toBeGreaterThan(0);
+    } finally {
+      await new Promise<void>(resolve => blocker.close(() => resolve()));
+    }
+  });
+});
diff --git a/tests/install-multi-provider.test.ts b/tests/install-multi-provider.test.ts
index 47d276c8..909f377e 100644
--- a/tests/install-multi-provider.test.ts
+++ b/tests/install-multi-provider.test.ts
@@ -376,7 +376,7 @@ describe('runInstall multi-provider', () => {
     expect(defaultModelWrite![1].toString()).toContain('gpt-5.4');
   });
 
-  it('Codex config.toml is valid TOML — every scalar string is properly double-quoted (#115)', async () => {
+  it('Codex config.toml is valid TOML (HTTP transport, url key)', async () => {
     await runInstall(['--llm', 'codex']);
 
     const codexConfig = path.join(mockHome, '.codex', 'config.toml');
@@ -386,6 +386,28 @@ describe('runInstall multi-provider', () => {
     expect(writes.length).toBeGreaterThan(0);
     const finalContent = writes.at(-1)![1].toString();
 
+    // Regression guard for #115: no bare/backslash-prefixed scalars.
+    expect(finalContent).not.toMatch(/=\s*\\/);
+    expect(finalContent).toMatch(/defaultModel\s*=\s*"gpt-5\.4"/);
+
+    // Parsing back with smol-toml must succeed and round-trip.
+    const parsed = parseToml(finalContent) as any;
+    expect(parsed.defaultModel).toBe('gpt-5.4');
+    // HTTP transport: url key, no command/args.
+    expect(typeof parsed.mcp_servers['apra-fleet'].url).toBe('string');
+    expect(parsed.mcp_servers['apra-fleet'].url).toContain('/mcp');
+  });
+
+  it('Codex config.toml is valid TOML — command/args for stdio transport (#115)', async () => {
+    await runInstall(['--llm', 'codex', '--transport', 'stdio']);
+
+    const codexConfig = path.join(mockHome, '.codex', 'config.toml');
+    const writes = vi.mocked(fs.writeFileSync).mock.calls.filter(c =>
+      c[0].toString().includes(codexConfig)
+    );
+    expect(writes.length).toBeGreaterThan(0);
+    const finalContent = writes.at(-1)![1].toString();
+
     // Regression guard for #115: no bare/backslash-prefixed scalars like `model = \gpt-5.3-codex`.
     // Every `key = value` scalar must either be quoted, a boolean, a number, a table, or an array.
     expect(finalContent).not.toMatch(/=\s*\\/);
@@ -394,7 +416,7 @@ describe('runInstall multi-provider', () => {
     // Parsing back with smol-toml must succeed and round-trip defaultModel.
     const parsed = parseToml(finalContent) as any;
     expect(parsed.defaultModel).toBe('gpt-5.4');
-    // mcp_servers.apra-fleet.command should be a plain string (proper TOML string literal).
+    // stdio transport: mcp_servers.apra-fleet.command should be a plain string (proper TOML string literal).
     expect(typeof parsed.mcp_servers['apra-fleet'].command).toBe('string');
     expect(Array.isArray(parsed.mcp_servers['apra-fleet'].args)).toBe(true);
   });
@@ -744,4 +766,131 @@ describe('runInstall multi-provider', () => {
     expect(pmIdx).toBeGreaterThanOrEqual(0);
     expect(fleetIdx).toBeLessThan(pmIdx);
   });
+
+  // -- Transport flag tests --
+
+  it('--transport http (default) uses URL-based Claude MCP registration', async () => {
+    await runInstall([]);
+
+    const calls = vi.mocked(execSync).mock.calls.map(c => c[0].toString());
+    const addCall = calls.find(c => c.includes('claude mcp add'));
+    expect(addCall).toBeDefined();
+    expect(addCall).toContain('--transport http');
+    expect(addCall).toContain('http://localhost:7523/mcp');
+  });
+
+  it('--transport stdio uses command+args Claude MCP registration', async () => {
+    await runInstall(['--transport', 'stdio']);
+
+    const calls = vi.mocked(execSync).mock.calls.map(c => c[0].toString());
+    const addCall = calls.find(c => c.includes('claude mcp add'));
+    expect(addCall).toBeDefined();
+    expect(addCall).not.toContain('--transport http');
+    expect(addCall).not.toContain('http://localhost:7523/mcp');
+  });
+
+  it('--transport http writes httpUrl for Gemini', async () => {
+    await runInstall(['--llm', 'gemini']);
+
+    const geminiSettings = path.join(mockHome, '.gemini', 'settings.json');
+    const writes = vi.mocked(fs.writeFileSync).mock.calls.filter(c =>
+      c[0].toString().includes(geminiSettings)
+    );
+    expect(writes.length).toBeGreaterThan(0);
+    const lastWrite = writes.at(-1)![1].toString();
+    const parsed = JSON.parse(lastWrite);
+    expect(parsed.mcpServers['apra-fleet'].httpUrl).toBe('http://localhost:7523/mcp');
+    expect(parsed.mcpServers['apra-fleet'].trust).toBe(true);
+  });
+
+  it('--transport stdio writes command+args for Gemini', async () => {
+    await runInstall(['--llm', 'gemini', '--transport', 'stdio']);
+
+    const geminiSettings = path.join(mockHome, '.gemini', 'settings.json');
+    const writes = vi.mocked(fs.writeFileSync).mock.calls.filter(c =>
+      c[0].toString().includes(geminiSettings)
+    );
+    expect(writes.length).toBeGreaterThan(0);
+    const lastWrite = writes.at(-1)![1].toString();
+    const parsed = JSON.parse(lastWrite);
+    expect(parsed.mcpServers['apra-fleet'].command).toBeDefined();
+    expect(parsed.mcpServers['apra-fleet'].httpUrl).toBeUndefined();
+  });
+
+  it('--transport http writes url+type for Copilot', async () => {
+    await runInstall(['--llm', 'copilot']);
+
+    const copilotSettings = path.join(mockHome, '.copilot', 'settings.json');
+    const writes = vi.mocked(fs.writeFileSync).mock.calls.filter(c =>
+      c[0].toString().includes(copilotSettings)
+    );
+    expect(writes.length).toBeGreaterThan(0);
+    const lastWrite = writes.at(-1)![1].toString();
+    const parsed = JSON.parse(lastWrite);
+    expect(parsed.mcpServers['apra-fleet'].url).toBe('http://localhost:7523/mcp');
+    expect(parsed.mcpServers['apra-fleet'].type).toBe('http');
+  });
+
+  it('--transport stdio writes command+args for Copilot', async () => {
+    await runInstall(['--llm', 'copilot', '--transport', 'stdio']);
+
+    const copilotSettings = path.join(mockHome, '.copilot', 'settings.json');
+    const writes = vi.mocked(fs.writeFileSync).mock.calls.filter(c =>
+      c[0].toString().includes(copilotSettings)
+    );
+    expect(writes.length).toBeGreaterThan(0);
+    const lastWrite = writes.at(-1)![1].toString();
+    const parsed = JSON.parse(lastWrite);
+    expect(parsed.mcpServers['apra-fleet'].command).toBeDefined();
+    expect(parsed.mcpServers['apra-fleet'].url).toBeUndefined();
+  });
+
+  it('--transport http writes url for Codex', async () => {
+    await runInstall(['--llm', 'codex']);
+
+    const codexConfig = path.join(mockHome, '.codex', 'config.toml');
+    const writes = vi.mocked(fs.writeFileSync).mock.calls.filter(c =>
+      c[0].toString().includes(codexConfig)
+    );
+    expect(writes.length).toBeGreaterThan(0);
+    const finalContent = writes.at(-1)![1].toString();
+    const parsed = parseToml(finalContent) as any;
+    expect(parsed.mcp_servers['apra-fleet'].url).toBe('http://localhost:7523/mcp');
+    expect(parsed.mcp_servers['apra-fleet'].command).toBeUndefined();
+  });
+
+  it('--transport http writes url for agy', async () => {
+    await runInstall(['--llm', 'agy']);
+
+    const agyMcpConfig = path.join(mockHome, '.gemini', 'config', 'mcp_config.json');
+    const writes = vi.mocked(fs.writeFileSync).mock.calls.filter(c =>
+      c[0].toString().includes(agyMcpConfig)
+    );
+    expect(writes.length).toBeGreaterThan(0);
+    const lastWrite = writes.at(-1)![1].toString();
+    const parsed = JSON.parse(lastWrite);
+    expect(parsed.mcpServers['apra-fleet'].url).toBe('http://localhost:7523/mcp');
+  });
+
+  it('--transport stdio writes command+args for agy', async () => {
+    await runInstall(['--llm', 'agy', '--transport', 'stdio']);
+
+    const agyMcpConfig = path.join(mockHome, '.gemini', 'config', 'mcp_config.json');
+    const writes = vi.mocked(fs.writeFileSync).mock.calls.filter(c =>
+      c[0].toString().includes(agyMcpConfig)
+    );
+    expect(writes.length).toBeGreaterThan(0);
+    const lastWrite = writes.at(-1)![1].toString();
+    const parsed = JSON.parse(lastWrite);
+    expect(parsed.mcpServers['apra-fleet'].command).toBeDefined();
+    expect(parsed.mcpServers['apra-fleet'].url).toBeUndefined();
+  });
+
+  it('--transport=invalid exits with error', async () => {
+    const exitSpy = vi.spyOn(process, 'exit').mockImplementation(() => { throw new Error('exit'); });
+
+    await expect(runInstall(['--transport=invalid'])).rejects.toThrow('exit');
+    expect(exitSpy).toHaveBeenCalledWith(1);
+    exitSpy.mockRestore();
+  });
 });
diff --git a/tests/install-service.test.ts b/tests/install-service.test.ts
new file mode 100644
index 00000000..140fc148
--- /dev/null
+++ b/tests/install-service.test.ts
@@ -0,0 +1,220 @@
+import { describe, it, expect, vi, beforeEach, afterEach } from 'vitest';
+import fs from 'node:fs';
+import os from 'node:os';
+import * as readline from 'node:readline/promises';
+import { runInstall, _setSeaOverride, _setManifestOverride } from '../src/cli/install.js';
+import { runUninstall } from '../src/cli/uninstall.js';
+import * as install from '../src/cli/install.js';
+
+// ---------------------------------------------------------------------------
+// Hoisted mock refs for service manager
+// ---------------------------------------------------------------------------
+const { mockGetSvcMgr, mockSvcMgr } = vi.hoisted(() => {
+  const mockSvcMgr = {
+    register: vi.fn<() => Promise<void>>().mockResolvedValue(undefined),
+    start: vi.fn<() => Promise<void>>().mockResolvedValue(undefined),
+    stop: vi.fn<() => Promise<void>>().mockResolvedValue(undefined),
+    query: vi.fn<() => Promise<{ installed: boolean; running: boolean }>>()
+      .mockResolvedValue({ installed: false, running: false }),
+    isInstalled: vi.fn<() => Promise<boolean>>().mockResolvedValue(false),
+    unregister: vi.fn<() => Promise<void>>().mockResolvedValue(undefined),
+  };
+  return {
+    mockGetSvcMgr: vi.fn<() => Promise<typeof mockSvcMgr>>().mockResolvedValue(mockSvcMgr),
+    mockSvcMgr,
+  };
+});
+
+// ---------------------------------------------------------------------------
+// Module mocks
+// ---------------------------------------------------------------------------
+vi.mock('node:os', () => ({
+  default: {
+    homedir: vi.fn(() => '/mock/home'),
+    platform: vi.fn(() => 'linux'),
+  },
+}));
+vi.mock('node:fs');
+vi.mock('node:child_process');
+vi.mock('../src/services/service-manager/index.js', () => ({
+  getServiceManager: mockGetSvcMgr,
+}));
+vi.mock('../src/cli/install.js', async (importOriginal) => {
+  const orig = await importOriginal<typeof import('../src/cli/install.js')>();
+  return {
+    ...orig,
+    isApraFleetRunning: vi.fn().mockReturnValue(false),
+  };
+});
+vi.mock('node:readline/promises', () => ({
+  createInterface: vi.fn(),
+}));
+
+// ---------------------------------------------------------------------------
+// FS mock helpers (mirrors install.test.ts pattern)
+// ---------------------------------------------------------------------------
+function makeFsMock() {
+  vi.mocked(fs.existsSync).mockImplementation((p: any) => {
+    const ps = p.toString();
+    if (ps.includes('version.json')) return true;
+    if (ps.includes('hooks-config.json')) return true;
+    return false;
+  });
+  vi.mocked(fs.readFileSync).mockImplementation((p: any) => {
+    const ps = p.toString();
+    if (ps.includes('version.json')) return JSON.stringify({ version: '0.1.0' });
+    if (ps.includes('hooks-config.json')) return JSON.stringify({ hooks: { PostToolUse: [] } });
+    if (ps.includes('install-config.json')) return JSON.stringify({ providers: { claude: { skill: 'all' } } });
+    if (ps.includes('settings.json')) return JSON.stringify({});
+    return '';
+  });
+  vi.mocked(fs.readdirSync).mockReturnValue([] as any);
+  vi.mocked(fs.mkdirSync).mockImplementation(() => undefined as any);
+  vi.mocked(fs.chmodSync).mockImplementation(() => {});
+  vi.mocked(fs.copyFileSync).mockImplementation(() => {});
+  vi.mocked(fs.writeFileSync).mockImplementation(() => {});
+  vi.mocked(fs.rmSync).mockImplementation(() => undefined);
+}
+
+// ---------------------------------------------------------------------------
+// Install service integration tests
+// ---------------------------------------------------------------------------
+describe('install -- service lifecycle (T11)', () => {
+  beforeEach(() => {
+    vi.clearAllMocks();
+    vi.mocked(os.homedir).mockReturnValue('/mock/home');
+    makeFsMock();
+    _setManifestOverride({ version: '0.1.0', hooks: {}, scripts: {}, skills: {}, fleetSkills: {} });
+    vi.spyOn(console, 'log').mockImplementation(() => {});
+    vi.spyOn(console, 'warn').mockImplementation(() => {});
+    vi.spyOn(console, 'error').mockImplementation(() => {});
+  });
+
+  afterEach(() => {
+    _setSeaOverride(null);
+    _setManifestOverride(null);
+  });
+
+  it('registers and starts service in SEA + HTTP mode', async () => {
+    _setSeaOverride(true);
+    await runInstall(['--transport', 'http', '--skill', 'none']);
+    expect(mockGetSvcMgr).toHaveBeenCalled();
+    expect(mockSvcMgr.register).toHaveBeenCalledWith(
+      expect.stringContaining('apra-fleet'),
+      ['--transport', 'http'],
+      expect.any(String),
+    );
+    expect(mockSvcMgr.start).toHaveBeenCalled();
+  });
+
+  it('skips service registration in stdio transport mode', async () => {
+    _setSeaOverride(true);
+    await runInstall(['--transport', 'stdio', '--skill', 'none']);
+    expect(mockSvcMgr.register).not.toHaveBeenCalled();
+    expect(mockSvcMgr.start).not.toHaveBeenCalled();
+  });
+
+  it('skips service registration in dev (non-SEA) mode', async () => {
+    _setSeaOverride(false);
+    await runInstall(['--transport', 'http', '--skill', 'none']);
+    expect(mockSvcMgr.register).not.toHaveBeenCalled();
+    expect(mockSvcMgr.start).not.toHaveBeenCalled();
+  });
+
+  it('shows "Service: registered and running" in done output when registered', async () => {
+    _setSeaOverride(true);
+    const logSpy = vi.mocked(console.log);
+    await runInstall(['--transport', 'http', '--skill', 'none']);
+    const allOutput = logSpy.mock.calls.flat().join('\n');
+    expect(allOutput).toContain('Service:');
+    expect(allOutput).toContain('registered and running');
+  });
+
+  it('warns (non-fatal) when service registration fails', async () => {
+    _setSeaOverride(true);
+    mockSvcMgr.register.mockRejectedValueOnce(new Error('schtasks access denied'));
+    const warnSpy = vi.mocked(console.warn);
+    await runInstall(['--transport', 'http', '--skill', 'none']);
+    expect(warnSpy).toHaveBeenCalledWith(expect.stringContaining('Service registration skipped'));
+  });
+
+  it('increments totalSteps by 1 in SEA + HTTP mode', async () => {
+    // With SEA + HTTP + no skills: base=6 steps, +1 service = 7 total
+    _setSeaOverride(true);
+    const logSpy = vi.mocked(console.log);
+    await runInstall(['--transport', 'http', '--skill', 'none']);
+    const allOutput = logSpy.mock.calls.flat().join('\n');
+    // Service step should show as [7/7]
+    expect(allOutput).toContain('[7/7]');
+    // Beads step should show as [6/7]
+    expect(allOutput).toContain('[6/7]');
+  });
+});
+
+// ---------------------------------------------------------------------------
+// Uninstall service integration tests
+// ---------------------------------------------------------------------------
+describe('uninstall -- service lifecycle (T12)', () => {
+  beforeEach(() => {
+    vi.clearAllMocks();
+    vi.mocked(os.homedir).mockReturnValue('/mock/home');
+    makeFsMock();
+    vi.mocked(fs.existsSync).mockReturnValue(true);
+    vi.mocked(fs.readFileSync).mockReturnValue(
+      JSON.stringify({ providers: { claude: { skill: 'all' } } }),
+    );
+    vi.mocked(install.isApraFleetRunning).mockReturnValue(false);
+    (readline.createInterface as any).mockReturnValue({
+      question: vi.fn().mockResolvedValue('y'),
+      close: vi.fn(),
+    });
+    vi.spyOn(console, 'log').mockImplementation(() => {});
+    vi.spyOn(console, 'warn').mockImplementation(() => {});
+    vi.spyOn(console, 'error').mockImplementation(() => {});
+    vi.spyOn(process, 'exit').mockImplementation(() => { throw new Error('exit'); });
+  });
+
+  it('calls unregister when server is not running', async () => {
+    await runUninstall(['--yes']);
+    expect(mockSvcMgr.unregister).toHaveBeenCalled();
+  });
+
+  it('calls stop then unregister when server is running and --force is passed', async () => {
+    vi.mocked(install.isApraFleetRunning).mockReturnValue(true);
+    await runUninstall(['--yes', '--force']);
+    expect(mockSvcMgr.stop).toHaveBeenCalled();
+    expect(mockSvcMgr.unregister).toHaveBeenCalled();
+    // stop must be called before unregister
+    const stopOrder = mockSvcMgr.stop.mock.invocationCallOrder[0];
+    const unregisterOrder = mockSvcMgr.unregister.mock.invocationCallOrder[0];
+    expect(stopOrder).toBeLessThan(unregisterOrder);
+  });
+
+  it('does not call stop when server is not running', async () => {
+    await runUninstall(['--yes']);
+    expect(mockSvcMgr.stop).not.toHaveBeenCalled();
+  });
+
+  it('does not call unregister in dry-run mode', async () => {
+    await runUninstall(['--dry-run', '--yes']);
+    expect(mockSvcMgr.unregister).not.toHaveBeenCalled();
+  });
+
+  it('does not call stop in dry-run mode even with --force and running server', async () => {
+    vi.mocked(install.isApraFleetRunning).mockReturnValue(true);
+    await runUninstall(['--dry-run', '--force', '--yes']);
+    expect(mockSvcMgr.stop).not.toHaveBeenCalled();
+  });
+
+  it('unregister error is swallowed (idempotent)', async () => {
+    mockSvcMgr.unregister.mockRejectedValueOnce(new Error('task not found'));
+    // Should complete without throwing
+    await runUninstall(['--yes']);
+  });
+
+  it('errors if server is running without --force', async () => {
+    vi.mocked(install.isApraFleetRunning).mockReturnValue(true);
+    await expect(runUninstall(['--yes'])).rejects.toThrow('exit');
+    expect(mockSvcMgr.stop).not.toHaveBeenCalled();
+  });
+});
diff --git a/tests/sea-http-verify.test.ts b/tests/sea-http-verify.test.ts
new file mode 100644
index 00000000..2de4c11e
--- /dev/null
+++ b/tests/sea-http-verify.test.ts
@@ -0,0 +1,96 @@
+/**
+ * Task 3: SEA Binary Compatibility Verification
+ *
+ * Verifies that src/services/http-transport.ts bundles correctly under esbuild
+ * (the same bundler used to produce dist/sea-bundle.cjs). The @hono/node-server
+ * package is a transitive dependency of StreamableHTTPServerTransport and has
+ * historically caused issues in bundled environments. This test surfaces any
+ * bundling problems before the transport is wired into the main binary.
+ */
+import { describe, it, expect, afterAll } from 'vitest';
+import { build } from 'esbuild';
+import { createRequire } from 'node:module';
+import fs from 'node:fs';
+import path from 'node:path';
+import os from 'node:os';
+import { fileURLToPath } from 'node:url';
+import { McpServer } from '@modelcontextprotocol/sdk/server/mcp.js';
+
+const __dirname = path.dirname(fileURLToPath(import.meta.url));
+const root = path.resolve(__dirname, '..');
+
+// Temporary bundle output path
+const BUNDLE_PATH = path.join(os.tmpdir(), `apra-fleet-sea-verify-${process.pid}.cjs`);
+
+// The actual http-transport source file (absolute path)
+const HTTP_TRANSPORT_SRC = path.join(root, 'src', 'services', 'http-transport.ts');
+
+afterAll(async () => {
+  try { fs.unlinkSync(BUNDLE_PATH); } catch { /* best-effort */ }
+});
+
+describe('SEA bundle compatibility: http-transport', () => {
+  let bundleSource = '';
+
+  it('esbuild bundles http-transport.ts without errors', async () => {
+    await build({
+      entryPoints: [HTTP_TRANSPORT_SRC],
+      bundle: true,
+      platform: 'node',
+      target: 'node22',
+      format: 'cjs',
+      outfile: BUNDLE_PATH,
+      sourcemap: false,
+      external: ['cpu-features'],
+      loader: { '.node': 'empty' },
+      // Shim import.meta.url exactly as in the real SEA build
+      define: { 'import.meta.url': 'import_meta_url' },
+      banner: {
+        js: 'var import_meta_url = typeof document === "undefined" ? require("url").pathToFileURL(__filename).href : undefined;',
+      },
+    });
+
+    expect(fs.existsSync(BUNDLE_PATH)).toBe(true);
+    bundleSource = fs.readFileSync(BUNDLE_PATH, 'utf8');
+    expect(bundleSource.length).toBeGreaterThan(1000);
+  });
+
+  it('bundle contains StreamableHTTPServerTransport code', () => {
+    expect(bundleSource).toBeTruthy();
+    expect(bundleSource).toContain('StreamableHTTPServerTransport');
+  });
+
+  it('bundle contains @hono/node-server adapter code', () => {
+    expect(bundleSource).toBeTruthy();
+    // @hono/node-server is the Node.js adapter used by StreamableHTTPServerTransport
+    // Its presence confirms the transitive dep bundled without requiring externals
+    expect(bundleSource).toMatch(/@hono\/node-server|hono.*node.*server|node.*hono/i);
+  });
+
+  it('bundled createHttpTransport starts and binds a port', async () => {
+    expect(fs.existsSync(BUNDLE_PATH)).toBe(true);
+
+    const req = createRequire(import.meta.url);
+    const mod = req(BUNDLE_PATH) as { createHttpTransport: typeof import('../src/services/http-transport.js').createHttpTransport };
+
+    expect(typeof mod.createHttpTransport).toBe('function');
+
+    const handle = await mod.createHttpTransport({
+      registerTools: (_server: McpServer) => {},
+      preferredPort: 0,
+    });
+
+    try {
+      expect(handle.port).toBeGreaterThan(0);
+      expect(handle.url).toMatch(/^http:\/\/127\.0\.0\.1:\d+\/mcp$/);
+
+      // Verify health endpoint responds
+      const resp = await fetch(`http://127.0.0.1:${handle.port}/health`);
+      expect(resp.status).toBe(200);
+      const json = await resp.json() as { status: string };
+      expect(json.status).toBe('ok');
+    } finally {
+      await handle.close();
+    }
+  });
+});
diff --git a/tests/service-manager.test.ts b/tests/service-manager.test.ts
new file mode 100644
index 00000000..7c1385d3
--- /dev/null
+++ b/tests/service-manager.test.ts
@@ -0,0 +1,390 @@
+import { describe, it, expect, vi, beforeEach, afterEach } from 'vitest';
+import fs from 'node:fs';
+import { execFileSync } from 'node:child_process';
+
+// vi.hoisted so these refs are available inside vi.mock factory closures
+const { mockGracefulStop } = vi.hoisted(() => ({
+  mockGracefulStop: vi.fn<(fallback?: (pid: number) => void) => Promise<void>>().mockResolvedValue(undefined),
+}));
+
+vi.mock('node:child_process');
+vi.mock('node:fs');
+vi.mock('node:os', () => ({
+  default: {
+    homedir: () => '/mock/home',
+    userInfo: () => ({ username: 'mockuser' }),
+  },
+}));
+vi.mock('../src/services/service-manager/index.js', () => ({
+  gracefulStopByServerJson: mockGracefulStop,
+}));
+
+import { WindowsServiceManager } from '../src/services/service-manager/windows.js';
+import { LinuxServiceManager } from '../src/services/service-manager/linux.js';
+import { MacOSServiceManager } from '../src/services/service-manager/macos.js';
+
+// ---------------------------------------------------------------------------
+// Windows
+// ---------------------------------------------------------------------------
+describe('WindowsServiceManager', () => {
+  beforeEach(() => {
+    vi.clearAllMocks();
+    vi.mocked(execFileSync).mockReturnValue('' as any);
+    vi.mocked(fs.mkdirSync).mockReturnValue(undefined as any);
+    vi.mocked(fs.writeFileSync).mockReturnValue(undefined);
+    vi.mocked(fs.unlinkSync).mockReturnValue(undefined);
+  });
+
+  describe('register', () => {
+    it('writes wrapper bat containing the binary invocation', async () => {
+      const mgr = new WindowsServiceManager();
+      await mgr.register('/bin/apra-fleet.exe', ['--transport', 'http'], '/logs/fleet.log');
+      expect(fs.writeFileSync).toHaveBeenCalledWith(
+        expect.stringContaining('apra-fleet-service.bat'),
+        expect.stringContaining('@echo off'),
+        'utf8',
+      );
+      const call = vi.mocked(fs.writeFileSync).mock.calls[0];
+      expect(call[1]).toContain('/bin/apra-fleet.exe');
+      expect(call[1]).toContain('"--transport" "http"');
+    });
+
+    it('calls schtasks /create with onlogon trigger and limited run-level', async () => {
+      const mgr = new WindowsServiceManager();
+      await mgr.register('/bin/apra-fleet.exe', ['--transport', 'http'], '/logs/fleet.log');
+      expect(execFileSync).toHaveBeenCalledWith('schtasks', expect.arrayContaining([
+        '/create', '/tn', 'ApraFleet', '/sc', 'onlogon', '/rl', 'limited', '/f',
+      ]));
+    });
+  });
+
+  describe('unregister', () => {
+    it('deletes the scheduled task and removes the wrapper bat', async () => {
+      const mgr = new WindowsServiceManager();
+      await mgr.unregister();
+      expect(execFileSync).toHaveBeenCalledWith('schtasks', ['/delete', '/tn', 'ApraFleet', '/f']);
+      expect(fs.unlinkSync).toHaveBeenCalledWith(expect.stringContaining('apra-fleet-service.bat'));
+    });
+
+    it('tolerates task-not-found error (idempotent)', async () => {
+      vi.mocked(execFileSync).mockImplementationOnce(() => { throw new Error('cannot find'); });
+      const mgr = new WindowsServiceManager();
+      await expect(mgr.unregister()).resolves.not.toThrow();
+    });
+  });
+
+  describe('start', () => {
+    it('calls schtasks /run', async () => {
+      const mgr = new WindowsServiceManager();
+      await mgr.start();
+      expect(execFileSync).toHaveBeenCalledWith('schtasks', ['/run', '/tn', 'ApraFleet']);
+    });
+  });
+
+  describe('stop', () => {
+    it('calls gracefulStopByServerJson with a fallback function', async () => {
+      const mgr = new WindowsServiceManager();
+      await mgr.stop();
+      expect(mockGracefulStop).toHaveBeenCalledWith(expect.any(Function));
+    });
+
+    it('fallback invokes taskkill /F /PID', async () => {
+      let capturedFallback: ((pid: number) => void) | undefined;
+      mockGracefulStop.mockImplementationOnce(async (fn) => { capturedFallback = fn; });
+      const mgr = new WindowsServiceManager();
+      await mgr.stop();
+      capturedFallback!(4242);
+      expect(execFileSync).toHaveBeenCalledWith('taskkill', ['/F', '/PID', '4242']);
+    });
+  });
+
+  describe('query', () => {
+    it('returns installed=true, running=false for Ready status', async () => {
+      vi.mocked(execFileSync).mockReturnValue('"ApraFleet","N/A","Ready"\r\n' as any);
+      const mgr = new WindowsServiceManager();
+      expect(await mgr.query()).toEqual({ installed: true, running: false });
+    });
+
+    it('returns installed=true, running=true for Running status', async () => {
+      vi.mocked(execFileSync).mockReturnValue('"ApraFleet","N/A","Running"\r\n' as any);
+      const mgr = new WindowsServiceManager();
+      expect(await mgr.query()).toEqual({ installed: true, running: true });
+    });
+
+    it('returns installed=false when task is not found', async () => {
+      vi.mocked(execFileSync).mockImplementation(() => { throw new Error('task not found'); });
+      const mgr = new WindowsServiceManager();
+      expect(await mgr.query()).toEqual({ installed: false, running: false });
+    });
+  });
+
+  describe('isInstalled', () => {
+    it('returns true when schtasks query succeeds', async () => {
+      vi.mocked(execFileSync).mockReturnValue('' as any);
+      expect(await new WindowsServiceManager().isInstalled()).toBe(true);
+    });
+
+    it('returns false when schtasks query throws', async () => {
+      vi.mocked(execFileSync).mockImplementation(() => { throw new Error('not found'); });
+      expect(await new WindowsServiceManager().isInstalled()).toBe(false);
+    });
+  });
+});
+
+// ---------------------------------------------------------------------------
+// Linux
+// ---------------------------------------------------------------------------
+describe('LinuxServiceManager', () => {
+  const savedXdg = process.env.XDG_RUNTIME_DIR;
+
+  beforeEach(() => {
+    vi.clearAllMocks();
+    process.env.XDG_RUNTIME_DIR = '/run/user/1000';
+    vi.mocked(execFileSync).mockReturnValue('' as any);
+    vi.mocked(fs.mkdirSync).mockReturnValue(undefined as any);
+    vi.mocked(fs.writeFileSync).mockReturnValue(undefined);
+    vi.mocked(fs.unlinkSync).mockReturnValue(undefined);
+    // Default: systemd available, unit file not installed
+    // Normalize separators for cross-platform compatibility (Windows uses backslash)
+    vi.mocked(fs.existsSync).mockImplementation((p) =>
+      String(p).replace(/\\/g, '/').endsWith('/systemd'),
+    );
+  });
+
+  afterEach(() => {
+    if (savedXdg === undefined) delete process.env.XDG_RUNTIME_DIR;
+    else process.env.XDG_RUNTIME_DIR = savedXdg;
+  });
+
+  describe('non-systemd detection', () => {
+    it('throws a clear error on register when systemd is absent', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(false);
+      await expect(
+        new LinuxServiceManager().register('/bin/apra-fleet', [], '/tmp/fleet.log'),
+      ).rejects.toThrow('systemd user mode is not available');
+    });
+
+    it('throws a clear error on start when systemd is absent', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(false);
+      await expect(new LinuxServiceManager().start()).rejects.toThrow('systemd user mode is not available');
+    });
+
+    it('throws a clear error on stop when systemd is absent', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(false);
+      await expect(new LinuxServiceManager().stop()).rejects.toThrow('systemd user mode is not available');
+    });
+  });
+
+  describe('register', () => {
+    it('writes unit file with correct content', async () => {
+      await new LinuxServiceManager().register(
+        '/usr/local/bin/apra-fleet', ['--transport', 'http'], '/home/user/fleet.log',
+      );
+      const [, content] = vi.mocked(fs.writeFileSync).mock.calls[0];
+      expect(content).toContain('Type=simple');
+      expect(content).toContain('ExecStart=/usr/local/bin/apra-fleet --transport http');
+      expect(content).toContain('Restart=on-failure');
+      expect(content).toContain('WantedBy=default.target');
+    });
+
+    it('runs daemon-reload and enable after writing unit file', async () => {
+      await new LinuxServiceManager().register('/bin/apra-fleet', [], '/tmp/fleet.log');
+      expect(execFileSync).toHaveBeenCalledWith('systemctl', ['--user', 'daemon-reload']);
+      expect(execFileSync).toHaveBeenCalledWith('systemctl', ['--user', 'enable', 'apra-fleet']);
+    });
+
+    it('warns (not throws) when loginctl enable-linger fails', async () => {
+      vi.mocked(execFileSync).mockImplementation((cmd: any, args: any) => {
+        if (cmd === 'loginctl') throw new Error('permission denied');
+        return '' as any;
+      });
+      const warnSpy = vi.spyOn(console, 'warn').mockImplementation(() => {});
+      await new LinuxServiceManager().register('/bin/apra-fleet', [], '/tmp/fleet.log');
+      expect(warnSpy).toHaveBeenCalledWith(expect.stringContaining('loginctl enable-linger failed'));
+    });
+  });
+
+  describe('unregister', () => {
+    it('gracefully stops then disables and removes the unit file', async () => {
+      await new LinuxServiceManager().unregister();
+      expect(mockGracefulStop).toHaveBeenCalled();
+      expect(execFileSync).toHaveBeenCalledWith('systemctl', ['--user', 'disable', 'apra-fleet']);
+      expect(execFileSync).toHaveBeenCalledWith('systemctl', ['--user', 'daemon-reload']);
+    });
+
+    it('is idempotent when unit is not installed', async () => {
+      vi.mocked(execFileSync).mockImplementation(() => { throw new Error('not found'); });
+      await expect(new LinuxServiceManager().unregister()).resolves.not.toThrow();
+    });
+  });
+
+  describe('start', () => {
+    it('calls systemctl --user start', async () => {
+      await new LinuxServiceManager().start();
+      expect(execFileSync).toHaveBeenCalledWith('systemctl', ['--user', 'start', 'apra-fleet']);
+    });
+  });
+
+  describe('stop', () => {
+    it('calls gracefulStopByServerJson', async () => {
+      await new LinuxServiceManager().stop();
+      expect(mockGracefulStop).toHaveBeenCalled();
+    });
+  });
+
+  describe('query', () => {
+    it('returns installed=false when unit file does not exist', async () => {
+      vi.mocked(fs.existsSync).mockImplementation((p) =>
+        String(p).replace(/\\/g, '/').endsWith('/systemd'), // only systemd dir
+      );
+      expect(await new LinuxServiceManager().query()).toEqual({ installed: false, running: false });
+    });
+
+    it('returns running=true and enabled=true for active/enabled unit', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(true);
+      vi.mocked(execFileSync).mockImplementation((_cmd: any, args: any) => {
+        if ((args as string[]).includes('is-active')) return 'active\n' as any;
+        if ((args as string[]).includes('is-enabled')) return 'enabled\n' as any;
+        return '' as any;
+      });
+      expect(await new LinuxServiceManager().query()).toEqual({ installed: true, running: true, enabled: true });
+    });
+
+    it('returns running=false and enabled=false for inactive/disabled unit', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(true);
+      vi.mocked(execFileSync).mockImplementation((_cmd: any, args: any) => {
+        if ((args as string[]).includes('is-active')) return 'inactive\n' as any;
+        if ((args as string[]).includes('is-enabled')) return 'disabled\n' as any;
+        return '' as any;
+      });
+      expect(await new LinuxServiceManager().query()).toEqual({ installed: true, running: false, enabled: false });
+    });
+  });
+
+  describe('isInstalled', () => {
+    it('returns true when unit file exists', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(true);
+      expect(await new LinuxServiceManager().isInstalled()).toBe(true);
+    });
+
+    it('returns false when unit file does not exist', async () => {
+      vi.mocked(fs.existsSync).mockImplementation((p) =>
+        String(p).endsWith('/systemd'),
+      );
+      expect(await new LinuxServiceManager().isInstalled()).toBe(false);
+    });
+  });
+});
+
+// ---------------------------------------------------------------------------
+// macOS
+// ---------------------------------------------------------------------------
+describe('MacOSServiceManager', () => {
+  beforeEach(() => {
+    vi.clearAllMocks();
+    vi.mocked(execFileSync).mockReturnValue('' as any);
+    vi.mocked(fs.mkdirSync).mockReturnValue(undefined as any);
+    vi.mocked(fs.writeFileSync).mockReturnValue(undefined);
+    vi.mocked(fs.unlinkSync).mockReturnValue(undefined);
+    vi.mocked(fs.existsSync).mockReturnValue(false);
+  });
+
+  describe('register', () => {
+    it('writes plist with Label, ProgramArguments, RunAtLoad, KeepAlive', async () => {
+      await new MacOSServiceManager().register(
+        '/usr/local/bin/apra-fleet', ['--transport', 'http'], '/Users/user/fleet.log',
+      );
+      const plistCall = vi.mocked(fs.writeFileSync).mock.calls.find(c =>
+        String(c[0]).endsWith('.plist'),
+      );
+      expect(plistCall).toBeDefined();
+      const content = String(plistCall![1]);
+      expect(content).toContain('<string>com.apra-fleet.server</string>');
+      expect(content).toContain('<string>/usr/local/bin/apra-fleet</string>');
+      expect(content).toContain('<true/>'); // RunAtLoad
+      expect(content).toContain('<key>SuccessfulExit</key>');
+      expect(content).toContain('<false/>'); // KeepAlive.SuccessfulExit
+    });
+
+    it('bootouts before bootstrap to be idempotent', async () => {
+      await new MacOSServiceManager().register('/bin/apra-fleet', [], '/tmp/fleet.log');
+      const calls = vi.mocked(execFileSync).mock.calls.map(c => c[1] as string[]);
+      const bootoutIdx = calls.findIndex(a => a.includes('bootout'));
+      const bootstrapIdx = calls.findIndex(a => a.includes('bootstrap'));
+      expect(bootoutIdx).toBeGreaterThanOrEqual(0);
+      expect(bootstrapIdx).toBeGreaterThan(bootoutIdx);
+    });
+
+    it('tolerates bootout error on first registration', async () => {
+      // bootout throws "not loaded" (first exec call), bootstrap succeeds (second exec call)
+      vi.mocked(execFileSync).mockImplementationOnce(() => { throw new Error('not loaded'); });
+      vi.mocked(execFileSync).mockImplementationOnce(() => {});
+      const mgr = new MacOSServiceManager();
+      await expect(mgr.register('/bin/apra-fleet', [], '/tmp/fleet.log')).resolves.not.toThrow();
+    });
+  });
+
+  describe('unregister', () => {
+    it('bootouts service and removes plist file', async () => {
+      await new MacOSServiceManager().unregister();
+      expect(execFileSync).toHaveBeenCalledWith('launchctl', expect.arrayContaining(['bootout']));
+      expect(fs.unlinkSync).toHaveBeenCalledWith(expect.stringContaining('com.apra-fleet.server.plist'));
+    });
+
+    it('tolerates bootout error when service is not loaded', async () => {
+      vi.mocked(execFileSync).mockImplementationOnce(() => { throw new Error('No such process'); });
+      await expect(new MacOSServiceManager().unregister()).resolves.not.toThrow();
+    });
+  });
+
+  describe('start', () => {
+    it('calls launchctl kickstart', async () => {
+      await new MacOSServiceManager().start();
+      expect(execFileSync).toHaveBeenCalledWith('launchctl', expect.arrayContaining(['kickstart']));
+    });
+  });
+
+  describe('stop', () => {
+    it('calls gracefulStopByServerJson', async () => {
+      await new MacOSServiceManager().stop();
+      expect(mockGracefulStop).toHaveBeenCalled();
+    });
+  });
+
+  describe('query', () => {
+    it('returns installed=false when plist does not exist', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(false);
+      expect(await new MacOSServiceManager().query()).toEqual({ installed: false, running: false });
+    });
+
+    it('extracts pid from launchctl print output', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(true);
+      vi.mocked(execFileSync).mockReturnValue('com.apra-fleet.server {\n\tpid = 1234\n\tstate = running\n}\n' as any);
+      expect(await new MacOSServiceManager().query()).toEqual({ installed: true, running: true, pid: 1234 });
+    });
+
+    it('returns running=false when launchctl print fails (not loaded)', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(true);
+      vi.mocked(execFileSync).mockImplementation(() => { throw new Error('Could not find specified service'); });
+      expect(await new MacOSServiceManager().query()).toEqual({ installed: true, running: false });
+    });
+
+    it('returns running=false when launchctl print shows no pid', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(true);
+      vi.mocked(execFileSync).mockReturnValue('com.apra-fleet.server {\n\tstate = stopped\n}\n' as any);
+      expect(await new MacOSServiceManager().query()).toEqual({ installed: true, running: false, pid: undefined });
+    });
+  });
+
+  describe('isInstalled', () => {
+    it('returns true when plist file exists', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(true);
+      expect(await new MacOSServiceManager().isInstalled()).toBe(true);
+    });
+
+    it('returns false when plist file does not exist', async () => {
+      vi.mocked(fs.existsSync).mockReturnValue(false);
+      expect(await new MacOSServiceManager().isInstalled()).toBe(false);
+    });
+  });
+});
diff --git a/tests/singleton.test.ts b/tests/singleton.test.ts
new file mode 100644
index 00000000..5121c491
--- /dev/null
+++ b/tests/singleton.test.ts
@@ -0,0 +1,181 @@
+import { describe, it, expect, beforeEach, afterEach } from 'vitest';
+import fs from 'node:fs';
+import http from 'node:http';
+import path from 'node:path';
+import os from 'node:os';
+import { checkRunningInstance, claimStartupLock } from '../src/services/singleton.js';
+
+// Use a per-run temp directory so tests are isolated and don't touch the real FLEET_DIR
+const TEST_DIR = path.join(os.tmpdir(), `apra-fleet-singleton-test-${process.pid}`);
+const SERVER_INFO = path.join(TEST_DIR, 'server.json');
+const LOCK_FILE = path.join(TEST_DIR, 'server.lock');
+
+const originalDataDir = process.env.APRA_FLEET_DATA_DIR;
+
+beforeEach(() => {
+  fs.mkdirSync(TEST_DIR, { recursive: true });
+  process.env.APRA_FLEET_DATA_DIR = TEST_DIR;
+});
+
+afterEach(() => {
+  if (originalDataDir === undefined) {
+    delete process.env.APRA_FLEET_DATA_DIR;
+  } else {
+    process.env.APRA_FLEET_DATA_DIR = originalDataDir;
+  }
+  try { fs.rmSync(TEST_DIR, { recursive: true, force: true }); } catch {}
+});
+
+// ---------------------------------------------------------------------------
+// (a) stale server.json (dead PID) is cleaned up and startup proceeds
+// ---------------------------------------------------------------------------
+describe('(a) stale server.json is cleaned up', () => {
+  it('returns running=false and deletes server.json when PID is dead', async () => {
+    // Write server.json with a PID that will never be alive (max safe int32)
+    fs.writeFileSync(SERVER_INFO, JSON.stringify({
+      pid: 2147483647,
+      url: 'http://127.0.0.1:7523/mcp',
+      version: 'v0.0.1',
+      port: 7523,
+      startedAt: new Date().toISOString(),
+    }));
+    expect(fs.existsSync(SERVER_INFO)).toBe(true);
+
+    const result = await checkRunningInstance();
+
+    expect(result.running).toBe(false);
+    expect(fs.existsSync(SERVER_INFO)).toBe(false);
+  });
+
+  it('returns running=false when server.json does not exist', async () => {
+    const result = await checkRunningInstance();
+    expect(result.running).toBe(false);
+  });
+
+  it('returns running=false when server.json is malformed', async () => {
+    fs.writeFileSync(SERVER_INFO, 'not json');
+    const result = await checkRunningInstance();
+    expect(result.running).toBe(false);
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (b) health endpoint returns correct JSON
+// ---------------------------------------------------------------------------
+describe('(b) health endpoint check', () => {
+  it('returns running=true when PID is alive and health endpoint responds 200', async () => {
+    // Start a minimal HTTP server to act as the /health endpoint
+    const mockServer = http.createServer((req, res) => {
+      if (req.url === '/health') {
+        res.writeHead(200, { 'Content-Type': 'application/json' });
+        res.end(JSON.stringify({ status: 'ok' }));
+      } else {
+        res.writeHead(404);
+        res.end();
+      }
+    });
+
+    await new Promise<void>(resolve => mockServer.listen(0, '127.0.0.1', resolve));
+    const addr = mockServer.address() as { port: number };
+
+    try {
+      fs.writeFileSync(SERVER_INFO, JSON.stringify({
+        pid: process.pid, // current process is definitely alive
+        url: `http://127.0.0.1:${addr.port}/mcp`,
+        version: 'v0.0.1',
+        port: addr.port,
+        startedAt: new Date().toISOString(),
+      }));
+
+      const result = await checkRunningInstance();
+
+      expect(result.running).toBe(true);
+      if (result.running) {
+        expect(result.pid).toBe(process.pid);
+        expect(result.url).toContain('/mcp');
+      }
+    } finally {
+      await new Promise<void>(resolve => mockServer.close(() => resolve()));
+    }
+  });
+
+  it('returns running=false when PID is alive but health endpoint is down', async () => {
+    // Port 1 will always fail to connect
+    fs.writeFileSync(SERVER_INFO, JSON.stringify({
+      pid: process.pid,
+      url: 'http://127.0.0.1:1/mcp',
+      version: 'v0.0.1',
+      port: 1,
+      startedAt: new Date().toISOString(),
+    }));
+
+    const result = await checkRunningInstance();
+
+    expect(result.running).toBe(false);
+    expect(fs.existsSync(SERVER_INFO)).toBe(false);
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (c) lock file prevents concurrent startup -- second acquire gets acquired=false
+// ---------------------------------------------------------------------------
+describe('(c) startup lock prevents concurrent startup', () => {
+  it('first claim acquires, second claim returns acquired=false', () => {
+    const lock1 = claimStartupLock();
+    expect(lock1.acquired).toBe(true);
+    expect(fs.existsSync(LOCK_FILE)).toBe(true);
+
+    const lock2 = claimStartupLock();
+    expect(lock2.acquired).toBe(false);
+
+    lock1.release();
+    expect(fs.existsSync(LOCK_FILE)).toBe(false);
+  });
+
+  it('release() deletes the lock file', () => {
+    const lock = claimStartupLock();
+    expect(lock.acquired).toBe(true);
+    expect(fs.existsSync(LOCK_FILE)).toBe(true);
+
+    lock.release();
+    expect(fs.existsSync(LOCK_FILE)).toBe(false);
+  });
+
+  it('after release, next claim acquires successfully', () => {
+    const lock1 = claimStartupLock();
+    lock1.release();
+
+    const lock2 = claimStartupLock();
+    expect(lock2.acquired).toBe(true);
+    lock2.release();
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (d) stale lock file (>60s old) is cleaned up and lock is acquired
+// ---------------------------------------------------------------------------
+describe('(d) stale lock file is cleaned up', () => {
+  it('acquires lock when existing lock file is older than 60 seconds', () => {
+    // Create a lock file and backdate its mtime by 70 seconds
+    fs.writeFileSync(LOCK_FILE, '99999');
+    const staleMtime = new Date(Date.now() - 70_000);
+    fs.utimesSync(LOCK_FILE, staleMtime, staleMtime);
+
+    expect(fs.existsSync(LOCK_FILE)).toBe(true);
+
+    const lock = claimStartupLock();
+    expect(lock.acquired).toBe(true);
+    lock.release();
+  });
+
+  it('does not acquire when existing lock file is fresh (< 60 seconds)', () => {
+    // Create a fresh lock file
+    fs.writeFileSync(LOCK_FILE, '99999');
+
+    const lock = claimStartupLock();
+    expect(lock.acquired).toBe(false);
+
+    // Clean up manually since we didn't acquire
+    fs.unlinkSync(LOCK_FILE);
+  });
+});
diff --git a/tests/transport-integration.test.ts b/tests/transport-integration.test.ts
new file mode 100644
index 00000000..a4eb1e3c
--- /dev/null
+++ b/tests/transport-integration.test.ts
@@ -0,0 +1,277 @@
+/**
+ * Transport integration tests (Task 9 / PLAN.md Phase 3).
+ * Six end-to-end scenarios covering the full HTTP transport path and
+ * Gemini client compatibility.
+ *
+ * Tests (a)-(e) exercise the HTTP singleton path; test (d) exercises stdio
+ * via an in-process InMemoryTransport pair.
+ */
+
+import { describe, it, expect, afterEach } from 'vitest';
+import net from 'node:net';
+import { z } from 'zod';
+import { Client } from '@modelcontextprotocol/sdk/client/index.js';
+import { McpServer } from '@modelcontextprotocol/sdk/server/mcp.js';
+import { StreamableHTTPClientTransport } from '@modelcontextprotocol/sdk/client/streamableHttp.js';
+import { InMemoryTransport } from '@modelcontextprotocol/sdk/inMemory.js';
+import { LoggingMessageNotificationSchema } from '@modelcontextprotocol/sdk/types.js';
+import { createHttpTransport, HttpTransportHandle } from '../src/services/http-transport.js';
+import { fleetEvents } from '../src/services/event-bus.js';
+import { serverVersion } from '../src/version.js';
+
+// ---------------------------------------------------------------------------
+// Test infrastructure
+// ---------------------------------------------------------------------------
+
+const handles: HttpTransportHandle[] = [];
+const clients: Client[] = [];
+
+afterEach(async () => {
+  for (const client of clients.splice(0)) {
+    try { await client.close(); } catch { /* ignore */ }
+  }
+  fleetEvents.removeAllListeners();
+  for (const handle of handles.splice(0)) {
+    try { await handle.close(); } catch { /* ignore */ }
+  }
+});
+
+function registerVersionTool(server: McpServer): void {
+  server.tool(
+    'version',
+    'Returns the installed apra-fleet server version',
+    z.object({}).shape,
+    async () => ({
+      content: [{ type: 'text' as const, text: `apra-fleet ${serverVersion}` }],
+    })
+  );
+}
+
+function makeHttpClient(port: number): Client {
+  return new Client({ name: 'integration-test-client', version: '1.0.0' }, { capabilities: {} });
+}
+
+function makeHttpTransport(port: number): StreamableHTTPClientTransport {
+  return new StreamableHTTPClientTransport(
+    new URL(`http://127.0.0.1:${port}/mcp`),
+    {
+      reconnectionOptions: {
+        maxRetries: 0,
+        maxReconnectionDelay: 100,
+        initialReconnectionDelay: 100,
+        reconnectionDelayGrowFactor: 1,
+      },
+    }
+  );
+}
+
+// ---------------------------------------------------------------------------
+// (a) HTTP server with tools registered: client can call the version tool
+// ---------------------------------------------------------------------------
+describe('(a) HTTP server tool call end-to-end', () => {
+  it('client connects via StreamableHTTP and calls the version tool', async () => {
+    const handle = await createHttpTransport({
+      registerTools: registerVersionTool,
+      preferredPort: 0,
+    });
+    handles.push(handle);
+
+    const client = makeHttpClient(handle.port);
+    clients.push(client);
+    await client.connect(makeHttpTransport(handle.port));
+
+    const result = await client.callTool({ name: 'version', arguments: {} });
+
+    expect(result.content).toHaveLength(1);
+    const text = (result.content[0] as { type: string; text: string }).text;
+    expect(text).toContain('apra-fleet');
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (b) credential:stored event reaches connected client as notifications/message
+// ---------------------------------------------------------------------------
+describe('(b) event bus -> notification/message broadcast', () => {
+  it('client receives notifications/message when credential:stored is emitted', async () => {
+    const handle = await createHttpTransport({
+      registerTools: registerVersionTool,
+      preferredPort: 0,
+    });
+    handles.push(handle);
+
+    const client = makeHttpClient(handle.port);
+    clients.push(client);
+
+    const received: unknown[] = [];
+    client.setNotificationHandler(LoggingMessageNotificationSchema, (n) => {
+      received.push(n.params.data);
+    });
+
+    await client.connect(makeHttpTransport(handle.port));
+
+    // Wait for SSE stream to be established (GET /mcp)
+    await new Promise(resolve => setTimeout(resolve, 200));
+
+    fleetEvents.emit('credential:stored', { name: 'test-cred' });
+
+    // Allow notification to propagate
+    await new Promise(resolve => setTimeout(resolve, 300));
+
+    expect(received).toHaveLength(1);
+    const payload = received[0] as { event: string; name: string };
+    expect(payload.event).toBe('credential:stored');
+    expect(payload.name).toBe('test-cred');
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (c) Two concurrent clients both receive the notification
+// ---------------------------------------------------------------------------
+describe('(c) broadcast to multiple concurrent clients', () => {
+  it('both clients receive notifications/message on credential:stored', async () => {
+    const handle = await createHttpTransport({
+      registerTools: registerVersionTool,
+      preferredPort: 0,
+    });
+    handles.push(handle);
+
+    // Track SSE GET requests so we know when both streams are open
+    let sseGetCount = 0;
+    handle.httpServer.on('request', (req) => {
+      if (req.method === 'GET' && req.url === '/mcp') sseGetCount++;
+    });
+
+    const c1 = makeHttpClient(handle.port);
+    const c2 = makeHttpClient(handle.port);
+    clients.push(c1, c2);
+
+    const received1: unknown[] = [];
+    const received2: unknown[] = [];
+    c1.setNotificationHandler(LoggingMessageNotificationSchema, (n) => { received1.push(n.params.data); });
+    c2.setNotificationHandler(LoggingMessageNotificationSchema, (n) => { received2.push(n.params.data); });
+
+    await Promise.all([
+      c1.connect(makeHttpTransport(handle.port)),
+      c2.connect(makeHttpTransport(handle.port)),
+    ]);
+
+    // Wait for both SSE streams to open
+    const deadline = Date.now() + 3000;
+    while (sseGetCount < 2 && Date.now() < deadline) {
+      await new Promise(resolve => setTimeout(resolve, 20));
+    }
+    expect(sseGetCount).toBeGreaterThanOrEqual(2);
+
+    fleetEvents.emit('credential:stored', { name: 'shared-cred' });
+
+    await new Promise(resolve => setTimeout(resolve, 300));
+
+    expect(received1).toHaveLength(1);
+    expect(received2).toHaveLength(1);
+    expect((received1[0] as { event: string }).event).toBe('credential:stored');
+    expect((received2[0] as { event: string }).event).toBe('credential:stored');
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (d) Stdio regression: tool calls work via in-process InMemoryTransport
+// ---------------------------------------------------------------------------
+describe('(d) stdio regression via InMemoryTransport', () => {
+  it('registers tools and responds to version tool call over in-memory transport', async () => {
+    const server = new McpServer(
+      { name: 'apra-fleet-test', version: serverVersion },
+      { capabilities: { logging: {} } }
+    );
+    registerVersionTool(server);
+
+    const [clientTransport, serverTransport] = InMemoryTransport.createLinkedPair();
+
+    const client = new Client(
+      { name: 'stdio-regression-client', version: '1.0.0' },
+      { capabilities: {} }
+    );
+
+    await Promise.all([
+      server.connect(serverTransport),
+      client.connect(clientTransport),
+    ]);
+
+    const result = await client.callTool({ name: 'version', arguments: {} });
+
+    expect(result.content).toHaveLength(1);
+    const text = (result.content[0] as { type: string; text: string }).text;
+    expect(text).toContain('apra-fleet');
+
+    await client.close();
+    // server closes implicitly when client disconnects
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (e) Server binds to 127.0.0.1 only (not 0.0.0.0)
+// ---------------------------------------------------------------------------
+describe('(e) localhost-only binding', () => {
+  it('HTTP server address is 127.0.0.1', async () => {
+    const handle = await createHttpTransport({
+      registerTools: registerVersionTool,
+      preferredPort: 0,
+    });
+    handles.push(handle);
+
+    const addr = handle.httpServer.address() as net.AddressInfo;
+    expect(addr.address).toBe('127.0.0.1');
+  });
+
+  it('server URL reflects 127.0.0.1', async () => {
+    const handle = await createHttpTransport({
+      registerTools: registerVersionTool,
+      preferredPort: 0,
+    });
+    handles.push(handle);
+
+    expect(handle.url).toMatch(/^http:\/\/127\.0\.0\.1:\d+\/mcp$/);
+  });
+});
+
+// ---------------------------------------------------------------------------
+// (f) Gemini client compatibility test
+//
+// Gemini CLI uses StreamableHTTPClientTransport from the MCP SDK to connect
+// to MCP servers. This test validates that our StreamableHTTPServerTransport
+// is compatible with that client transport — independent of the open Gemini
+// bug google-gemini/gemini-cli#5268 (Gemini CLI may not support all
+// StreamableHTTP protocol features at the CLI level, but the MCP SDK client
+// transport itself is spec-compliant and should work against our server).
+//
+// If this test fails, it is a fleet-side issue (our server is not spec-
+// compliant). If it passes but Gemini CLI still fails in production, the
+// failure is Gemini-side (bug #5268 or related).
+// ---------------------------------------------------------------------------
+describe('(f) Gemini client compatibility', () => {
+  it('StreamableHTTPClientTransport can initialize and call a tool (Gemini-compatible path)', async () => {
+    const handle = await createHttpTransport({
+      registerTools: registerVersionTool,
+      preferredPort: 0,
+    });
+    handles.push(handle);
+
+    // Use the same transport class that Gemini CLI uses
+    const geminiClient = new Client(
+      { name: 'gemini-compat-test-client', version: '1.0.0' },
+      { capabilities: {} }
+    );
+    clients.push(geminiClient);
+
+    await geminiClient.connect(makeHttpTransport(handle.port));
+
+    const result = await geminiClient.callTool({ name: 'version', arguments: {} });
+
+    expect(result.content).toHaveLength(1);
+    const text = (result.content[0] as { type: string; text: string }).text;
+    expect(text).toContain('apra-fleet');
+
+    // Verify tool list is accessible (part of the Gemini initialization handshake)
+    const tools = await geminiClient.listTools();
+    expect(tools.tools.some(t => t.name === 'version')).toBe(true);
+  });
+});