🤖 fix: require provider stream terminal events by ethanndickson · Pull Request #3415 · coder/mux

ethanndickson · 2026-05-29T03:18:56Z

Summary

Requires provider stream completion to be witnessed by the AI SDK finish part before Mux finalizes an assistant message. If a non-empty stream closes before that terminal event, Mux now persists a retryable stream_truncated partial instead of committing partial text to chat.jsonl as a successful response.

Background

This mirrors the stream-terminal invariant from coder/coder#25074 and coder/fantasy#33: Anthropic streams must reach message_stop, and OpenAI Responses streams must reach a terminal lifecycle event (response.completed, response.incomplete, or response.failed) before the response is considered complete. A clean network EOF can be a proxy or provider drop and is not enough to prove semantic completion.
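As a rough illustration of that invariant, the sketch below checks whether a list of observed SSE event type names contains a provider's terminal event. The `Provider` type, the `TERMINAL_EVENTS` table, and `reachedTerminalEvent` are hypothetical names invented for this example; only the event names themselves come from the description above.

```typescript
// Hypothetical provider labels, used only for this illustration.
type Provider = "anthropic" | "openai-responses";

// Terminal event names as listed in the PR description.
const TERMINAL_EVENTS: Record<Provider, readonly string[]> = {
  anthropic: ["message_stop"],
  "openai-responses": [
    "response.completed",
    "response.incomplete",
    "response.failed",
  ],
};

// Returns true only when a terminal event was actually observed.
// A stream that simply ends (clean EOF) yields false.
function reachedTerminalEvent(
  provider: Provider,
  eventTypes: readonly string[],
): boolean {
  const terminal = TERMINAL_EVENTS[provider];
  return eventTypes.some((type) => terminal.includes(type));
}
```

Under these assumptions, an Anthropic stream that closes after `content_block_delta` without `message_stop` is reported as incomplete even though the connection ended cleanly.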

Implementation

StreamManager now records the terminal finish part as the proof required for the success path and classifies non-empty missing-terminal completions as retryable stream_truncated errors. Empty completions keep the existing safe internal retry path. The Copilot Responses adapter now enforces the same Responses lifecycle locally, maps response.incomplete to a length finish, and surfaces response.failed as an error instead of falling through to EOF handling.
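The three outcomes described above can be sketched as a small classifier. This is not the StreamManager code: `StreamPart`, `Completion`, and `classifyCompletion` are hypothetical, the stream is modelled as an already-collected array, and "non-empty" is simplified to "produced some text".

```typescript
// Simplified stand-ins for the parts StreamManager consumes (assumed shapes).
type StreamPart =
  | { type: "text-delta"; text: string }
  | { type: "finish"; finishReason: string };

type Completion =
  | { kind: "success"; text: string }
  | { kind: "stream_truncated"; partialText: string; retryable: true }
  | { kind: "empty_retry" };

function classifyCompletion(parts: readonly StreamPart[]): Completion {
  let text = "";
  let receivedTerminalEvent = false;

  for (const part of parts) {
    if (part.type === "text-delta") {
      text += part.text;
    } else {
      // The finish part is the only proof of completion.
      receivedTerminalEvent = true;
    }
  }

  if (receivedTerminalEvent) {
    return { kind: "success", text };
  }
  if (text.length > 0) {
    // Output was produced but never terminated: keep it, mark it retryable.
    return { kind: "stream_truncated", partialText: text, retryable: true };
  }
  // Nothing was produced: fall back to the existing internal retry path.
  return { kind: "empty_retry" };
}
```

The ordering matters in this sketch: the terminal check comes first, so a stream that legitimately finishes with no text is still a success rather than an empty retry.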

Already-streamed content is preserved, not discarded

If the stream produced output before closing without a terminal event, that output is not thrown away. The final flushPartialWrite runs before the truncation guard fires, and persistStreamError writes a MuxMessage to partial.json whose parts array contains every streamed part accumulated so far (metadata.partial = true, metadata.errorType = "stream_truncated"). The success-path call to historyService.updateHistory is unreachable in this branch, so nothing partial leaks into chat.jsonl. The frontend surfaces the partial turn with a retry barrier; the user sees the streamed text and can retry, and because stream_truncated is omitted from NON_RETRYABLE_STREAM_ERRORS, retry is enabled by default. The new streamManager.test.ts regression test pins this: after a truncated stream, partial.parts still contains the streamed text-delta and partial.metadata.errorType === "stream_truncated".
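To make the persisted shape concrete, here is a minimal sketch of a truncated partial and the retry check. The interfaces are assumptions rather than the real `MuxMessage` type, `buildTruncatedPartial` and `isRetryable` are hypothetical helpers, and the entries placed in `NON_RETRYABLE_STREAM_ERRORS` are placeholders: the description only states that `stream_truncated` is not among them.

```typescript
// Assumed, simplified shapes; the real MuxMessage type is richer.
interface TextPart {
  type: "text";
  text: string;
}

interface PartialMessage {
  role: "assistant";
  parts: TextPart[];
  metadata: { partial: true; errorType: string };
}

// Placeholder contents for illustration only.
const NON_RETRYABLE_STREAM_ERRORS: ReadonlySet<string> = new Set([
  "authentication",
  "quota",
]);

// Keeps every streamed chunk and tags the message as a truncated partial.
function buildTruncatedPartial(
  streamedChunks: readonly string[],
): PartialMessage {
  return {
    role: "assistant",
    parts: streamedChunks.map((text): TextPart => ({ type: "text", text })),
    metadata: { partial: true, errorType: "stream_truncated" },
  };
}

// Retry is allowed unless the error type is explicitly listed as non-retryable.
function isRetryable(message: PartialMessage): boolean {
  return !NON_RETRYABLE_STREAM_ERRORS.has(message.metadata.errorType);
}
```

Because retry eligibility is expressed as a deny-list in this sketch, a new error type such as `stream_truncated` is retryable without any extra wiring.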

Risks

This makes the success path stricter for every provider routed through streamText. If a provider adapter fails to emit a finish part for a genuinely complete response, Mux will now surface a retryable partial instead of committing it. That is intentional for Anthropic and OpenAI Responses and safer than silently finalizing truncated output.

Generated with mux • Model: openai:gpt-5.5 • Thinking: xhigh • Cost: $3.95

Treat the provider finish part as the proof that a streamed assistant response completed. Non-empty streams that end before a terminal finish now persist a retryable stream_truncated partial instead of committing partial text as success; empty streams keep the existing safe internal retry path.

The Copilot Responses adapter now mirrors the same invariant by emitting an error when the SSE stream ends before a terminal Responses event, while treating response.incomplete as a terminal length finish and response.failed as an error.

---

_Generated with `mux` • Model: `openai:gpt-5.5` • Thinking: `xhigh` • Cost: `.95`_
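The adapter behaviour in this commit message could look roughly like the following. This is a sketch under assumptions, not the Copilot adapter itself: `AdapterResult` and `resolveResponsesStream` are hypothetical, the stream is reduced to a list of event type names, and mapping `response.completed` to a `stop` finish is an assumption made for the example.

```typescript
type AdapterResult =
  | { kind: "finish"; finishReason: "stop" | "length" }
  | { kind: "error"; message: string };

// Resolves a Responses stream to its outcome based on the first terminal event.
function resolveResponsesStream(eventTypes: readonly string[]): AdapterResult {
  for (const type of eventTypes) {
    switch (type) {
      case "response.completed":
        // Assumed mapping for illustration.
        return { kind: "finish", finishReason: "stop" };
      case "response.incomplete":
        // Terminal, but the output was cut short: report a length finish.
        return { kind: "finish", finishReason: "length" };
      case "response.failed":
        // Surface the failure instead of falling through to EOF handling.
        return { kind: "error", message: "response.failed" };
    }
  }
  // The SSE stream ended without any terminal lifecycle event.
  return {
    kind: "error",
    message: "stream ended before a terminal Responses event",
  };
}
```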

ethanndickson · 2026-05-29T03:19:12Z

@codex review

chatgpt-codex-connector · 2026-05-29T03:22:52Z

Codex Review: Didn't find any major issues. What shall we delve into next?

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you:

- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Update model-only stream manager unit tests so their mocked successful streams include the terminal finish event now required before finalization.

---

_Generated with `mux` • Model: `openai:gpt-5.5` • Thinking: `xhigh` • Cost: `.95`_

ethanndickson · 2026-05-29T03:27:47Z

@codex review

Pushed a follow-up test-only fix for the unit-test mocks that needed to emit the terminal finish event under the new invariant.

chatgpt-codex-connector · 2026-05-29T03:31:06Z

Codex Review: Didn't find any major issues. You're on a roll.


Codex pointed out that the previous discriminator lived in StreamManager and treated every `finish` part with `(unified: "other", raw: undefined)` as truncated. The LanguageModelV2 contract permits adapters to emit that shape as a legitimate terminal event, so the heuristic was too broad for the provider-agnostic StreamManager.

Move the fix to the boundary where the synthesized default originates. The @ai-sdk/openai adapter family (Responses, Chat Completions, legacy Completions) initializes its internal finishReason to `{ unified: "other", raw: undefined }` and unconditionally emits that value from its TransformStream.flush() at end-of-stream, even when no terminal SSE event arrived. The SDK's mappers only return `unified: "other"` paired with a defined `raw`, so within this adapter family the `(other, undefined)` shape is unreachable except as the uninitialized default. Dropping it is safe and intentionally scoped to the OpenAI provider construction path (and the Copilot path, which reuses the same adapter).

Implementation: introduce src/node/services/openAISynthesizedFinishFilter.ts which exposes `wrapOpenAIModelToFilterSynthesizedFinish(model)`. The wrapper pipes `doStream`'s output through a TransformStream that drops only the synthesized-default finish part; all other parts pass through unchanged. Apply the wrapper at the two `createOpenAI(...)` callsites in providerModelFactory.ts. With the synthesized finish dropped, the existing `!receivedTerminalEvent` branch in StreamManager handles a clean upstream EOF as `stream_truncated` exactly as PR #3415 intended.

Revert the StreamManager-side heuristic and tests from the previous commit so StreamManager stays provider-agnostic.
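The filtering idea in this revision can be sketched as follows. Note the substitution: the commit describes a TransformStream wrapped around `doStream`, whereas this example uses a plain synchronous generator so it stays self-contained. `ModelStreamPart`, `isSynthesizedDefaultFinish`, and `dropSynthesizedFinish` are hypothetical names, and the part shapes are simplified assumptions.

```typescript
interface FinishReason {
  unified: string;
  raw: string | undefined;
}

// Simplified stand-ins for adapter-level stream parts (assumed shapes).
type ModelStreamPart =
  | { type: "text-delta"; delta: string }
  | { type: "finish"; finishReason: FinishReason };

// Matches only the uninitialized default: unified "other" with no raw reason.
function isSynthesizedDefaultFinish(part: ModelStreamPart): boolean {
  return (
    part.type === "finish" &&
    part.finishReason.unified === "other" &&
    part.finishReason.raw === undefined
  );
}

// Passes every part through unchanged except the synthesized-default finish.
function* dropSynthesizedFinish(
  parts: Iterable<ModelStreamPart>,
): Generator<ModelStreamPart> {
  for (const part of parts) {
    if (!isSynthesizedDefaultFinish(part)) {
      yield part;
    }
  }
}
```

With the synthesized part removed, a truncated stream reaches the consumer with no `finish` at all, which is the condition the `!receivedTerminalEvent` branch is described as handling.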

coder#3441)

## Summary

Drop `ai`'s synthesized-default `finish` part inside `StreamManager` so that PR coder#3415's missing-terminal-event guard turns a clean upstream EOF into a retryable `stream_truncated` error for **both** OpenAI and Anthropic providers, instead of silently committing partial output as if the assistant finished cleanly.

## Background

PR coder#3415 added a `receivedTerminalEvent` guard in `StreamManager` that surfaces a missing terminal SSE event as a retryable `stream_truncated` error. That guard only fires when the SDK stream ends without emitting any `finish` part at all. Empirically that branch was unreachable: every real OpenAI and Anthropic stream ends with a `finish` part, but on truncated upstreams the part is a **synthesized default**, not a real terminal signal.

The synthesis originates inside the `ai` package's `streamText`. Its internal `runStep` TransformStream initializes:

```js
let stepFinishReason = "other";
let stepRawFinishReason = void 0;
```

and unconditionally emits those values from its own `flush()` at end-of-stream, even when the upstream SSE closed before any terminal event arrived. So every adapter ends up looking, at the StreamManager boundary, like it cleanly finished with `(other, undefined)` regardless of whether it actually did.

Per-provider truncation behavior, observed in the installed source:

- **OpenAI** (`@ai-sdk/openai`): each adapter (Responses, Chat Completions, legacy Completions) initializes its own `finishReason = { unified: "other", raw: undefined }` and emits it from its own `flush()`. `streamText` normalizes that to `(other, undefined)` and forwards it.
- **Anthropic** (`@ai-sdk/anthropic`): the adapter has no `flush()` and only emits its `finish` part on a real `message_stop`. On a truncated stream there is no adapter-level finish at all, and `streamText`'s `runStep.flush()` then synthesizes the same `(other, undefined)` part.

Same symptom at the StreamManager layer, two different SDK-internal causes.

## Implementation

Filter the synthesized default at the `streamText` → `StreamManager` boundary, the layer that actually produces it. In `StreamManager.processStreamWithCleanup`'s `case "finish":` handler, treat a part whose normalized `finishReason === "other"` **and** `rawFinishReason === undefined` as a non-event: do not set `receivedTerminalEvent = true`. The existing `!receivedTerminalEvent` branch below then routes the stream to `handleTruncatedStreamCompletion`, which writes a retryable `stream_truncated` partial with the streamed text preserved.

**Why the discriminator is safe (empirical):**

- **OpenAI**: `mapOpenAIResponseFinishReason` and `mapOpenAIFinishReason` only return `unified: "other"` from their `default:` branches, which are reached via `isResponseFinishedChunk` / `isResponseFailedChunk`, both of which carry a defined `raw` value. The `(other, undefined)` shape is therefore unreachable as a real OpenAI finish.
- **Anthropic**: `mapAnthropicStopReason` only returns `"other"` for the `"compaction"` case and the `default:` fallback. Both call sites in the adapter (`message_delta` and `message_start` handlers) pair the unified reason with `raw: value.message.stop_reason` (a defined string from the API). `(other, undefined)` is unreachable as a real Anthropic finish.
- **streamText's own flush**: the only path in this layer that produces `(other, undefined)` is the synthesized default in `runStep`'s end-of-stream flush.

So the discriminator distinguishes precisely between "the SDK fabricated a finish to keep the type system happy" and "the model genuinely finished with `other`". A defensive test guards the false-positive surface: a real `(other, "compaction")` finish must pass through as a clean completion.

## Risks

Behavioral change is localized to streams that previously committed partial output silently on a clean truncated EOF. After this change those surface as retryable `stream_truncated` errors, the UX PR coder#3415 originally intended.

Regression surface is the synthesized-default discriminator itself: a false positive would treat a legitimate `(other, undefined)` finish as truncated, triggering an unnecessary retry. We mitigate by:

1. Tying the discriminator to the empirically-unreachable `(other, undefined)` shape, verified against the OpenAI and Anthropic mappers (see Implementation).
2. A regression test that asserts real `(other, <raw>)` finishes (e.g. Anthropic's `"compaction"`) still complete cleanly.

If a future provider adapter does emit `(other, undefined)` as a real terminal finish, the worst case is a retry, which is preferable to silently committing partial output as a clean completion.

## Pains

The first revision moved this same heuristic into `StreamManager` and was correctly flagged by Codex as theoretically too broad: the public `LanguageModelV2` contract permits any adapter to emit `(other, undefined)` as a legitimate terminal finish. The second revision scoped a similar filter to the `@ai-sdk/openai` adapter callsites, which was contract-safe but did not actually fix the bug: `streamText`'s `runStep.flush()` re-synthesizes the identical part one layer up, and it produced no fix at all for Anthropic where the adapter has no `flush()` to filter in the first place.

This revision returns the fix to `StreamManager` but now with concrete evidence, gathered from reading `node_modules/{ai,@ai-sdk/openai,@ai-sdk/anthropic}/dist/index.js`, that the `(other, undefined)` shape is unreachable from the two real adapter mappers we care about, and is uniquely produced by `streamText`'s own flush. The discriminator's safety is a property of the two SDKs in use, not a guarantee of the public V2 contract.

---

_Generated with `mux` • Model: `anthropic:claude-opus-4-7` • Thinking: `xhigh` • Cost: `$60.15`_
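A minimal sketch of the discriminator described in this commit message, assuming the `finish` part exposes a normalized `finishReason` string and an optional `rawFinishReason`. `FinishPart` and `countsAsTerminalEvent` are hypothetical names; the actual `case "finish":` handler is not shown in this thread.

```typescript
// Assumed shape of the normalized finish part at the StreamManager boundary.
interface FinishPart {
  type: "finish";
  finishReason: string;
  rawFinishReason: string | undefined;
}

// Decides whether a finish part may set receivedTerminalEvent.
// Only the exact (other, undefined) pair is treated as a non-event.
function countsAsTerminalEvent(part: FinishPart): boolean {
  const isSynthesizedDefault =
    part.finishReason === "other" && part.rawFinishReason === undefined;
  return !isSynthesizedDefault;
}
```

The check requires both conditions, so a genuine `other` finish that carries a raw reason, such as Anthropic's `"compaction"`, still counts as a clean completion in this sketch.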

ethanndickson added this pull request to the merge queue May 29, 2026

Merged via the queue into main with commit 18d8332 May 29, 2026
24 checks passed

ethanndickson deleted the fix/provider-stream-terminal-events branch May 29, 2026 05:12

ethanndickson mentioned this pull request Jun 2, 2026

🤖 fix: filter streamText's synthesized default finish in StreamManager #3441

Merged

ethanndickson merged 2 commits into main from fix/provider-stream-terminal-events
