feat(middleware): result handling, model-state isolation, and system-prompt fidelity by zastrowm · Pull Request #2812 · strands-agents/harness-sdk

zastrowm · 2026-06-15T21:24:06Z

Description

Follow-up to the internal InvokeModelStage middleware (#2760), aligning the Python port with the TypeScript behavioral spec. Two behavior changes.

1. Result handling. The MiddlewareResult sentinel no longer rides the event stream. Instead the last yielded event is the result, matching the SDK convention (ModelStopReason ends stream_messages(), ToolResultEvent ends tool execution). The chain is now transparent. Output handlers — which transform the result — take/return a MiddlewareResult wrapper so we can extend it later without changing the signature.

# Before: integration site sniffs the stream for the sentinel
async for event in registry.invoke(InvokeModelStage, ctx, terminal):
    if isinstance(event, MiddlewareResult):
        model_result = event.value
    else:
        yield event

# After: events flow straight through; the last one is the result
async for event in registry.invoke(InvokeModelStage, ctx, terminal):
    yield event
stop_reason, message, usage, metrics = event["stop"]

2. Model-state isolation. Model runtime state is snapshotted before the chain and written back only on success, so middleware can't corrupt provider state (e.g. server-side response IDs) via the agent escape hatch.

# Before: terminal streamed against live agent state
model_state=agent._model_state,

# After: stream against a snapshot, write back after the chain succeeds
model_state_snapshot = copy.deepcopy(agent._model_state)
# ... terminal uses model_state_snapshot ...
agent._model_state = model_state_snapshot

Public API Changes

InvokeModelStage is internal, so no public surface changes. For middleware authors, the Output-phase handler shape changes to a MiddlewareResult wrapper:

def output_handler(result):  # result: MiddlewareResult
    stop_reason, message, usage, metrics = result.value["stop"]
    return result.replace(value=ModelStopReason(stop_reason="end_turn", message=message, usage=usage, metrics=metrics))

Wrap and Input handlers are unchanged.

Related Issues

Middleware - Python #2735 (Middleware – Python tracking issue), builds on feat: add internal middleware system for InvokeModelStage #2760
Behavioral spec: strands-ts/src/middleware/README.md

Documentation PR

None — internal stage; divergences documented in strands-py/src/strands/_middleware/README.md.

Type of Change

New feature

Testing

I ran hatch run prepare

Checklist

I have read the CONTRIBUTING document
I have reviewed and understand every line of code in this PR, including any generated by AI tools, and I can explain why it works
My change is focused and reasonably small; I have split unrelated work into separate PRs
I have added any necessary tests that prove my fix is effective or my feature works
I have updated the documentation accordingly
I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
My changes generate no new warnings
Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

…prompt fidelity Builds on the internal InvokeModelStage middleware (strands-agents#2760) with correctness fixes and an extensible result surface. Result handling: - Drop the MiddlewareResult sentinel from the event stream; the last yielded event is the result, matching the SDK's ModelStopReason / ToolResultEvent convention. - Output-phase handlers take and return a MiddlewareResult wrapper (with a .replace() helper) so we can extend the result later without changing the handler signature. - Type InvokeModelStage as MiddlewareStage[InvokeModelContext, ModelStopReason, TypedEvent]. Model-state isolation (matches TS): - Snapshot agent model_state before the chain, stream against the snapshot, and write back only on success. Middleware cannot leak state into the model call before or after next(). model_state is not exposed on the context. System-prompt fidelity: - Prefer system-prompt content blocks over the concatenated string so cachePoints survive the middleware round-trip; the terminal splits back into both forms for Model.stream(). Tests: - Python: model-state writeback/deletions/before-after-next, system-prompt string/ blocks/transform, CancelledError + multi-layer interrupt + inner-middleware-throw finally cases. Helper models moved to module top. - TS: Output tool-dispatch-prevention and message-append coverage.

github-actions · 2026-06-16T02:07:54Z

+            agent._model_state = model_state_snapshot
+
+            # The last event from the chain is ModelStopReason (the authoritative result)
+            stop_reason, message, usage, metrics = event["stop"]


Issue: event here is the async for loop variable, so if the middleware chain yields zero events it's never bound and this line raises UnboundLocalError: cannot access local variable 'event'. The previous code guarded this with an explicit if model_result is None: raise RuntimeError("Middleware chain did not produce a MiddlewareResult..."), which gave middleware authors a clear, actionable message. That guard is gone now.

This is reachable today — a Wrap middleware that short-circuits without yielding the result event triggers it. Reproduced:

async def silent(context, next_fn): if False: yield # never yields the result event agent._middleware_registry.add_middleware(InvokeModelStage, silent) agent("test") # -> UnboundLocalError at this line

Suggestion: Restore an explicit guard so a misbehaving middleware fails loudly instead of with a cryptic UnboundLocalError. For example, sentinel-initialize before the loop and check after:

last_event = None async for event in agent._middleware_registry.invoke(...): last_event = event yield event if last_event is None: raise RuntimeError( "Middleware chain did not yield a result event. " "Ensure middleware forwards events from next()." ) agent._model_state = model_state_snapshot stop_reason, message, usage, metrics = last_event["stop"]

github-actions · 2026-06-16T02:07:55Z

+                transformed = handler(MiddlewareResult(value=last_event))
+                if inspect.isawaitable(transformed):
+                    transformed = await transformed
+                yield transformed.value


Issue: If an Output handler forgets to return (returns None), this line raises AttributeError: 'NoneType' object has no attribute 'value', which doesn't point the author at the real mistake. The contract (handler must return a MiddlewareResult) is documented, but the failure mode is opaque.

Suggestion: Consider a targeted check after awaiting transformed to produce a clearer message, e.g.:

if not isinstance(transformed, MiddlewareResult): raise TypeError( f"Output handler must return a MiddlewareResult, got {type(transformed).__name__}" ) yield transformed.value

Optional/low-priority since this is an internal stage, but it's a cheap guardrail given the wrapper contract is new for middleware authors.

github-actions · 2026-06-16T02:07:56Z

Assessment: Comment

Clean, well-tested port. The result-encoding simplification (last event is the result) and the model-state snapshot/writeback isolation are both clear improvements over the sentinel approach, and the system-prompt content-block fidelity fix is a nice correctness catch. 97 unit tests pass locally and they assert on observable behavior (full messages, model-received state) rather than internal plumbing.

Review themes

Error ergonomics: The previous explicit RuntimeError guard for "chain produced no result" was dropped; a misbehaving middleware now surfaces as an UnboundLocalError/AttributeError. Both are reachable and reproduced — see inline comments. Restoring loud, actionable failures keeps the "obvious path is the happy path" tenet for middleware authors.
Model-state isolation: Snapshot-before / writeback-on-success is correct and the discard-on-error semantics match the TS spec. Tests cover before-next, after-next, deletions, and writeback well.
Docs: README updates (both Python and TS spec) accurately describe the new wrapper contract and isolation behavior.

Nice work keeping the chain transparent while still giving Output handlers room to evolve via the MiddlewareResult wrapper.

Add explicit guards so middleware authors get clear messages: - RuntimeError if a Wrap handler yields zero events (was UnboundLocalError) - TypeError if an Output handler forgets to return MiddlewareResult (was AttributeError)

codecov · 2026-06-16T18:45:54Z

Codecov Report

❌ Patch coverage is 96.00000% with 1 line in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
strands-py/src/strands/_middleware/registry.py	90.90%	0 Missing and 1 partial ⚠️

📢 Thoughts on this report? Let us know!

github-actions Bot added size/l python Pull requests that update python code enhancement New feature or request area-hooks Features or requests that might be implementable via hooks area-model Related to models or model providers strands-running labels Jun 15, 2026

github-actions Bot reviewed Jun 16, 2026

View reviewed changes

github-actions Bot removed the strands-running label Jun 16, 2026

fix(middleware): actionable errors for misbehaving handlers

ccd1c13

Add explicit guards so middleware authors get clear messages: - RuntimeError if a Wrap handler yields zero events (was UnboundLocalError) - TypeError if an Output handler forgets to return MiddlewareResult (was AttributeError)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(middleware): result handling, model-state isolation, and system-prompt fidelity#2812

feat(middleware): result handling, model-state isolation, and system-prompt fidelity#2812
zastrowm wants to merge 2 commits into
strands-agents:mainfrom
zastrowm:python-middleware-internal

zastrowm commented Jun 15, 2026

Uh oh!

github-actions Bot Jun 16, 2026

Uh oh!

github-actions Bot Jun 16, 2026

Uh oh!

github-actions Bot commented Jun 16, 2026

Uh oh!

codecov Bot commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

zastrowm commented Jun 15, 2026

Description

Public API Changes

Related Issues

Documentation PR

Type of Change

Testing

Checklist

Uh oh!

github-actions Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 16, 2026

Uh oh!

codecov Bot commented Jun 16, 2026

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant