
feat: add /v1/responses passthrough MVP #54

Merged
arniesaha merged 3 commits into master from feat/stream-cache-usage
May 4, 2026
Conversation

@arniesaha
Owner

Summary

  • add native POST /v1/responses handling in the Express app
  • allow providers to declare supported protocols and route responses requests only to responses-capable downstreams (a sketch follows this list)
  • add openai-compatible downstream passthrough + tests/docs for the MVP flow
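
For orientation, a minimal sketch of what the passthrough route could look like, assuming an Express app on Node 18+ (global fetch) and a hypothetical Provider shape carrying a protocols list; none of these names are the PR's actual code:

```ts
import express, { Request, Response } from "express";

// Hypothetical provider shape: each downstream declares which wire
// protocols it supports, so routing can filter on capability.
type RequestProtocol = "chat_completions" | "responses";

interface Provider {
  name: string;
  baseUrl: string;
  apiKey: string;
  protocols: RequestProtocol[];
}

const providers: Provider[] = []; // populated from config in a real app

const app = express();
app.use(express.json());

// Passthrough MVP: forward only to a responses-capable downstream.
app.post("/v1/responses", async (req: Request, res: Response) => {
  const target = providers.find((p) => p.protocols.includes("responses"));
  if (!target) {
    res.status(502).json({ error: "no responses-capable downstream" });
    return;
  }
  const upstream = await fetch(`${target.baseUrl}/v1/responses`, {
    method: "POST",
    headers: {
      "content-type": "application/json",
      authorization: `Bearer ${target.apiKey}`,
    },
    body: JSON.stringify(req.body),
  });
  res.status(upstream.status).json(await upstream.json());
});
```

(Streaming and error mapping are omitted; the sketch only illustrates the capability-based routing described above.)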

Testing

  • npm test
  • npm run check

Closes #40

arniesaha and others added 3 commits April 23, 2026 01:10
…onses

Extends the Anthropic streaming path to capture cache_read_input_tokens
and cache_creation_input_tokens from message_start (and defensively from
message_delta for models that emit mid-stream usage updates), and to
emit a trailing OpenAI-shaped usage chunk (choices: [], usage: {...})
so clients with stream_options.include_usage: true see the cache hit
rate even on streaming calls.

Parity with toOpenAIResponse (sketched below):
  - cache_read + cache_creation rolled into prompt_tokens
  - cache_read surfaced as prompt_tokens_details.cached_tokens (only when
    cache fields are present)
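
A hedged sketch of those two rules, under assumed event and bucket shapes (the bucket field names mirror the Anthropic wire format; everything else here is illustrative, not the PR's code):

```ts
// Illustrative usage buckets accumulated while streaming.
interface UsageBuckets {
  input_tokens: number;
  output_tokens: number;
  cache_read_input_tokens?: number;
  cache_creation_input_tokens?: number;
}

// message_start carries the initial usage snapshot; message_delta is read
// defensively for models that emit mid-stream usage updates.
function captureUsage(event: any, b: UsageBuckets): void {
  const u =
    event.type === "message_start" ? event.message?.usage
    : event.type === "message_delta" ? event.usage
    : undefined;
  if (!u) return;
  if (typeof u.input_tokens === "number") b.input_tokens = u.input_tokens;
  if (typeof u.output_tokens === "number") b.output_tokens = u.output_tokens;
  if (typeof u.cache_read_input_tokens === "number")
    b.cache_read_input_tokens = u.cache_read_input_tokens;
  if (typeof u.cache_creation_input_tokens === "number")
    b.cache_creation_input_tokens = u.cache_creation_input_tokens;
}

// Trailing OpenAI-shaped chunk: cache buckets rolled into prompt_tokens,
// cached_tokens emitted only when a cache field was actually seen.
function trailingUsageChunk(b: UsageBuckets, model: string) {
  const cacheRead = b.cache_read_input_tokens ?? 0;
  const cacheCreation = b.cache_creation_input_tokens ?? 0;
  const promptTokens = b.input_tokens + cacheRead + cacheCreation;
  const sawCacheFields =
    b.cache_read_input_tokens !== undefined ||
    b.cache_creation_input_tokens !== undefined;
  return {
    object: "chat.completion.chunk",
    model,
    choices: [], // usage-only chunk for stream_options.include_usage clients
    usage: {
      prompt_tokens: promptTokens,
      completion_tokens: b.output_tokens,
      total_tokens: promptTokens + b.output_tokens,
      ...(sawCacheFields
        ? { prompt_tokens_details: { cached_tokens: cacheRead } }
        : {}),
    },
  };
}
```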

Also plumbs the buckets through StreamResult into the Anthropic SDK
adapter, which sets two new OTel span attrs (conditionally, when >0):
  - prov.llm.cache_read_input_tokens
  - prov.llm.cache_creation_input_tokens
The same attrs are now also set on the non-streaming path for
AgentWeave parity across streaming and non-streaming sessions.
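
The conditional attribute setting might look roughly like this; the two attribute names come from the commit message above, while the helper and its call site are assumptions:

```ts
import type { Span } from "@opentelemetry/api";

// Only set the cache attrs when the counts are > 0, per the commit note.
function setCacheSpanAttrs(
  span: Span,
  cacheRead: number,
  cacheCreation: number,
): void {
  if (cacheRead > 0) {
    span.setAttribute("prov.llm.cache_read_input_tokens", cacheRead);
  }
  if (cacheCreation > 0) {
    span.setAttribute("prov.llm.cache_creation_input_tokens", cacheCreation);
  }
}
```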

Closes #52.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

- centralize DOWNSTREAM_PROTOCOLS parsing in config (one source of truth;
  registry no longer duplicates env parsing inline); a sketch follows this
  commit message
- drop unused ResponsesInputItem import in app.ts
- rename emitDownstreamResponsesAsSse → emitMockResponsesAsSse with
  proper output[].content[].text walk; clarifies it's the mock-only path
- strip Mux-internal fields (runtime, protocol) before forwarding to
  downstream — prevents 400s from strict OpenAI endpoints
- promote protocol?: RequestProtocol to ChatCompletionsRequest, drop the
  intersection-type casts at resolveRoute call sites
- document DOWNSTREAM_PROTOCOLS in .env.example
- add docs/deploy-debian.md covering systemd unit, OpenClaw plumbing
  (requires openclaw 13085b0bdf for baseUrl honoring), AgentWeave
  passthrough, SSE proxy-buffering caveat, and troubleshooting

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
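
Two of the bullets above (centralized DOWNSTREAM_PROTOCOLS parsing, and stripping Mux-internal fields) might look roughly like this; the comma-separated env format, the default, and the helper names are assumptions for the sketch, not the repo's actual contract (see .env.example for that):

```ts
// Hypothetical: one source of truth for DOWNSTREAM_PROTOCOLS parsing.
type RequestProtocol = "chat_completions" | "responses";

function parseDownstreamProtocols(raw: string | undefined): RequestProtocol[] {
  if (!raw) return ["chat_completions"]; // assumed default
  return raw
    .split(",")
    .map((p) => p.trim())
    .filter(
      (p): p is RequestProtocol =>
        p === "chat_completions" || p === "responses",
    );
}

// Strip Mux-internal fields before forwarding: strict OpenAI-compatible
// endpoints can 400 on unknown top-level keys.
function toDownstreamBody<T extends { runtime?: unknown; protocol?: unknown }>(
  body: T,
): Omit<T, "runtime" | "protocol"> {
  const { runtime: _runtime, protocol: _protocol, ...rest } = body;
  return rest;
}
```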
@arniesaha arniesaha merged commit e13d242 into master May 4, 2026
1 check passed
@arniesaha arniesaha deleted the feat/stream-cache-usage branch May 4, 2026 06:25

Development

Successfully merging this pull request may close these issues.

feat(app): add /v1/responses endpoint for OpenAI Responses API clients
