Skip to content

🤖 feat: complete Claude Opus 4.8 metadata coverage#3410

Merged
ThomasK33 merged 1 commit into
mainfrom
claude-version-q36w
May 29, 2026
Merged

🤖 feat: complete Claude Opus 4.8 metadata coverage#3410
ThomasK33 merged 1 commit into
mainfrom
claude-version-q36w

Conversation

@ThomasK33

@ThomasK33 ThomasK33 commented May 28, 2026

Copy link
Copy Markdown
Member

Summary

Completes the Claude Opus 4.8 rollout on top of the upstream default-model bump by adding the remaining metadata, examples, and regression coverage that mirror the previous Opus upgrade pattern.

Background

main now includes the core Claude Opus 4.8 default-model update from #3409. After rebasing this branch onto main, this PR is the follow-up diff: Terminal-Bench leaderboard metadata, story/test fixtures, model-routing examples, and explicit Opus 4.8 coverage around native 1M/xhigh behavior.

Implementation

  • Adds anthropic/claude-opus-4-8 to Terminal-Bench leaderboard submission metadata and examples.
  • Updates the Models settings story fixture to use Opus 4.8.
  • Updates routing/example comments and Opus 4.7+ thinking/provider comments to describe the forward-compatible behavior accurately.
  • Adds explicit Opus 4.8 assertions for native 1M context, xhigh effort override, and Anthropic wire transforms.
  • Table-drives duplicated Opus 4.7/4.8 wire-transform test setup from the simplify/deslop pass.

Validation

  • bun test src/common/constants/knownModels.test.ts src/common/utils/ai/models.test.ts src/common/utils/ai/providerOptions.test.ts src/node/services/providerModelFactory.test.ts src/browser/hooks/useModelsFromSettings.test.ts
  • make static-check
  • git diff --check

Risks

Low risk: the default model bump itself is already on main; this PR only fills in surrounding metadata, examples, and tests. Runtime changes are limited to comments and test coverage, with no behavior change beyond leaderboard metadata availability.


Generated with mux • Model: openai:gpt-5.5 • Thinking: xhigh • Cost: $13.05

@ThomasK33

Copy link
Copy Markdown
Member Author

@codex review

@chatgpt-codex-connector

Copy link
Copy Markdown

Codex Review: Didn't find any major issues. 🎉

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

@mintlify

mintlify Bot commented May 28, 2026

Copy link
Copy Markdown

Preview deployment for your docs. Learn more about Mintlify Previews.

Project Status Preview Updated (UTC)
Mux 🟢 Ready View Preview May 28, 2026, 5:26 PM

💡 Tip: Enable Workflows to automatically generate PRs for you.

@ThomasK33

Copy link
Copy Markdown
Member Author

@codex review

Addressed the CI unit failure by updating the remaining Copilot dot-form Opus fixture from 4.7 to 4.8. The integration timeout reproduced locally as passing for the failing test file, so the new push should rerun CI on a clean commit.

@chatgpt-codex-connector

Copy link
Copy Markdown

Codex Review: Didn't find any major issues. Bravo.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

@ThomasK33 ThomasK33 force-pushed the claude-version-q36w branch from 370b1ad to 403c776 Compare May 28, 2026 19:05
@ThomasK33 ThomasK33 changed the title 🤖 feat: update default model to Claude Opus 4.8 🤖 feat: complete Claude Opus 4.8 metadata coverage May 28, 2026
@ThomasK33

Copy link
Copy Markdown
Member Author

@codex review

Rebased onto latest main and resolved the overlap with #3409. The PR now contains only the remaining Opus 4.8 metadata/examples/test coverage diff on top of main.

@chatgpt-codex-connector

Copy link
Copy Markdown

Codex Review: Didn't find any major issues. Delightful!

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Updates the curated Opus model, gateway/workflow defaults, token metadata, docs, and tests for Anthropic Claude Opus 4.8.

---

_Generated with `mux` • Model: `openai:gpt-5.5` • Thinking: `xhigh` • Cost: `$13.05`_

<!-- mux-attribution: model=openai:gpt-5.5 thinking=xhigh costs=13.05 -->
@ThomasK33 ThomasK33 force-pushed the claude-version-q36w branch from 403c776 to 40097d6 Compare May 28, 2026 19:12
@ThomasK33

Copy link
Copy Markdown
Member Author

@codex review

Ran simplify/deslop review, removed duplicate Opus wire-transform test setup by table-driving the 4.7/4.8 cases, and tightened the native-1M test name after rebasing on main.

@chatgpt-codex-connector

Copy link
Copy Markdown

Codex Review: Didn't find any major issues. Breezy!

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

@ThomasK33 ThomasK33 added this pull request to the merge queue May 28, 2026
@github-merge-queue github-merge-queue Bot removed this pull request from the merge queue due to failed status checks May 28, 2026
@ThomasK33 ThomasK33 added this pull request to the merge queue May 29, 2026
Merged via the queue into main with commit ac6b195 May 29, 2026
40 of 42 checks passed
@ThomasK33 ThomasK33 deleted the claude-version-q36w branch May 29, 2026 09:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant