🤖 feat: complete Claude Opus 4.8 metadata coverage#3410
Conversation
|
@codex review |
|
Codex Review: Didn't find any major issues. 🎉 ℹ️ About Codex in GitHubYour team has set up Codex to review pull requests in this repo. Reviews are triggered when you
If Codex has suggestions, it will comment; otherwise it will react with 👍. Codex can also answer questions or update the PR. Try commenting "@codex address that feedback". |
|
Preview deployment for your docs. Learn more about Mintlify Previews.
💡 Tip: Enable Workflows to automatically generate PRs for you. |
|
@codex review Addressed the CI unit failure by updating the remaining Copilot dot-form Opus fixture from 4.7 to 4.8. The integration timeout reproduced locally as passing for the failing test file, so the new push should rerun CI on a clean commit. |
|
Codex Review: Didn't find any major issues. Bravo. ℹ️ About Codex in GitHubYour team has set up Codex to review pull requests in this repo. Reviews are triggered when you
If Codex has suggestions, it will comment; otherwise it will react with 👍. Codex can also answer questions or update the PR. Try commenting "@codex address that feedback". |
370b1ad to
403c776
Compare
|
Codex Review: Didn't find any major issues. Delightful! ℹ️ About Codex in GitHubYour team has set up Codex to review pull requests in this repo. Reviews are triggered when you
If Codex has suggestions, it will comment; otherwise it will react with 👍. Codex can also answer questions or update the PR. Try commenting "@codex address that feedback". |
Updates the curated Opus model, gateway/workflow defaults, token metadata, docs, and tests for Anthropic Claude Opus 4.8. --- _Generated with `mux` • Model: `openai:gpt-5.5` • Thinking: `xhigh` • Cost: `$13.05`_ <!-- mux-attribution: model=openai:gpt-5.5 thinking=xhigh costs=13.05 -->
403c776 to
40097d6
Compare
|
@codex review Ran simplify/deslop review, removed duplicate Opus wire-transform test setup by table-driving the 4.7/4.8 cases, and tightened the native-1M test name after rebasing on |
|
Codex Review: Didn't find any major issues. Breezy! ℹ️ About Codex in GitHubYour team has set up Codex to review pull requests in this repo. Reviews are triggered when you
If Codex has suggestions, it will comment; otherwise it will react with 👍. Codex can also answer questions or update the PR. Try commenting "@codex address that feedback". |
Summary
Completes the Claude Opus 4.8 rollout on top of the upstream default-model bump by adding the remaining metadata, examples, and regression coverage that mirror the previous Opus upgrade pattern.
Background
mainnow includes the core Claude Opus 4.8 default-model update from #3409. After rebasing this branch ontomain, this PR is the follow-up diff: Terminal-Bench leaderboard metadata, story/test fixtures, model-routing examples, and explicit Opus 4.8 coverage around native 1M/xhigh behavior.Implementation
anthropic/claude-opus-4-8to Terminal-Bench leaderboard submission metadata and examples.Validation
bun test src/common/constants/knownModels.test.ts src/common/utils/ai/models.test.ts src/common/utils/ai/providerOptions.test.ts src/node/services/providerModelFactory.test.ts src/browser/hooks/useModelsFromSettings.test.tsmake static-checkgit diff --checkRisks
Low risk: the default model bump itself is already on
main; this PR only fills in surrounding metadata, examples, and tests. Runtime changes are limited to comments and test coverage, with no behavior change beyond leaderboard metadata availability.Generated with
mux• Model:openai:gpt-5.5• Thinking:xhigh• Cost:$13.05