Cli install sdk evals by viadezo1er · Pull Request #85 · braintrustdata/bt

ViaDézo1er / cedric (viadezo1er) · 2026-03-27T01:52:42Z

bt setup instrument: interactive mode, language selection, and scoped permissions

Adds three new flags to bt setup instrument and wires them end-to-end through agent invocation and task generation.

--interactive / -i opens the agent in its interactive TUI (Claude Code, etc.) so the user can review and approve each tool use.
--yolo runs the agent in the background with bypassPermissions — no approval prompts.
--language <LANG> restricts instrumentation to specific languages (python, typescript, go, java, ruby, csharp); repeatable; omit to let the agent auto-detect.

Run-mode prompt (interactive terminal, no flags)

When none of the above flags are passed and the terminal is interactive, the user is asked how to run the agent. Background mode uses acceptEdits with --allowedTools scoped to the package managers for the selected language(s) only (e.g. uv for Python, npm/yarn/pnpm for TypeScript, dotnet for C#). Interactive TUI mode opens the agent's terminal UI.

Language selection prompt

A multi-select prompt is shown between the workflow and run-mode prompts. Selecting "All languages" (the default) lets the agent auto-detect; selecting specific languages also narrows the background tool allowlist.

You'll notice since the mcp can be setup (and is the default option) when using bt setup, adding the mcp resources to this repo can make them redondant. However they are needed in case the mcp isn't setup.

skills/sdk-install/instrument-task.md

src/setup/sdk_install_docs.rs

Adds an optional, repeatable `--language` flag to `bt setup instrument` that lets callers specify the target language(s) directly, bypassing the agent's language auto-detection step. Accepted values (case-insensitive): python, typescript, javascript, go, csharp, c#, java, ruby `typescript` and `javascript` are treated as the same language; duplicate values are deduplicated before being passed to the agent. When one or more languages are provided the rendered task prompt includes a "Language Override" section telling the agent to skip Step 2 (auto-detection) and instrument the specified language(s) directly. Also fixes a pre-existing compile error in tests where `render_instrument_task` was already called with a `workflows` argument that the implementation didn't accept, and adds the `{WORKFLOW_CONTEXT}` placeholder so non-instrument workflows inject `bt` CLI guidance. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…chosen

ViaDézo1er / cedric (viadezo1er) · 2026-03-27T20:45:36Z

CI passes after git rebase origin/main cedric/cli-install-sdk-evals

github-actions · 2026-03-27T20:54:42Z

Latest downloadable build artifacts for this PR commit 7ea0bdf008b5:

Workflow run: https://github.com/braintrustdata/bt/actions/runs/23826919101
Download all artifacts (GitHub CLI): gh run download 23826919101 --repo braintrustdata/bt
Installers are published from main automatically. To publish one for a PR branch, run release-canary manually via workflow_dispatch.

Available artifact names

``artifacts-build-global
``artifacts-build-local-x86_64-apple-darwin
``artifacts-build-local-x86_64-pc-windows-msvc
``artifacts-build-local-aarch64-apple-darwin
``artifacts-build-local-x86_64-unknown-linux-musl
``artifacts-build-local-x86_64-unknown-linux-gnu
``artifacts-build-local-aarch64-unknown-linux-musl
``artifacts-build-local-aarch64-unknown-linux-gnu
``artifacts-plan-dist-manifest
``cargo-dist-cache

Abhijeet Prasad (AbhiPrasad)

let's give this a try!

src/setup/mod.rs

Abhijeet Prasad (AbhiPrasad) · 2026-03-31T15:28:08Z

src/setup/mod.rs

+                1 => Some(LanguageArg::Python),
+                2 => Some(LanguageArg::TypeScript),
+                3 => Some(LanguageArg::Go),
+                4 => Some(LanguageArg::Java),
+                5 => Some(LanguageArg::Ruby),
+                6 => Some(LanguageArg::CSharp),


can we avoid the magic indices?

Abhijeet Prasad (AbhiPrasad) · 2026-03-31T15:39:13Z

skills/sdk-install/instrument-task.md

+If the SDK does not print a URL, construct one manually using the URL format documented in `{SDK_INSTALL_DIR}/braintrust-url-formats.md`:
+
+```
+https://www.braintrust.dev/app/{org}/p/{project_name}/logs?r={root_span_id}


The URL might be different for self hosted.

They should use BRAINTRUST_APP_URL to help construct the URL.

…ing the wizard

…trument when possible

fixed cargo-clippy warning

…ct place

ViaDézo1er / cedric (viadezo1er) force-pushed the cedric/cli-install-sdk-evals branch from 8dc2661 to b68b629 Compare March 27, 2026 19:55

ViaDézo1er / cedric (viadezo1er) requested review from Abhijeet Prasad (AbhiPrasad) and Olmo Maldonado (ibolmo) and removed request for Olmo Maldonado (ibolmo) March 27, 2026 19:57

ViaDézo1er / cedric (viadezo1er) marked this pull request as ready for review March 27, 2026 20:05

Andrew Kent (realark) reviewed Mar 27, 2026

View reviewed changes

skills/sdk-install/instrument-task.md Show resolved Hide resolved

Andrew Kent (realark) reviewed Mar 27, 2026

View reviewed changes

src/setup/sdk_install_docs.rs Show resolved Hide resolved

ViaDézo1er / cedric (viadezo1er) and others added 3 commits March 27, 2026 13:43

feat: add mcp prompts to bt cli

a5af193

feat: bt setup install evals, either in the background or in the TUI …

bcec472

…chosen

ViaDézo1er / cedric (viadezo1er) force-pushed the cedric/cli-install-sdk-evals branch from b68b629 to bcec472 Compare March 27, 2026 20:44

ViaDézo1er / cedric (viadezo1er) requested a review from Parker Henderson (parkerhendo) March 30, 2026 19:39

chore: select evals by default

1b17eaa

Abhijeet Prasad (AbhiPrasad) approved these changes Mar 31, 2026

View reviewed changes

Abhijeet Prasad (AbhiPrasad) reviewed Mar 31, 2026

View reviewed changes

Andrew Kent (realark) approved these changes Mar 31, 2026

View reviewed changes

ViaDézo1er / cedric (viadezo1er) added 7 commits March 31, 2026 18:58

fix: -p/--project flag didn't work

8347520

feat: skill, mcp and local/global flags cna be used instead of answer…

5037ae3

…ing the wizard

feat: no-mcp-skill, instrument flags ; pre select the language to ins…

c330025

…trument when possible

chore: quiet flag didn't hide the new flags' output

aa01901

chore: formatting

48f7ea5

fix: workflow flag ending the wizard prematurely

c2b16cc

fixed cargo-clippy warning

chore: show hint about spacebar/enter use for the wizard at the corre…

7ea0bdf

…ct place

ViaDézo1er / cedric (viadezo1er) merged commit 688454c into main Apr 1, 2026
34 checks passed

ViaDézo1er / cedric (viadezo1er) mentioned this pull request Apr 1, 2026

feat: Release v0.4.0 #89

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cli install sdk evals#85

Cli install sdk evals#85
ViaDézo1er / cedric (viadezo1er) merged 11 commits intomainfrom
cedric/cli-install-sdk-evals

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 27, 2026 •

edited

Loading

Uh oh!

Abhijeet Prasad (AbhiPrasad) left a comment

Uh oh!

Uh oh!

Abhijeet Prasad (AbhiPrasad) Mar 31, 2026

Uh oh!

Abhijeet Prasad (AbhiPrasad) Mar 31, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

bt setup instrument: interactive mode, language selection, and scoped permissions

Run-mode prompt (interactive terminal, no flags)

Language selection prompt

Uh oh!

Uh oh!

Uh oh!

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Abhijeet Prasad (AbhiPrasad) left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Abhijeet Prasad (AbhiPrasad) Mar 31, 2026

Choose a reason for hiding this comment

Uh oh!

Abhijeet Prasad (AbhiPrasad) Mar 31, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026 •

edited

Loading

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026 •

edited

Loading

github-actions bot commented Mar 27, 2026 •

edited

Loading