Skip to content

Persist live orchestrator trajectories#48

Merged
sarvesh1327 merged 2 commits into
mainfrom
live-orchestrator-trajectories
Apr 24, 2026
Merged

Persist live orchestrator trajectories#48
sarvesh1327 merged 2 commits into
mainfrom
live-orchestrator-trajectories

Conversation

@sarvesh1327
Copy link
Copy Markdown
Owner

  • Persist profile-local AdvisorTrajectory records for live orchestrator runs, including packet/advice, executor observations, verifier hints, final outcome, and reward evidence.

Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 75854dcb8b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

reward_label=reward_label,
)
self.trace_store.record_lineage(packet.run_id, manifest, lineage)
self.trace_store.record_trajectory(
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Avoid storing baseline runs as advisor trajectories

run() now records a trajectory unconditionally, even when routing_decision.arm is "baseline" and no advice was injected into the executor. In that AB-test path, the persisted trajectory still pairs primary_advice with the observed outcome/reward, so downstream consumers of advisor_trajectories can misinterpret off-policy baseline behavior as advisor-attributed training evidence whenever advisor_fraction < 1.0. Gate trajectory persistence to advisor-arm runs or persist routing metadata in the trajectory so baseline samples can be filtered.

Useful? React with 👍 / 👎.

@sarvesh1327 sarvesh1327 merged commit a610f6d into main Apr 24, 2026
1 check passed
@sarvesh1327 sarvesh1327 deleted the live-orchestrator-trajectories branch April 24, 2026 10:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant