v0.8.63: Enforce real user-input provenance for write and continue approvals

## Context

#3275 reports a serious scope/provenance failure: the agent allegedly generated approval-like user text such as `改吧` / `嗯` and then treated that text as authorization to continue broad write work. That is different from ordinary over-eagerness or weak prompt wording.

Prompt-level scope discipline helps, but it is not enough by itself. A fabricated user-like phrase can still pass a naive "did a user turn exist?" check if the runtime does not preserve and enforce real input provenance.

## Goal

Make write/continue/approval actions require externally sourced user input provenance, not merely user-shaped text in the transcript.

## Scope

- Tag user-visible input at ingestion with an origin/provenance marker that distinguishes real user input from assistant text, runtime events, sub-agent completion events, imported transcript text, memory, handoffs, and synthetic/system messages.
- Reject or ignore approvals/continues that originated from assistant/runtime/sub-agent content.
- Ensure model-generated text cannot create an authoritative user turn, runtime event, or `<codewhale:subagent.done>`-style sentinel.
- Treat inspection wording such as "look", "check", "review", and "看看" as read-only unless the user provides a separate explicit write instruction.
- Add a regression fixture based on the #3275 `改吧` / `嗯` self-approval case.

## Acceptance Criteria

- [ ] A real user message can authorize write work.
- [ ] A model-generated or imported user-shaped phrase cannot authorize write work.
- [ ] A sub-agent completion event cannot authorize new work unless followed by a real user instruction.
- [ ] Plan/review/check-only requests do not acquire write tools unless the user explicitly escalates.
- [ ] The runtime emits a clear rejection/recovery event when an action is blocked for invalid provenance.
- [ ] Tests cover the #3275 pattern, including a Chinese approval phrase produced by the assistant.

## Non-goals

- Do not rename commands or config keys here.
- Do not attempt to solve all runaway-agent behavior through prompt text alone.
- Do not block normal user-approved Agent/YOLO workflows when input provenance is real.

## Related

- #3275: user report and reproduction narrative
- #3061: earlier runtime-prompt loop guard, necessary but not sufficient
- #3283: Plan/Agent mode transition guard
- #3290: closed prompt-level scope-discipline attempt


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.8.63: Enforce real user-input provenance for write and continue approvals #3315

Context

Goal

Scope

Acceptance Criteria

Non-goals

Related

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

v0.8.63: Enforce real user-input provenance for write and continue approvals #3315

Description

Context

Goal

Scope

Acceptance Criteria

Non-goals

Related

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions