feat(steering): dynamic steering — activation-conditioned steering by RhizoNymph · Pull Request #180 · RhizoNymph/vllm

RhizoNymph · 2026-06-17T07:50:43Z

Draft. This PR ties activation capture to activation steering, so captured activations decide when and how to steer. There are three controller tiers (async → sync → in-graph), each configuring the one below it. Design authority: docs/design/dynamic_steering.md (plus dynamic_steering_apc_notification.md and dynamic_steering_row_gating.md).

What's here

Phase 0 — async transport. In-process steering_action_queue (bounded, decode-tier-only validation), drained at the top of _update_steering_buffers.
Phase 1a — sync consumers + per-request actuation. execution="sync" consumer axis (every TP rank, 1-step latency); dynamic-override row pool (pure routing); observability + GET /v1/steering/dynamic; event-based on_step timing.
Phase 1b — gain primitives. Per-row strength scale (§5.3) + dedicated-gather dynamic additive tier (§5.4).
Phase 2 — in-graph monitor. A graph-safe monitor op computes a per-token gate sigmoid(sharpness·(residual·probe − threshold)) and modulates the §5.4 tier within the same forward pass. When gate_rows is set, it also gates per-request rows (decode-only; prefill is protected via a decode mask).
APC (automatic prefix caching) correctness. The worker notifies the scheduler of the effective decode steering signature, so decode KV produced under dynamic steering is not falsely reused. This resolves the streaming-continuation prefix-cache hole.
Example controller with emit_mode = scale | monitor (cheap tier-gain or in-graph probe).
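The Phase 0 transport above (a bounded queue, validated on enqueue, drained at the top of each step) can be sketched as follows. This is an illustration only: the `SteeringAction` fields, the `BoundedActionQueue` class, the reject-when-full policy, and the `update_steering_buffers` helper are all hypothetical and are not taken from the PR's code.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class SteeringAction:
    # Hypothetical action shape; the PR's real schema is not shown here.
    request_id: str
    tier: str      # e.g. "decode" or "prefill"
    scale: float


class BoundedActionQueue:
    """Illustrative stand-in for an in-process steering action queue."""

    def __init__(self, maxsize: int) -> None:
        self._items = deque()
        self._maxsize = maxsize

    def put(self, action: SteeringAction) -> bool:
        # Assumed policy: only decode-tier actions are accepted,
        # and a full queue rejects instead of blocking the producer.
        if action.tier != "decode":
            return False
        if len(self._items) >= self._maxsize:
            return False
        self._items.append(action)
        return True

    def drain(self) -> list:
        # Hand over everything queued so far, in arrival order.
        drained = list(self._items)
        self._items.clear()
        return drained


def update_steering_buffers(queue: BoundedActionQueue, scales: dict) -> dict:
    # Drain first, so actions enqueued before this step take effect in it.
    for action in queue.drain():
        scales[action.request_id] = action.scale
    return scales
```

Draining at the top of the step means an action enqueued during step N is applied no later than step N+1, which is one plausible reading of the async tier's latency.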
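The Phase 2 gate formula above can be illustrated on a single token. This is a minimal sketch in plain Python, assuming residual·probe denotes a dot product between one token's residual vector and a probe direction. The names `monitor_gate` and `gated_delta` and the `is_decode` flag are hypothetical; the real op runs in-graph on batched tensors.

```python
import math


def monitor_gate(residual, probe, sharpness, threshold):
    """Per-token gate: sigmoid(sharpness * (residual . probe - threshold))."""
    # Dot product of the token's residual vector with the probe direction.
    score = sum(r * p for r, p in zip(residual, probe))
    z = sharpness * (score - threshold)
    # Numerically stable sigmoid: never exponentiates a large positive value.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def gated_delta(delta, gate, is_decode):
    # Assumed masking rule: prefill tokens are left unsteered (gate forced to 0).
    g = gate if is_decode else 0.0
    return [g * d for d in delta]
```

When the probe score equals the threshold the gate is 0.5; a larger sharpness pushes the gate toward a hard on/off step around the threshold.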
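The APC item above relies on the idea that KV blocks produced under different steering must not share a cache key. The sketch below shows that idea in isolation, assuming the steering signature is available as bytes when a block is keyed. `block_cache_key` and its inputs are hypothetical and do not reproduce vLLM's actual block hashing.

```python
import hashlib


def block_cache_key(token_ids, parent_key: bytes, steering_signature: bytes) -> bytes:
    """Hypothetical prefix-cache key that also depends on the steering signature."""
    # repr() of the whole tuple keeps the three fields unambiguous when hashed.
    payload = repr((parent_key, tuple(token_ids), steering_signature)).encode()
    return hashlib.sha256(payload).digest()


# Same tokens, different effective steering: the keys differ, so KV is not reused.
unsteered = block_cache_key([1, 2, 3], b"", b"")
steered = block_cache_key([1, 2, 3], b"", b"probe-A:scale=0.5")
```

Under this scheme, a streaming continuation whose decode tokens were produced with steering active would hash to a different key than the same tokens decoded without it.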

Status / validation

GPU-validated on gemma4-31B: tp=1 (per-request actuation, tier, APC reuse), tp=2 cross-node (rank replication + APC re-keying), pp=2, the active in-graph monitor (tier + row gating), and row-gating kernel/op/cudagraph parity. Extensive CPU test suites also cover the change.

Notes for review

Includes a fix for a pre-existing deadlock in the decode-only per-request steering short-circuit (also proposed standalone against the base branch in #178, fix(steering): decode-only per-request steering dropped by nothing-active short-circuit).
Deferred: model_runner_v2 integration (upstream dev-flag-gated).

RhizoNymph added 30 commits June 11, 2026 03:11

3ef4057 feat(steering): in-process dynamic steering action queue drained per step
0af6ced feat(capture): dynamic steering controller example plugin (probe-gated decode steering)
5570113 docs(steering): dynamic steering design (phases 0-2, determinism + cache analysis)
eef9780 test(steering): real-manager integration tests for dynamic action queue
335fc84 docs(steering): rework phase 1 around sync/async consumer execution axis
949b96c docs(steering): record phase 1 decisions (per-request first, dynamic tier, packed banks)
9d345f8 feat(capture): sync consumer execution axis with all-rank slim manager and on_step
35b8f7b feat(steering): dynamic-override row pool for per-request decode steering
4276c96 feat(steering): dynamic steering observability + sync plugin migration with per-request actuation
8d3767f docs(steering): record tp=1 GPU validation of phase 1a; fix plugin build backend
0f4945c fix(examples): use setuptools.build_meta backend in minimal_plugin and activation_reward_producer (Co-authored-by: Claude)
95f2914 feat(steering): event-based on_step timing for honest sync-consumer cost
9e02feb docs(steering): record GPU validation of event-based on_step timing + throughput A/B
42a77bc feat(steering): sync-consumer warmup hook + engine-level dynamic-override e2e test
c4e36dd test(steering): force spawn start method in dynamic-steering e2e test
a889bc9 test(steering): assert dynamic-override e2e within-run to dodge batched FP nondeterminism
fb265f4 fix(test): move NOISE_FLOOR constant above decorators
5c5637a docs(steering): record warmup-hook and engine-level e2e GPU validation
906bb3e feat(steering): dynamic additive tier (populate-folding) — compose with operator decode steering
c7782d2 docs(steering): lock in policy expressiveness contract across controller tiers
050068b feat(steering): per-row strength scale tensor (kernel + manager + scale action)
4205263 feat(steering): dedicated-gather dynamic tier (per-token gate + kernel term), replace populate-folding
4dc3c75 feat(steering): in-graph monitor op — per-token gate from probe (Phase 2 M1)
5003751 feat(steering): monitor config in manager + runner populate/short-circuit (Phase 2 M2)
4538d62 feat(steering): monitor action + dispatch + status + kernel warmup (Phase 2 M3)
0f70497 docs(steering): Phase 2 in-graph monitor design — resolve reset/gating open items
6cec618 docs(steering): record Phase 2 monitor CPU + GPU validation (node2)
6bc09a0 feat(steering): example controller emit_mode=scale|monitor (cheap tier-gain + in-graph probe)
be882de docs(steering): plan for worker->scheduler APC steering-signature notification
b29e6e7 feat(steering): effective decode steering signature in manager (APC notification M1)

RhizoNymph and others added 30 commits July 1, 2026 22:58

1e3bbd9 fix(steering): fail-safe declarative gate resolution at admission
1249cec fix(steering): declarative probe gates fail closed
c6900e9 fix(steering): port declarative override parity (compose+precedence) to v2 runner
68bf71b test(steering): update stale fixtures for post-#217/#219 runner state
ce980e3 Merge pull request #220 from RhizoNymph/fix/declarative-probe-cache
    fix(steering): content-keyed bounded probe tensor cache
10fb2e1 Merge pull request #221 from RhizoNymph/fix/latch-bridge-compose
    fix(steering): bridged overrides preserve compose_admitted
f5c5fb1 Merge pull request #222 from RhizoNymph/fix/resolve-gates-guard
    fix(steering): fail-safe declarative gate resolution at admission
17a628b Merge pull request #224 from RhizoNymph/fix/v2-declarative-override-parity
    fix(steering): port declarative override parity (compose+precedence) to v2 runner
333592e Merge pull request #225 from RhizoNymph/test/steering-fixture-parity
    test(steering): update stale fixtures for post-#217/#219 runner state
4242489 Merge remote-tracking branch 'origin/feat/dynamic-steering' into fix/declarative-gate-fail-closed
    Conflicts: tests/v1/test_steering_schema.py
3f2dbca Merge pull request #223 from RhizoNymph/fix/declarative-gate-fail-closed
    fix(steering): declarative probe gates fail closed
0b41e0c chore(steering): latch byte bounds + documented trust model
39d5532 fix(steering): warmup matches runtime row-monitor specialization; single-source op args
8fd7e5e fix(capture): port client_request_id sidecar + streaming metadata refresh to v1
66cfcd1 test(steering): cross-runner conformance harness for the control plane
1785cfb feat(steering): cross-rank applied-action checksum in dynamic status
0c18794 chore(steering): typed RowOwner state + refcount-0 purge + dirty-state grouping
aef9ff6 Merge pull request #234 from RhizoNymph/chore/steering-row-owner
    chore(steering): typed RowOwner state + refcount-0 purge + dirty-state grouping
f3f94bf Merge pull request #232 from RhizoNymph/test/steering-conformance-harness
    test(steering): cross-runner conformance harness for the control plane
ba8f7ac Merge pull request #231 from RhizoNymph/fix/v1-capture-drift
    fix(capture): port client_request_id sidecar + streaming metadata refresh to v1
9bc506d feat(steering): worker-registered named vectors + latch-by-reference
2939c04 test(steering): drop stale second arg from scheduler override hook call
84de327 Merge pull request #233 from RhizoNymph/feat/steering-determinism-checksum
    feat(steering): cross-rank applied-action checksum in dynamic status
d3258d4 Merge pull request #230 from RhizoNymph/fix/steering-op-args-warmup
    fix(steering): warmup matches runtime row-monitor specialization; single-source op args
6e16a7a Merge pull request #238 from RhizoNymph/feat/steering-named-latch
    feat(steering): worker-registered named vectors + latch-by-reference
4cce740 Merge pull request #239 from RhizoNymph/test/request-steering-sig-parity
    test(steering): drop stale second arg from scheduler override hook call
530861d Merge remote-tracking branch 'origin/feat/dynamic-steering' into chore/steering-trust-hardening
    Conflicts: vllm/v1/worker/steering_model_runner_mixin.py
a4f26c5 test(steering): conformance harness tracks typed RowOwner keys
bc5b111 Merge pull request #240 from RhizoNymph/test/conformance-rowowner-pins
    test(steering): conformance harness tracks typed RowOwner keys
9506093 Merge pull request #229 from RhizoNymph/chore/steering-trust-hardening
    Bound latch memory + document dynamic-steering trust model

RhizoNymph wants to merge 125 commits into feat/integration from feat/dynamic-steering
