Advisor

Advisor is a standalone extraction of the local pre-execution steering module that was originally embedded inside Hermes.

It does 4 things:

builds a compact context packet for a task
runs a local MLX model to produce structured steering advice
validates and stores advice/output traces in SQLite
exports basic metrics and training examples

This repo contains the advisor module itself, not the full Hermes runtime.

Repo layout

agent/advisor/ — core advisor code
scripts/ — metrics/export helpers
tests/agent/advisor/ — focused tests
docs/ — existing architecture diagrams and images

Core modules

schemas.py — pydantic contracts for requests, advice, outcomes
settings.py — local config/env loading
context_builder.py — repo slice, candidate files, task typing, failure lookup
runtime_mlx.py — MLX/MLX-LM inference wrapper
validator.py — trims/dedupes model output into safe structured advice
gateway.py — main entrypoint + optional FastAPI app factory
trace_store.py — SQLite persistence for runs, outcomes, patterns
metrics.py — summary metrics over stored runs
labeling.py — JSONL training export helpers
injector.py — renders advice into an injected hint block

What this repo includes

local advisor runtime
trace store
export scripts
focused unit tests
architecture docs copied from the earlier design work

What this repo does not include

full auth/tenancy hardening for multi-tenant service use
paper-faithful final results reporting

Quick start

Python 3.11+

Create and activate a local virtualenv:

python3 -m venv .venv
source .venv/bin/activate

Install base deps:

pip install -e .

Install advisor runtime + dev deps:

pip install -e '.[advisor,dev]'

Run tests:

pytest tests/agent/advisor -q

Check the installed CLI:

advisor version
advisor serve --host 127.0.0.1 --port 8000
advisor operator-overview
advisor deployment-profile --mode hosted
advisor hardening-profile --mode hosted
advisor export-bundle --output-dir ./bundle

Current repo status:

standalone package install works
focused advisor test suite passes
no Hermes-specific Python references remain in this repo
config can now load from ADVISOR_CONFIG=/path/to/advisor.toml
health checks now expose runtime/config state on /healthz
CI workflow now runs lint-only on Python 3.12
Ruff linting is configured and passes locally
inference runtime now supports retries, timeout handling, warm-load, and fallback behavior
reward weights now support named config presets (balanced, conservative, human-first) plus explicit overrides
Phase 10 orchestration now supports executor/verifier plug-ins, deterministic A/B routing, replayable manifests, and optional second-pass review
Phase 11 adds redacted packet exports, structured run-event logs, live metrics export, and audit reporting
Phase 12 adds real HTTP/subprocess executor integrations, real verifier adapters, integration registry construction, and parity-tested baseline vs advisor-assisted execution
Phase 13 adds frozen benchmark suites, benchmark run manifests, deterministic baseline-vs-advisor summaries, and ablation-friendly reporting
Phase 14 adds persisted training manifests, checkpoint registry lifecycle, and benchmark-driven promotion/rollback decisions
Phase 15 adds operator deployment profiles, run inspection endpoints, persistent background job queueing/resume, and retention enforcement with archival rotation
Phase 16 adds a paper-faithful results-pass layer with canonical study summaries, ablation/transfer reporting, failure taxonomy, provenance coverage, and explicit paper-divergence reporting
Phase 17 adds finished-product hardening with release gates, auth/tenancy/isolation profiles, backup/import-export bundle paths, alert summaries, and locked truth-surface contract versions
GitHub CI installs .[dev] only, since MLX runtime deps are Apple-specific and not required for the test suite

Contributing

See CONTRIBUTING.md for local setup, test, and lint workflow.

Helper scripts

Export successful runs as JSONL:

python scripts/export_advisor_training_examples.py ./out/train.jsonl --min-quality-score 0.5

Summarize trace metrics:

python scripts/advisor_metrics_summary.py

Architecture docs

See docs/ARCHITECTURE.md for the image index and diagram notes.

Production roadmap

See docs/PRODUCTION_CHECKLIST.md for the staged production roadmap toward a generic, paper-faithful advisor implementation with reward-driven improvement.

Paper foundation

This repo is grounded in How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models.

See docs/PAPER_FOUNDATION.md for the repo-level design rules derived from that paper.

Reference

How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models
arXiv abstract: https://arxiv.org/abs/2510.02453
PDF: https://arxiv.org/pdf/2510.02453

License

Apache License 2.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 124 Commits
.github/workflows		.github/workflows
agent/advisor		agent/advisor
config		config
docs		docs
scripts		scripts
tests/agent/advisor		tests/agent/advisor
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Advisor

Repo layout

Core modules

What this repo includes

What this repo does not include

Quick start

Contributing

Helper scripts

Architecture docs

Production roadmap

Paper foundation

Reference

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Advisor

Repo layout

Core modules

What this repo includes

What this repo does not include

Quick start

Contributing

Helper scripts

Architecture docs

Production roadmap

Paper foundation

Reference

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages