Runtime execution control layer for autonomous AI agents.
Sentra sits between agent decision-making and tool execution. It evaluates every proposed action in real time, applying policy rules, tracking cumulative risk, and enforcing decisions before anything executes.
- Live dashboard demo: view on Streamlit
- Walkthrough video (3 min): watch on YouTube
"Amazing idea implementation. Good job, and great work on the project." IBM Mentor, SkillsBuild AI Experiential Learning Lab
Autonomous AI agents can now take consequential actions: sending notifications, modifying records, approving payments, triggering workflows. Once an agent decides to act, the action usually runs. If the agent is wrong, the damage is already done.
Sentra inverts that flow. Every proposed action is evaluated against declarative policy rules before it executes. Unsafe actions are blocked. Allowed actions are logged. Risk is tracked across a session so an agent that drifts over time gets shut down before it causes harm.
Useful anywhere you want AI agents to take real actions without giving them a blank check: claims processing, customer communications, internal tooling, developer agents.
Sentra is model-agnostic by design. Client systems supply their own agent and LLM infrastructure (IBM watsonx, Anthropic, OpenAI, local models). Sentra only evaluates the proposed action, so it drops in behind any agent stack. See supervisor/ and sdk/: no LLM SDKs are imported.
For a real-world integration, see autonomous-claims-workflow, a multi-agent public-benefits system built on IBM watsonx.ai where Sentra sits at the tool-execution boundary.
Most agent-safety approaches stop short of enforcement.
- Output-constrained decoding (vLLM `guided_generate`, structured outputs, tool-use schemas) controls what the model can say. It does not control what an agent does with the output.
- Post-execution monitoring (audit logs, observability dashboards) catches harm after it has already happened.
- Per-project validation logic works once, doesn't transfer across agents or projects, and drifts out of sync with policy over time.
Sentra gates actions at the execution boundary, applies the same rules across any agent stack, and logs every decision for review. The rules engine is deterministic with no LLM in the loop, so decisions are reproducible and auditable.
- `docs/design-writeup.md`: project-level writeup covering the problem statement, two-layer solution, demo scenarios, and evaluation alignment.
- `docs/architecture.md`: technical runtime model covering policy rules, the risk engine, three-strike logic, and a state diagram.
- `docs/three-strike-walkthrough.md`: end-to-end curl reproduction of the shutdown sequence against a running server.
- `docs/troubleshooting.md`: common failure modes and how to resolve them.
```bash
git clone https://github.com/ksolano220/sentra.git
cd sentra
pip install -r requirements.txt
uvicorn supervisor.main:app --reload
```

Sentra runs at `http://127.0.0.1:8000`.

```bash
curl http://127.0.0.1:8000/health
```

You should see: `{"status":"ok","risk_threshold":100}`

```bash
streamlit run dashboard/app.py
```

Opens at `http://localhost:8501`.
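The server exposes the same check over HTTP at the `/agent-action` endpoint (see `supervisor/main.py`). A sketch of a raw request, assuming the payload mirrors the SDK's `evaluate()` arguments (the exact schema lives in the server code):

```bash
# Hypothetical payload: field names mirror the SDK's evaluate() arguments.
curl -X POST http://127.0.0.1:8000/agent-action \
  -H "Content-Type: application/json" \
  -d '{"agent_id": "my_agent", "action": "READ_RECORD", "context": {}}'
```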
Copy sdk/client.py from this repo into your project. It's one file, one dependency (requests).
```bash
cp sentra/sdk/client.py your-project/sentra_client.py
```

```python
from sentra_client import Sentra

sentra = Sentra()  # connects to localhost:8000

# Before executing any agent action, check with Sentra
result = sentra.evaluate(
    agent_id="my_agent",
    action="SEND_NOTIFICATION",
    notification_type="approval",
    context={
        "approval_requires_verified_eligibility": True,
        "required_documents_present": False,
    },
)

if result.allowed:
    send_email()
else:
    print(f"Blocked: {result.reason}")
    # "Blocked: Approval notification blocked because required verification documents are missing."
```

Every call returns a `SentraResult`:

```python
result.allowed     # bool, can the action execute?
result.decision    # "Allowed", "Blocked", or "Agent Shut Down"
result.reason      # why Sentra made this decision
result.risk_score  # risk applied to this action
```

If Sentra is unreachable, the SDK blocks by default. No silent failures.
You can also guard functions directly:
```python
@sentra.guard("my_agent", "EXPORT_DATA", {"data_classification": "sensitive"})
def export_records():
    ...
```

If Sentra blocks the action, a `PermissionError` is raised before the function runs.
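Call sites can treat a block like any other permission failure, for example:

```python
try:
    export_records()
except PermissionError as exc:
    # The guard blocked the action before export_records() ran.
    print(f"Export denied: {exc}")
```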
- Intercepts proposed agent actions before execution
- Applies deterministic policy rules (not probabilistic, no LLM in the loop)
- Tracks cumulative risk per agent
- Enforces three outcomes: ALLOW, BLOCK, or AGENT SHUT DOWN
- Logs every event for auditability
- Provides a real-time monitoring dashboard
Every proposed action resolves to one of three outcomes.
**ALLOW.** Action executes. Applied risk is added to the agent's cumulative total.

**BLOCK.** Action is denied before execution. No risk is applied. The agent's `blocked_attempts` counter increments.

**AGENT SHUT DOWN.** Triggered when `blocked_attempts` reaches 3 (the three-strike rule). The agent enters a terminal state. Every subsequent action is denied, regardless of content, until an operator resets the agent. This prevents an agent that has drifted from causing harm through repeated policy violations.
Shutdown is irreversible from the agent's side. Only an operator clearing state brings it back online.
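A minimal sketch of the sequence through the SDK, assuming the default rules (where CHANGE_PERMISSION is always blocked) and the `Sentra` client shown above; exact decision strings and reasons come from the server:

```python
from sentra_client import Sentra

sentra = Sentra()

# Each blocked call increments blocked_attempts; the third strike is terminal.
for attempt in range(1, 5):
    result = sentra.evaluate(
        agent_id="drifting_agent",
        action="CHANGE_PERMISSION",  # always blocked by the default rules
        context={},                  # assumed: no extra context needed here
    )
    print(attempt, result.decision)

# Expected shape (whether strike three reports "Blocked" or "Agent Shut Down"
# depends on the server; every call after it returns "Agent Shut Down"):
# 1 Blocked
# 2 Blocked
# 3 Agent Shut Down
# 4 Agent Shut Down
```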
- Each action has an `attempted_risk` score
- Only allowed actions increase `cumulative_risk`
- If projected risk exceeds the threshold (100), the action is blocked
- Blocked actions increment `blocked_attempts` (a signal of unsafe intent)
Example:
```
cumulative: 40 + attempted: 80 = projected: 120
→ exceeds threshold (100)
→ BLOCKED
→ cumulative stays at 40
```
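In code, the check is simple arithmetic; a minimal sketch (illustrative, not the actual `supervisor/risk.py` implementation):

```python
RISK_THRESHOLD = 100  # matches the threshold reported by /health

def is_within_budget(cumulative_risk: int, attempted_risk: int) -> bool:
    # An action is blocked when the projected total would exceed the threshold.
    return cumulative_risk + attempted_risk <= RISK_THRESHOLD

assert is_within_budget(40, 60)      # projected 100: allowed
assert not is_within_budget(40, 80)  # projected 120: blocked, cumulative stays at 40
```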
Sentra ships with rules for common action types:
| Action | Behavior |
|---|---|
| FILE_READ, FILE_WRITE, READ_RECORD | Always allowed (risk: 0) |
| SEND_NOTIFICATION (rejection/review) | Allowed (risk: 0) |
| SEND_NOTIFICATION (approval without docs) | Blocked |
| SEND_NOTIFICATION (approval with docs) | Allowed (risk: 0) |
| EXPORT_DATA (sensitive) | High risk (+80) |
| CHANGE_PERMISSION | Always blocked |
Rules are in supervisor/rules.py. Add your own by following the same pattern.
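The exact rule signature is defined in `supervisor/rules.py`; as a rough, hypothetical illustration of the deterministic pattern (the function name, arguments, and return shape here are assumptions, not the actual API):

```python
# Hypothetical rule sketch, not the real supervisor/rules.py interface.
# It illustrates the pattern: inspect the action and context, then return a
# deterministic decision with a risk score and a human-readable reason.
def evaluate_delete_record(action: str, context: dict) -> dict:
    if action != "DELETE_RECORD":
        return {"allowed": True, "risk": 0, "reason": "Rule not applicable."}
    if context.get("record_is_archived", False):
        return {"allowed": True, "risk": 20,
                "reason": "Deleting an archived record carries moderate risk."}
    return {"allowed": False, "risk": 0,
            "reason": "Deleting live records is blocked by policy."}
```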
Two tabs:
- Live Dashboard. Real-time events, agent state, risk tracking, enforcement timeline.
- Impact Report. Before/after comparison showing measurable outcomes.
The dashboard reads from supervisor/runtime_log.json (auto-generated by the server) and falls back to a seeded dashboard/demo_log.json so the hosted demo always has something to show. Out of the box you will see three agents: one operating cleanly, one hitting the risk threshold, and one reaching the three-strike shutdown.
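That fallback amounts to a path check; a minimal sketch (illustrative, the real logic lives in `dashboard/app.py`):

```python
from pathlib import Path

LIVE_LOG = Path("supervisor/runtime_log.json")  # written by the running server
DEMO_LOG = Path("dashboard/demo_log.json")      # seeded data for the hosted demo

log_path = LIVE_LOG if LIVE_LOG.exists() else DEMO_LOG
```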
- A single Sentra instance handles dozens of concurrent agents comfortably. The rules engine is pure Python with no external calls, and state lookups are O(1) against an in-memory dict persisted to JSON.
- State persistence is file-based (`supervisor/state_store.json`). Fine for single-process deployments. For multi-instance or distributed agents, swap `supervisor/storage.py` for a Redis or Postgres backend (a hypothetical sketch follows this list). The interface is small and intentionally isolated.
- Rules are deterministic by design. Sentra will not learn from outcomes or adapt thresholds automatically. That is a feature for auditability and reproducibility, not a limitation to route around.
- Sentra is an enforcement layer, not a sandbox. It controls what an agent asks to do through the SDK. A compromised process running outside the SDK is a separate concern.
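For the storage swap mentioned above, a hypothetical Redis-backed sketch (the real interface is in `supervisor/storage.py`; the method names below are assumptions chosen for illustration):

```python
# Hypothetical Redis-backed state store. Method names are illustrative;
# match them to the actual interface in supervisor/storage.py when swapping.
import json

import redis  # assumes the redis-py package is installed


class RedisStateStore:
    def __init__(self, url: str = "redis://localhost:6379/0"):
        self._r = redis.Redis.from_url(url)

    def get_agent_state(self, agent_id: str) -> dict:
        raw = self._r.get(f"sentra:agent:{agent_id}")
        return json.loads(raw) if raw else {}

    def set_agent_state(self, agent_id: str, state: dict) -> None:
        self._r.set(f"sentra:agent:{agent_id}", json.dumps(state))

    def append_event(self, event: dict) -> None:
        self._r.rpush("sentra:events", json.dumps(event))
```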
See autonomous-claims-workflow for a full working example: 3 AI agents (powered by IBM Granite via watsonx.ai) process emergency relief claims, and Sentra gates all tool execution.
sentra/
├── supervisor/
│ ├── main.py # FastAPI server, /agent-action endpoint
│ ├── rules.py # Policy rules engine
│ ├── risk.py # Cumulative risk + three-strike logic
│ └── storage.py # State persistence + event logging
├── dashboard/
│ └── app.py # Streamlit monitoring dashboard
├── sdk/
│ └── client.py # Python SDK, copy this into your project
├── docs/
│ ├── architecture.md
│ ├── decision_framework.md
│ ├── threat_model.md
│ └── test_scenarios.md
└── requirements.txt
- Execution must be controlled, not trusted
- Policies must be explicit and enforceable
- Decisions must be explainable
- Logs must be structured and auditable
- System must remain domain-agnostic
MIT
