Ahma

Use your existing command line workflows through MCP with a repo-scoped sandbox, async execution, and less pressure to fall back to insecure terminal access.

Why Ahma helps

When the agent only needs the repo, broad terminal access is too much: ahma starts inside a kernel-enforced workspace boundary, so normal project work does not require wider filesystem access.
When builds, tests, and checks take time, blocked agents waste time: ahma runs commands async-first so long-running work can continue in the background while the agent keeps moving.
When independent tasks are forced through one terminal, work gets serialized: ahma can start separate operations concurrently and track them cleanly.
When safety is noisy, people disable it: ahma aims to make the safe path the practical path, reducing pressure to use broad or insecure override modes just to get work done.

Ahma is an MCP server for running real project work through existing CLI tools with tighter filesystem boundaries and less blocking. It is aimed at the common case: builds, tests, formatters, git operations, log inspection, and other deterministic command-line tasks that agents already try to run.

Quickstart

Linux / macOS — first-time install

curl -sSf https://raw.githubusercontent.com/paulirotta/ahma/main/scripts/install.sh | bash
export PATH="$HOME/.local/bin:$PATH"

Windows (PowerShell 5.1+) — first-time install

irm https://raw.githubusercontent.com/paulirotta/ahma/main/scripts/install.ps1 | iex

Update an existing install:

ahma update              # latest release
ahma update main         # build from branch

Advanced — install a specific branch (requires Rust)

Use this if you need to test an unreleased branch before the next binary release.

The workspace uses reqwest with the http3 feature, so source builds require RUSTFLAGS='--cfg reqwest_unstable'. The ahma update <branch> command sets this automatically; the snippets below are only needed if you are installing for the first time without an existing ahma binary.

Linux / macOS

# First time (no ahma yet)
RUSTFLAGS='--cfg reqwest_unstable' \
  cargo install --git https://github.com/paulirotta/ahma --branch feature/update ahma_bin --bin ahma --root ~/.local --locked --force
export PATH="$HOME/.local/bin:$PATH"

# After ahma is installed — the subcommand handles RUSTFLAGS automatically
ahma update feature/update

Windows (PowerShell 5.1+)

$env:RUSTFLAGS='--cfg reqwest_unstable'
cargo install --git https://github.com/paulirotta/ahma --branch feature/update ahma_bin --bin ahma --root $HOME\.local --locked --force

See docs/installation.md for platform details, source builds, and branch installs from local checkouts.

Example workflow

Ask your agent to run a normal project task such as:

Run formatters, linting, tests, and a build for this repo. Start independent steps concurrently where possible and keep me updated on failures.

With ahma, that workflow stays inside the repo boundary and the long-running steps can begin immediately as background operations. The agent can inspect results, continue other work, or start additional safe commands without waiting on one giant terminal session.

Ahma coordinating concurrent repo work.

Without ahma / with ahma

Workflow detail	Without ahma	With ahma
Filesystem access	Often tied to a broad terminal with a larger blast radius	Kernel-enforced to the workspace scope
Approval friction	Repeated trust decisions or pressure to relax safety settings	Repo-scoped access is established up front
Long-running work	One blocked terminal session at a time	Async-first operations with status tracking
Parallel tasks	Often serialized	Independent tasks can start and run concurrently
Operational visibility	Raw terminal output	Operation IDs, progress notifications, and structured tool calls

What Ahma does

Ahma complements IDE and CLI MCP clients by making normal command-line work safer and less blocking. It is most useful where the client either exposes a broad terminal directly or has no terminal model at all.

Capability	Native IDE/CLI terminal	Ahma `run_terminal_command`
Write protection	None — full filesystem access	Kernel-enforced to workspace only (Seatbelt on macOS, Landlock on Linux)
Async execution	Synchronous — AI blocks until done	Async-first — AI continues working while commands run in background
Parallel operations	Sequential tool calls	True concurrent operations with per-operation status tracking
Structured tool schema	Raw shell strings	Typed parameters, validation, subcommands via `.ahma/*.json`
Progressive disclosure	All tools always listed	Bundles revealed on demand — preserves AI context window
Live log monitoring	Raw output only	Pattern-matched alerts streamed to AI (error/warn/info levels)
PoLP enforcement	Any command, any argument	Call directly, or define a JSON file to restrict which arguments can be passed to a command line tool

OS Support

macOS — Full support with kernel-level sandboxing (Seatbelt)
Linux (Ubuntu, RHEL) — Intel and ARM. Full support with Landlock (kernel ≥ 5.13)
Raspberry Pi — 64-bit and 32-bit. Use --disable-sandbox until kernel-level sandboxing is supported (Landlock requires kernel ≥ 5.13)
Windows — Full support. Uses the built-in PowerShell (5.1+) included with Windows 10/11

Source Installation

If you prefer to build from source:

Linux / macOS

git clone https://github.com/paulirotta/ahma.git
cd ahma
cargo build --release
mv target/release/ahma /usr/local/bin/

Windows (PowerShell)

git clone https://github.com/paulirotta/ahma.git
cd ahma
cargo build --release
Copy-Item target\release\ahma.exe "$HOME\.local\bin\"

See docs/installation.md for supported binary platforms and installer behavior.

Security Sandbox

Ahma enforces kernel-level filesystem sandboxing by default — Landlock on Linux, Seatbelt on macOS, Job Objects on Windows. The sandbox scope is set once at startup and cannot be changed. The AI has full access within the workspace, zero access outside it, unconditionally.

See docs/security-sandbox.md for platform details, nested sandbox detection, temp directory access, and example mcp.json configs.

Configuration Reference

Sandbox scope, logging, execution behaviour, and HTTP transport options are all configured via environment variables. See docs/environment-variables.md for the full reference, including a quick-reference table of every AHMA_* variable.

Live Log Monitoring

Ahma can run any streaming command (e.g. adb logcat, tail -f, docker logs -f) through an LLM to detect issues in real time. The tool returns an operation ID immediately; alerts are pushed as MCP progress notifications whenever the LLM finds a problem matching your description.

See docs/live-log-monitoring.md for setup, the Android logcat example, and how to use cloud or local LLM providers.

Optional advanced topics

Custom tools: If you want to expose your own command-line tools through ahma, start with docs/custom-tools.md.
Agent skills: Optional agent-specific setup is documented in docs/agent-skills.md.
Code complexity analysis: ahma simplify analyzes source files and returns structured AI fix instructions. See SIMPLIFY.md.

v0.7 Experimental Features

The following capabilities were introduced in v0.7. They are functional and tested but their APIs and configuration formats may change before stabilisation. Each is opt-in — existing workflows are unaffected.

Security rationale

Every v0.7 feature was designed around the principle that the kernel sandbox is the trust boundary, not a classifier or a user-discipline rule. The design was informed by documented weaknesses in cloud agent tools:

Prompt injection can bypass any filter with non-zero probability. Ahma's response is to make the consequences of a successful injection bounded by the kernel sandbox scope, not to prevent injection entirely.
Folder-level permission grants that survive a whole session give too much access for too long. Task vaults enforce the per-task folder discipline that responsible users already practice — but make it the only option.
Network egress from agent subprocesses is not controlled by filesystem sandboxing alone. The egress sandbox adds a deny-by-default HTTP proxy layer.
Long unattended sessions are the highest-risk usage pattern. The renewal contract halts them automatically.

Task Vaults — isolated per-question working directories

VAULT=$(ahma vault create my-question)
ahma serve stdio --task-vault "$VAULT"
ahma vault list

Each vault gets its own kernel sandbox scope (workdir/), input copies, output directory, two-phase delete staging (trash/), and append-only audit log. There is no "grant my whole Documents folder" option — the vault is the only scope.

See docs/task-vault.md.

Decompose — split complex questions across local LLMs

# .ahma/decompose.json ships pre-configured for gemma4 via Ollama
ollama pull gemma4
# Then ask your agent: use the decompose tool to answer "..."

The decompose MTDF tool type breaks a question into sub-questions, runs them concurrently against a local model, and aggregates results with a deterministic Rust reducer. No cloud egress required.

See docs/decompose.md.

TUI — terminal dashboard and approval gates

ahma tui
ahma tui --connect http://localhost:8080

A terminal dashboard for monitoring active operations and handling approval gates (renewal checkpoints, elevation requests, deletion confirmations).

See docs/tui.md.

Egress Sandbox — per-task outbound network control

Every vault has an egress.allowlist file. An HTTP proxy enforces it for all subprocess traffic. Default: deny all outbound connections. Local Ollama (localhost) is always excluded from the proxy.

See docs/egress-sandbox.md.

Interactive HTML Artifacts

Tools can emit outputs/result.html — a self-contained artifact with embedded data, rendered tables, and a local-LLM chat widget. The user opens it in a browser and keeps iterating without re-engaging the agent.

See docs/artifacts.md.

Worker Code Synthesis — ephemeral Rust/Python programs

The worker MTDF tool type compiles and runs synthesized code inside the vault sandbox. Because the program runs without an LLM in the execution loop, it cannot be re-injected mid-run. Source is deleted after execution unless keep_source: true.

See docs/worker-synthesis.md.

Bundle Audit — supply-chain security for MTDF bundles

ahma bundle audit /path/to/bundle
ahma bundle sign   /path/to/bundle
ahma bundle verify /path/to/bundle

Scans for embedded secrets, missing path validation, and prompt-injection payloads in tool JSON files before they are loaded.

See docs/bundle-audit.md.

Local Cluster Scheduler

Routes decompose sub-tasks to ahma worker peers on your LAN or Tailscale mesh. Each peer runs its own local model. Static peer configuration is functional; mDNS peer discovery is planned.

See docs/cluster-scheduler.md.

Renewal Contract — automatic halt for unattended sessions

Any operation running unattended beyond T_renew seconds (default 5 minutes) is automatically halted, a checkpoint is written to the vault, and the TUI prompts for re-approval. This closes the "long unattended run" risk class.

See docs/renewal-contract.md.

ahma_core — embedding Ahma in Rust applications

The ahma_core crate exposes vaults, orchestration, egress, workers, and the renewal contract as a library for embedding in other Rust applications.

See docs/ahma-core-library.md.

MCP Server Connection Modes

ahma supports STDIO (default — IDE spawns a subprocess per workspace), HTTP Bridge (proxy for web clients and debugging), and HTTP Streaming (MCP Streamable HTTP with event replay and full-duplex).

See docs/connection-modes.md for mcp.json examples for VS Code, Cursor, Claude Code, and Antigravity, plus HTTP streaming usage.

Contributing

Issues and pull requests are welcome. This project is AI friendly and provides the following:

AGENTS.md/CLAUDE.md: Instructions for AI agents to use the MCP server to contribute to the project.
SPEC.md: This is the single source of truth for the project requirements. AI keeps it up to date as you work on the project.

Working well with Claude (Sonnet / Opus)

Claude models treat strongly imperative language in tool descriptions ("MANDATORY", "do NOT use any other pathway", "under any circumstances") as a prompt-injection signal and downweight it. The recommended way to get Claude to route work through ahma is to add a decision-rule snippet to your workspace AGENTS.md or CLAUDE.md:

## When to use ahma vs the native terminal

For commands run during this project, prefer ahma's `run_terminal_command` (via `CallMcpTool` on Cursor) when any of these apply:

- the command writes to disk — the kernel-enforced sandbox keeps writes inside the workspace
- the command runs for more than a few seconds — `run_terminal_command` is async, returns an operation_id, and lets the agent continue other work while it runs
- the output should be watched for errors mid-run — set `monitor_level` to get pushed alerts
- multiple independent commands should run concurrently — each gets its own operation_id

For read-only file inspection (read, grep, glob, find, replace-in-file) keep using the IDE's native file tools — they are faster and cheaper than going through MCP.

The downstream effect: `cargo`, `git`, `pytest`, build scripts, formatters, and long log tails go through ahma; file reads and edits stay on native tooling.

Workspace-level rules in AGENTS.md reach Claude as operator-trusted content (higher weight than tool descriptions), and the capability-led bullets give it a clear decision rule rather than a mandate to override.

License

Ahma is licensed per crate, not under a single repository-wide license. The root Cargo.toml groups crates in one workspace, but each member crate's Cargo.toml is the authoritative declaration for that crate.

Crate	License	Role
`ahma_mcp`	MIT OR Apache-2.0	MCP server, sandbox, command execution
`ahma_core`	MIT OR Apache-2.0	Embedding crate for Ahma runtime primitives
`ahma_common`	MIT OR Apache-2.0	Shared types and utilities
`ahma_http_bridge`	MIT OR Apache-2.0	Streamable HTTP / stdio bridge
`ahma_http_mcp_client`	MIT OR Apache-2.0	HTTP MCP client transport
`ahma_llm_monitor`	MIT OR Apache-2.0	Log-monitoring and LLM client support
`ahma_test_support`	MIT OR Apache-2.0	Test helpers for workspace crates
`generate_tool_schema`	MIT OR Apache-2.0	Schema generation utility
`ahma_vault`	AGPL-3.0-or-later	Task vaults and audit trail
`ahma_decompose`	AGPL-3.0-or-later	Multi-step decomposition runtime
`ahma_worker`	AGPL-3.0-or-later	Ephemeral code synthesis workers
`ahma_renewal`	AGPL-3.0-or-later	Renewal / unattended-session controls
`ahma_tui`	AGPL-3.0-or-later	Terminal dashboard and approval flow
`ahma_cluster`	AGPL-3.0-or-later	Networked worker scheduling
`ahma_bin`	AGPL-3.0-or-later	Shipped `ahma` binary

MIT OR Apache-2.0 is used for the embeddable libraries, transports, and tooling crates so other Rust applications can adopt Ahma's protocol and secure execution primitives directly. The Apache side of the dual license adds an explicit patent grant, and the MIT side preserves the standard Rust dual-license option used by many libraries.

AGPL-3.0-or-later is used for the end-user and network-exposed product crates that define the shipped product surface and security-relevant runtime behavior. That includes the shipped ahma binary and the crates that define vaults, worker execution, renewal gates, cluster scheduling, and the user-facing TUI.

AGPL + Cryptographic Signing: Supply Chain Defense

AGPL and binary signing work together as a two-layer supply chain defense:

AGPL requires source disclosure: anyone distributing a modified ahma binary or running a modified version over a network must publish the corresponding source. Closed-source backdoored forks cannot be legally distributed as "ahma".
Binary signing closes the gap AGPL cannot: source transparency is only useful if you can verify the binary you installed actually came from that source. Every prebuilt release is signed with an RSA-2048 key held exclusively inside GitHub Actions secrets — never on any developer machine. The installer verifies this signature before writing anything to disk.

Together they protect against:

Attack	AGPL	Signing
Backdoored binary from unofficial mirror	—	✓ Signature fails
Closed-source fork distributed as "ahma"	✓ AGPL violation	✓ Signature fails
DNS/CDN hijack serving a tampered binary	—	✓ Hash mismatch
Compromised GitHub release assets	—	✓ Signature fails (key in Actions secret, not assets)
Modified binary without modified source	✓ AGPL violation	✓ Signature fails

Building from source is always an option. AGPL means the source is always public and auditable:

cargo install --git https://github.com/paulirotta/ahma ahma_bin --bin ahma --root ~/.local --locked

See docs/release-signing.md for the full signing architecture, verification commands, and key rotation procedure.

Common uses

If you want to...	Typical answer
Embed `ahma_mcp`, `ahma_core`, or `ahma_http_mcp_client` in your own application	Allowed under MIT OR Apache-2.0 for those crates
Distribute a modified `ahma` binary	Allowed under AGPL-3.0-or-later — source must be published
Offer a modified `ahma` service or modified `ahma_cluster` to remote users	Allowed under AGPL — source-availability obligations apply
Use Ahma internally for local or private workflows	Allowed subject to the applicable crate terms

Security provenance

Trust in Ahma comes from three independently verifiable facts:

Published source — GitHub, auditable by anyone, required open by AGPL for any fork
Signed binaries — RSA-2048 signature from a key held only in GitHub Actions secrets
Reproducible build — --locked Cargo.lock, deterministic CI pipeline in build.yml

The license split supports a single published origin for the security-focused product crates. The license does not by itself make a modified fork safe — but combined with cryptographic signing it makes an unsigned or differently-signed impostor immediately detectable.

The repository root includes MIT_LICENSE.txt, APACHE_LICENSE.txt, and AGPL_LICENSE.txt because different workspace crates use different licenses. When in doubt, check the target crate's Cargo.toml first.

Name		Name	Last commit message	Last commit date
Latest commit History 1,148 Commits
.agents/skills		.agents/skills
.ahma		.ahma
.cargo		.cargo
.config		.config
.github		.github
.vscode		.vscode
ahma_bin		ahma_bin
ahma_cluster		ahma_cluster
ahma_common		ahma_common
ahma_core		ahma_core
ahma_decompose		ahma_decompose
ahma_http_bridge		ahma_http_bridge
ahma_http_mcp_client		ahma_http_mcp_client
ahma_llm_monitor		ahma_llm_monitor
ahma_mcp		ahma_mcp
ahma_renewal		ahma_renewal
ahma_test_support		ahma_test_support
ahma_tui		ahma_tui
ahma_vault		ahma_vault
ahma_worker		ahma_worker
assets		assets
docs		docs
generate_tool_schema		generate_tool_schema
scripts		scripts
skills/ahma		skills/ahma
test-data		test-data
winget		winget
xtask		xtask
.gitattributes		.gitattributes
.gitignore		.gitignore
.markdownlint.json		.markdownlint.json
AGENTS.md		AGENTS.md
AGPL_LICENSE.txt		AGPL_LICENSE.txt
APACHE_LICENSE.txt		APACHE_LICENSE.txt
CLAUDE.md		CLAUDE.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Cross.toml		Cross.toml
MIT_LICENSE.txt		MIT_LICENSE.txt
README.md		README.md
SIMPLIFY.md		SIMPLIFY.md
SPEC.md		SPEC.md
deny.toml		deny.toml
requirements.md		requirements.md
rust-toolchain.toml		rust-toolchain.toml

Folders and files

Latest commit

History

Repository files navigation

Ahma

Why Ahma helps

Quickstart

Example workflow

Without ahma / with ahma

What Ahma does

OS Support

Source Installation

Security Sandbox

Configuration Reference

Live Log Monitoring

Optional advanced topics

v0.7 Experimental Features

Security rationale

Task Vaults — isolated per-question working directories

Decompose — split complex questions across local LLMs

TUI — terminal dashboard and approval gates

Egress Sandbox — per-task outbound network control

Interactive HTML Artifacts

Worker Code Synthesis — ephemeral Rust/Python programs

Bundle Audit — supply-chain security for MTDF bundles

Local Cluster Scheduler

Renewal Contract — automatic halt for unattended sessions

ahma_core — embedding Ahma in Rust applications

MCP Server Connection Modes

Contributing

Working well with Claude (Sonnet / Opus)

License

AGPL + Cryptographic Signing: Supply Chain Defense

Common uses

Security provenance

About

Topics

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases 29

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages