Agent Max

Self-hosted AI agent. Opinionated fork of pi-mono with A2A, browser automation, and distributed compute.

Philosophy

Agent state (plans, todos, memory) lives in plain markdown files, not structured tools or hooks. The agent reads and writes .md files directly. No JSON state machines, no tool-call interception, no confirmation flows. Keep it light.

Architecture

Max communicates with companion agents via A2A (Agent-to-Agent) protocol. Users interact through Telegram.

1) System architecture

2) A2A task lifecycle (Nix -> Max)

3) Runtime surfaces and execution modes

Source files for all diagrams are in docs/diagrams/ as both .excalidraw and .png.

Tools

Tool	Description
`browser_control`	Chrome automation via CDP
`browser_task`	Agentic browser tasks (browser-use)
`wake_gpu` / `shutdown_gpu` / `gpu_status`	GPU PC power management (WoL + Ollama)
`ssh_to_nas`	Run commands on remote hosts via SSH
`delegate_to_nix`	Send tasks to companion agents via A2A
`read_file` / `write_file` / `list_files`	Local filesystem operations
`run_shell`	Execute shell commands
`delegate_to_claude_subagent`	Launch Claude Code subagent jobs asynchronously with AgentWeave attribution
`linkedin_search` / `linkedin_results`	LinkedIn scraping
`launchpad_run_scraper` / `launchpad_deploy` / `launchpad_scrape`	Launchpad automation
`ios_list_devices` / `ios_build` / `ios_install`	iOS build and deploy
`context_info`	Agent context and state info

Setup

git clone <repo-url> && cd agent-max
cp .env.example .env
# Fill in your API keys and config in .env
npm install
npm run build
npm start

Environment Variables

See .env.example for the full list. Key variables:

GOOGLE_API_KEY / ANTHROPIC_API_KEY — LLM provider keys (used for direct provider mode)
MUX_ENABLED / MUX_BASE_URL / MUX_API_KEY — route Max through Mux via an OpenAI-compatible endpoint while preserving the requested model name
CLAUDE_SUBAGENT_MODEL — Claude model used by delegate_to_claude_subagent (default: claude-sonnet-4-6)
TELEGRAM_BOT_TOKEN / TELEGRAM_ALLOWED_USERS — Telegram bot config
A2A_PORT — Port for the A2A server (default: 8770)
A2A_SHARED_SECRET — Shared secret for A2A auth between agents
NIX_A2A_URL — URL of the companion NAS agent
NAS_HOST / NAS_USER — NAS SSH access
GPU_HOST / GPU_WOL_URL / GPU_SHUTDOWN_TOKEN — GPU PC management
MAX_A2A_URL — Public URL for this agent's A2A card

Mux integration

Set these in .env to route Max through Mux:

MUX_ENABLED=true
MUX_BASE_URL=http://<mux-host>:8787/v1
# optional if Mux later requires auth
MUX_API_KEY=
# requested model name Max will send to Mux
DEFAULT_MODEL=claude-sonnet-4-6

When Mux is enabled, Max switches to the OpenAI-compatible transport internally, keeps the requested model id (for example claude-sonnet-4-6), and adds X-Runtime: agent-max on requests so Mux can attribute routing decisions by runtime.

Claude Code subagent delegation

delegate_to_claude_subagent runs Claude Code in a background process so Max can stay responsive.

action=start launches a job and immediately returns job_id
action=status returns current state plus output
action=list shows recent jobs

For AgentWeave trace linking, Max injects the following headers into the child Claude process via ANTHROPIC_CUSTOM_HEADERS:

X-AgentWeave-Session-Id (child)
X-AgentWeave-Parent-Session-Id (current Max session)
X-AgentWeave-Agent-Id
X-AgentWeave-Agent-Type=subagent
X-AgentWeave-Task-Label

Requirements:

claude CLI installed and authenticated on the host
ANTHROPIC_BASE_URL should point to your AgentWeave proxy if you want subagent LLM calls visible in AgentWeave

Context sizing

Max automatically compacts old history once the running token estimate crosses a threshold. Defaults target a Claude subscription, where every input token counts against the 5-hour rate limit:

Env var	Default	Meaning
`MAX_CONTEXT_WINDOW`	`200000`	Upper bound used for budget math
`MAX_COMPACT_THRESHOLD`	`0.75`	Fraction of the window before compaction kicks in (default = 150K tokens)
`MAX_KEEP_RECENT`	`6`	Messages always kept intact at the tail

For Gemini direct or other cheap-long-context providers, raise the window:

MAX_CONTEXT_WINDOW=1000000
MAX_COMPACT_THRESHOLD=0.8

The effective values are logged once on the first transformContext call.

Development

npm run dev   # Watch mode with tsx + nodemon
npm run tui   # Interactive TUI client

A2A Protocol

Max exposes an A2A server for receiving tasks from other agents:

GET /.well-known/agent.json - Agent card (public)
GET /health - Health check (public)
POST /tasks - Submit a task (auth required)
POST /tasks/stream - Submit with SSE streaming (auth required)
GET /tasks/:id - Query task status (auth required)

Auth uses Authorization: Bearer <A2A_SHARED_SECRET>.

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
.github/workflows		.github/workflows
docs/diagrams		docs/diagrams
scripts		scripts
src		src
tests		tests
.claudeignore		.claudeignore
.contextignore		.contextignore
.cursorignore		.cursorignore
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
jest.config.cjs		jest.config.cjs
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Max

Philosophy

Architecture

1) System architecture

2) A2A task lifecycle (Nix -> Max)

3) Runtime surfaces and execution modes

Tools

Setup

Environment Variables

Mux integration

Claude Code subagent delegation

Context sizing

Development

A2A Protocol

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agent Max

Philosophy

Architecture

1) System architecture

2) A2A task lifecycle (Nix -> Max)

3) Runtime surfaces and execution modes

Tools

Setup

Environment Variables

Mux integration

Claude Code subagent delegation

Context sizing

Development

A2A Protocol

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages