Skip to content

777genius/claude_agent_teams_ui

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1,717 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Kanban Board  Code Review  Team View  Task Detail  Agent Teams  Execution Logs  Agent Comments  Create Team  Settings

You're the CTO, agents are your team. They handle tasks themselves, message each other, review each other. You just look at the kanban board and drink coffee.

Latest Release  CI Status  Discord

Free desktop app for AI agent teams. Auto-detects Claude/Codex/OpenCode (75+ LLM providers). Use the provider access you already have - subscriptions or API keys. Not just coding agents.

image
demo_new-English_with_captions.mp4
landing_demo_3x.mov

Installation

No prerequisites - the app can detect supported runtimes/providers and guide setup from the UI.

If you want the FRESHEST version, clone the repo and run it from the dev branch.

macOS Apple Silicon
macOS Intel
Windows
May trigger SmartScreen — click "More info" → "Run anyway"
Linux AppImage
.deb   .rpm   .pacman

Table of contents

What is this

An orchestration layer for AI agent teams across Claude and Codex.

  • Claude + Codex orchestration — auto-detect available Claude/Codex runtimes and use the provider access you already have - subscriptions or API keys
  • Assemble your team — create agent teams with different roles that work autonomously in parallel
  • Agents talk to each other — they communicate, create and manage their own tasks, review, leave comments
  • Cross-team communication — agents can fully communicate across different teams; you can configure or prompt them to collaborate and message each other between teams
  • Sit back and watch — tasks change status on the kanban board while agents handle everything on their own
  • Review changes like in Cursor — see what code each task changed, then approve, reject, or comment
  • Built-in review workflow — easily see how agents review each other's tasks to make sure everything went exactly as planned
  • Task-specific logs and messages — clearly see agent/runtime logs (tools), actions and messages in isolation for each individual task, making it easy to trace what happened for any assignment
  • Live process section — see which agents are running processes and open URLs directly in the browser
  • Stay in control — send a direct message to any agent, drop a comment on a task, or pick a quick action right on the kanban card whenever you want to clarify something or add new work
  • Flexible autonomy — let agents run fully autonomous, or review and approve each action one by one (you'll get a notification) — configure the level of control that fits your security needs
  • Solo mode — one-member team: a single agent that creates its own tasks and shows live progress. Saves tokens; can expand to a full team anytime
More features
  • Task creation with attachments — send a message to the team lead with any attached images. The lead will automatically create a fully described task and attach your files directly to the task for complete context.

  • Auto-resume after rate limits — when the lead hits a Claude rate limit and the reset time is known, the app can automatically nudge the lead to continue once the cooldown has passed

  • Deep session analysis — detailed breakdown of what happened in each agent session: bash commands, reasoning, subprocesses

  • Smart task-to-log/changes matching — automatically links session logs/changes to specific tasks

  • Advanced context monitoring system — comprehensive breakdown of what consumes tokens at every step: user messages, Claude.md instructions, tool outputs, thinking text, and team coordination. Token usage, percentage of context window, and session cost are displayed for each category, with detailed views by category or size.

  • Recent tasks across projects — browse the latest completed tasks from all your projects in one place

  • Zero-setup onboarding — built-in runtime detection and provider authentication

  • Built-in code editor — edit project files with Git support without leaving the app

  • Branch strategy — choose via prompt: single branch or git worktree per agent

  • Team member stats — global performance statistics per member

  • Attach code context — reference files or snippets in messages, like in Cursor. You can also mention tasks using #task-id, or refer to another team with @team-name in your messages.

  • Notification system — configurable alerts when tasks complete, agents need your response, new comments arrive, or errors occur

  • MCP integration — supports the built-in mcp-server (see mcp-server folder) for integrating external tools and extensible agent plugins out of the box

  • Post-compact context recovery — when the active runtime compacts its context, the app restores the key team-management instructions so kanban/task-board coordination stays consistent and important operational context is not lost

  • Task context is preserved — thanks to task descriptions, comments, and attachments, all essential information about each task remains available for ongoing work and future reference

  • Workflow history — see the full timeline of each task: when and how its status changed, which agents were involved, and every action that led to the current state

Developer architecture docs

For feature architecture and implementation guidance:

Comparison

Feature Agent Teams Gastown Paperclip Cursor Claude Code CLI
Cross-team communication ✅ Native cross-team messages ⚠️ Cross-rig coordination ⚠️ Company-scoped org work N/A
Agent-to-agent messaging ✅ Native real-time mailbox ✅ Mailboxes + handoffs ⚠️ Comments + @mentions ✅ Team mailbox, no UI
Linked tasks ✅ Cross-refs + dependencies ⚠️ Beads deps + convoys ✅ Goals, parents, blockers ✅ Shared task list
Session analysis ✅ Task logs + token tracking ⚠️ Session recall, feed, OTEL ⚠️ Run transcripts + cost audit ⚠️ Usage command, no UI
Task attachments ✅ Auto-attach, agents read & attach files ❌ Not task-level ✅ Docs, attachments, work products ⚠️ Chat session only ⚠️ Chat images only
Hunk-level review ✅ Accept / reject individual hunks ❌ Bring your own review
Built-in code editor ✅ With Git support ❌ Control plane, not editor ✅ Full IDE
Full autonomy ✅ Agents create, assign, review tasks end-to-end ✅ Mayor, convoys, recovery ✅ Heartbeats + governance ⚠️ Background agents, not teams ✅ Experimental CLI teams
Task dependencies (blocked by) ✅ Guaranteed ordering ✅ DAG waves via Beads ✅ Blockers + execution locks ✅ Team task deps, no UI
Review workflow ✅ Agents review each other + human review UI ⚠️ Refinery merge queue ✅ Approvals + governance ⚠️ PR/BugBot only ✅ Team review, no UI
Zero setup ✅ Guided runtime setup ❌ Go/Git/Dolt/Beads/tmux ⚠️ npx + embedded Postgres ⚠️ CLI + env flag
Kanban board ✅ 5 columns, real-time ❌ Dashboard, not Kanban ✅ 7 columns, drag-and-drop
Execution log viewer ✅ Tool calls, reasoning, timeline ⚠️ Feed, OTEL, dashboard ✅ Run transcripts + ledger ⚠️ Agent chat + terminal
Live processes ✅ View, stop, open URLs in browser ⚠️ Agent health dashboard ⚠️ Manual services + previews ⚠️ Native terminal only
Per-task code review ✅ Accept / reject / comment ⚠️ Merge queue, no diff UI ⚠️ PR/work products, no inline diff ✅ BugBot on PRs
Flexible autonomy ✅ Per-action approvals + notifications ✅ Gates, escalation, recovery ✅ Board approvals, pause, terminate ⚠️ BG agents auto-run commands ✅ Permissions + hooks
Git worktree isolation ✅ Optional ✅ Core primitive ✅ Worktrees / branches ⚠️ Background branches/VMs ⚠️ Manual worktrees
Multi-agent backend ✅ Claude, Codex + OpenCode teammates ✅ Claude, Codex, Gemini, Copilot + more ✅ BYO agents: Claude, Codex, Cursor/OpenCode, HTTP ⚠️ Multi-model agents, no team backend ⚠️ Claude-only experimental teams
Org chart / governance ⚠️ Roles + approvals, no org chart ⚠️ Roles + escalation ✅ Org chart + board governance ⚠️ Team admin only
Budget controls ⚠️ Cost/token visibility, no hard caps ⚠️ Cost tiers + digest, no hard caps ✅ Per-agent budgets + hard stops ⚠️ Usage + BG spend limits ⚠️ /cost + workspace limits
Price Free OSS UI, provider access needed Free OSS, runtime plans needed Free OSS, self-hosted + infra Free + paid usage Claude plan or API usage

Fact sources checked on May 5, 2026: detailed research notes, Gastown README, Gastown provider guide, Gastown scheduler, Paperclip README, Paperclip adapters, Paperclip budgets, Paperclip runtime services, Paperclip Kanban source, Cursor Background Agents, Cursor Diffs & Review, Cursor Bugbot, Cursor pricing, Claude Code agent teams, Claude Code subagents, Claude Code workflows, Claude Code costs, Claude pricing.


Quick start

  1. Download the app for your platform (see Installation)
  2. Launch — On first run, the setup wizard will detect the runtime and guide provider authentication
  3. Create a team — Pick a project, define roles, write a provisioning prompt
  4. Watch — Agents spawn, create tasks, and work. You see it all on the kanban board

FAQ

Do I need to install a runtime before using this app?
No. The app guides runtime detection/setup and provider authentication from the UI - just launch and follow the setup wizard.
Does it read or upload my code?
The app is not a cloud code-sync service. It reads local runtime/session data to power the UI, and your project stays on your machine unless you choose a provider/runtime path that sends data to that provider. In `multimodel` mode, startup may also perform runtime access and capability checks before launch.
Can agents communicate with each other?
Yes. Agents send direct messages, create shared tasks, and leave comments - all coordinated by the app's own orchestration layer.
Is it free?
Yes, free and open source. The app has no paid tier of its own. To run agents, you only need access to a supported provider/runtime, such as Anthropic or Codex.
Can I review code changes before they're applied?
Yes. Every task shows a full diff view where you can accept, reject, or comment on individual code hunks — similar to Cursor's review flow.
What happens if an agent gets stuck?
Send a direct message to course-correct, or stop and restart from the process dashboard. If an agent needs your input, you'll get a notification and the task will show a distinct badge on the board.
Does it support multiple projects and teams?
Yes. Run multiple teams in one project or across different projects, even simultaneously. To avoid Git conflicts, ask agents to use git worktree in your provisioning prompt.

Development

Tech stack

Electron 40, React 19, TypeScript 5, Tailwind CSS 3, Zustand 4. Data from ~/.claude/ (session logs, todos, tasks). The desktop app works with local runtime/session state, while some runtime modes may also use provider or startup capability services when required.

Build from source

Prerequisites: Node.js 20+, pnpm 10+

git clone https://github.com/777genius/claude_agent_teams_ui.git
cd claude_agent_teams_ui
pnpm install
pnpm dev

The app auto-discovers Claude Code projects from ~/.claude/.

Build for distribution

pnpm dist:mac:arm64  # macOS Apple Silicon (.dmg)
pnpm dist:mac:x64    # macOS Intel (.dmg)
pnpm dist:win        # Windows (.exe)
pnpm dist:linux      # Linux (AppImage/.deb/.rpm/.pacman)
pnpm dist            # macOS + Windows + Linux

Scripts

Command Description
pnpm dev Development with hot reload
pnpm build Production build
pnpm typecheck TypeScript type checking
pnpm lint Lint (no auto-fix)
pnpm lint:fix Lint and auto-fix
pnpm format Format code with Prettier
pnpm test Run all tests
pnpm test:watch Watch mode
pnpm test:coverage Coverage report
pnpm test:coverage:critical Critical path coverage
pnpm check Full quality gate (types + lint + test + build)
pnpm fix Lint fix + format
pnpm quality Full check + format check + knip

Roadmap

  • Planning mode to organize agent plans before execution
  • Visual workflow editor (@xyflow/react) for building and orchestrating agent pipelines with drag & drop
  • Remote agent execution via SSH: launch and manage agent teams on remote machines over SSH (stream-json protocol over SSH channel, SFTP-based file monitoring for tasks/inboxes/config)
  • CLI runtime: Run not only on a local PC but in any headless/console environment (web UI), e.g. VPS, remote server, etc.
  • 2 modes: current (agent teams), and a new mode: regular subagents (no communication between them)
  • Curate what context each agent sees (files, docs, MCP servers, skills)
  • Slash commands
  • Outgoing message queue — queue user messages while the lead (or agent) is busy; clear agent-busy status in the UI; flush to stdin or relay from inbox when idle (durable queue on disk for the lead inbox path)
  • createTasksBatch — IPC/service API to create many team tasks in one call (playbooks, markdown checklist import, scripts); complements single createTask
  • Command palette — extend Cmd/Ctrl+K beyond project/session search to runnable actions (quick commands, navigation shortcuts, team/task operations) in a keyboard-first flow
  • Custom kanban columns
  • Run terminal commands
  • Monitor agents processes/stats
  • Reusable agents with SOUL.md
  • Сommunicate via messenger
  • SDK to programmatically launch agents

Contributing

See CONTRIBUTING.md for development guidelines. Please read our Code of Conduct.

Security

IPC and standalone HTTP handlers validate IDs, paths, and payload shape at the boundary. Project editing and write operations are constrained to the selected project root, while read-only discovery also accesses local Claude data under ~/.claude/ and app-owned state paths when required. Path traversal and sensitive config/credential targets are blocked. See SECURITY.md for details.

License

AGPL-3.0

About

You're the CTO, agents are your team. They handle tasks on their own, message each other, and review each other's work. You just watch the kanban board and give high-level commands. Codex/Claude/OpenCode(75+ LLM providers). Build your AI company.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Contributors