Skip to content

Cursor AI Agent Team Framework: A three-agent collaboration system for AI-assisted work

License

Notifications You must be signed in to change notification settings

thiswind/cursor-agent-team

Repository files navigation

Logo

Cursor AI Agent Team Framework

A methodology for human-AI collaboration, implemented as a Cursor framework. Intelligence augmentation — an elite team under your command, with zero handoff and inherent context continuity.

cursor-agent-team is first a methodology and a philosophy of how humans and AI should work together—then a framework that implements it. How we think about AI shapes how we build with it.

What This Is

cursor-agent-team is an architectural reference implementation for single-conversation, multi-role human-AI collaboration.

This IS This is NOT
Methodology contribution Commercial product / SaaS
Reference architecture General-purpose framework
Advanced tool for methodologically-aware individuals/small teams Enterprise platform
Single-conversation, multi-role paradigm Multi-agent, multi-instance paradigm

We chose depth over breadth. This is a design blueprint that prioritizes implementation completeness over broad compatibility.

Why cursor-agent-team?

We augment human capability; we don't replace it. Three design pillars:

  1. Multi-role, not multi-agent — One LLM, one conversation. /discuss and /crew share the same context. No agent handoff, no context loss. Like a meeting room where everyone has perfect memory.

  2. Human-in-the-loop by design — You are the conductor. We explore, you decide. We execute, you confirm. "Your command, our execution" — not "set and forget."

  3. Empowerment, not replacement — Democratizing team access: individuals get team-level capability. Cognitive load redistribution: you think strategy, we handle execution details. Frees you from the "details quagmire" for purer thinking.

Target users: Individuals and small teams with methodological awareness—those who think about how they work with AI, not just what they want AI to do. This is not a plug-and-play solution for users seeking immediate productivity gains without conceptual investment.

We believe: AI should augment human judgment, not replace it. Context continuity matters more than agent count. Plans grounded in fresh research beat plans from training data alone. And the human must remain in the loop—as conductor, not spectator.

Core value (formal): Intelligence Augmentation (IA); Democratization of expertise / Capability expansion; Cognitive load redistribution; Human-in-the-loop (HITL); Human-AI Teaming — human as conductor.

Core innovation: Multi-role, single-conversation architecture. Zero handoff → inherent context continuity. Multi-agent systems face context loss at handoff; we avoid it by design. No AI-AI coordination; human orchestrates.

Design philosophy: (1) Intelligence Augmentation — augment, not replace. (2) Human-in-the-loop by design; not set-and-forget. (3) Multi-role, not multi-agent — role as "mask," single conversation. (4) Human-AI teaming — human as conductor.

Design principles: Zero handoff; Plan-and-Execute (planning-execution separation); Constrained generation / specification-driven; Exploration vs exploitation (by role); Retrieval-augmented planning (knowledge cutoff mitigation); Dedicated agent workspace (scratchpad, external memory, staged generation); Common Ground / Mental Model Alignment (persona); Persona Sandboxing.

What it is

A multi-role collaboration framework for Cursor IDE and Qwen Code. One LLM wears different "masks" (commands) in the same conversation. Provides:

  • Structured workflow: discuss → plan → execute
  • Specialized roles: Each command has distinct responsibilities
  • Hard constraint validation: Python scripts ensure deterministic output
  • Extensible team: Create new roles via /prompt_engineer

Positioning & Related Concepts

Concept Our approach
Intelligence Augmentation (IA) We augment human cognitive capability rather than replace it (Licklider's man-computer symbiosis; Springer 2024)
Multi-role vs multi-agent Multi-agent systems use handoffs; context loss is a critical challenge. We avoid it by design: zero handoff, one conversation
Human-AI teaming Human as conductor; AI roles are "masks" in the same meeting (National Academies 2022)
Cognitive load redistribution You focus on strategy; we handle execution details (Cognitive Load Theory)
vs cursor-agent-team
Multi-agent frameworks No handoff, no context loss
Autonomous agents Human-in-the-loop, not set-and-forget
Generic AI assistants Structured roles, workflow enforcement, team metaphor

Positioning in the Landscape

We occupy a specific niche: single-conversation, multi-role, context-preserving collaboration.

Approach Representative Key Difference
Multi-Agent Handoff Google ADK, Microsoft AutoGen They optimize handoff; we eliminate it
Role-Playing MAS ChatCollab, SupportPlay Multi-instance, multi-conversation; we stay single-instance
Single-Model Multi-Ability CALM Model-level unification; we focus on workflow orchestration
Cursor Ecosystem cursor-agents, cursor-rules templates Engineering practice; we add methodology depth

Evaluation context: Among single-conversation, multi-role, context-heavy approaches, cursor-agent-team is a first-tier architectural reference implementation—designed for methodological exploration, not product deployment.

Quick Start

Tell Cursor Agent:

Install cursor-agent-team from https://github.com/thiswind/cursor-agent-team.git as a submodule and run the install script.

Then type /discuss to start.

For manual installation or Qwen Code, see Installation.

Core Roles

Role Command Description
Discussion Partner /discuss Exploration mode — breadth and depth, no execution. Research-first planning: automatically searches for latest academic and industry research before synthesizing plans (Retrieval-augmented planning; knowledge cutoff mitigation).
Crew Member /crew Execution mode — strict adherence to plan as specification. Plan-and-Execute architecture; constrained generation. Exploitation mode.
Prompt Engineer /prompt_engineer Creates and maintains new roles (commands)

Research-first planning — Plans should not come from LLM training data alone. Training data has a knowledge cutoff; plans synthesized from it can be outdated or wrong. We design /discuss to search for latest academic and industry research before synthesizing plans (retrieval-augmented planning). Fresh context, then synthesis—a methodological stance, not just a feature.

Workflow

Framework Banner — /crew, /discuss, /prompt_engineer

/discuss → [Explore & Plan] → /crew → [Execute] → Done
                ↓
         /prompt_engineer → [Create New Role] → Use New Command
  1. Plan: Use /discuss to explore ideas and generate execution plans
  2. Execute: Use /crew to execute the plans
  3. Expand: Use /prompt_engineer to create new roles when needed

Installation

Cursor IDE — Tell Cursor Agent to install, or run manually:

git submodule add -f https://github.com/thiswind/cursor-agent-team.git cursor-agent-team
./cursor-agent-team/install.sh

Update: git submodule update --remote cursor-agent-team && ./cursor-agent-team/install.sh

Qwen Code:

git submodule add -f https://github.com/thiswind/cursor-agent-team.git cursor-agent-team
./cursor-agent-team/install_qwen.sh

Update: git submodule update --remote cursor-agent-team && ./cursor-agent-team/install_qwen.sh

Note: The workspace at cursor-agent-team/ai_workspace/ is shared between both platforms.

Features

Core

Agent Workspace — Dedicated persistent workspace for agents. Agents can write scripts, take notes, save intermediate results from searches and research. Aligns with scratchpad reasoning and external memory research; enables staged refinement for higher output quality than direct generation. See ai_workspace/README.md.

Persona System (v0.8.0+) — Script-driven persona integration with Persona Sandboxing: the persona expresses at the Output Layer; the Work Layer (code, analysis, reasoning) runs in a clean context. This prevents style contamination from affecting technical accuracy. Based on persona-spec.

Communication requires synchronization of mental models. Persona provides warmth and rapport that increase human affinity and trust, improving coordination efficiency between human leaders and AI teams—not for companionship, but for more effective human-machine collaboration.

python cursor-agent-team/_scripts/persona_output.py --check

Inspiration Capital (v0.7.0+) — Scatter card collection for sparking creativity.

python ai_workspace/inspiration_capital/scripts/create_card.py --source "Source" --trigger "Trigger"
python ai_workspace/inspiration_capital/scripts/draw_cards.py --count 3

See ai_workspace/inspiration_capital/README.md for details.

Extended

  • Text-to-Speech (macOS): Voice feedback via say; activated when user requests ("read to me"). python cursor-agent-team/_scripts/tts_speak.py --check
  • Social Media: Integration with Moltbook. See .cursor/rules/social_media_policy.mdc
  • Spec-Kit Translator: Converts plans to spec-kit format. /spec_translator PLAN-B-001

Technical Architecture

Hybrid architecture: LLM soft constraints (prompt rules) + script hard constraints (Python). Critical operations use deterministic scripts to validate outputs before committing.

┌─────────────────────────────────────────────────┐
│                    LLM Layer                     │
│   (Soft Constraints: Prompt rules)              │
└────────────────────┬────────────────────────────┘
                     │ Calls
                     ▼
┌─────────────────────────────────────────────────┐
│                  Script Layer                    │
│   (Hard Constraints: Python scripts)             │
│   - validate_topic_tree.py  - preflight_check.py │
│   - cleanup_ai_workspace.py                     │
└─────────────────────────────────────────────────┘

Architecture highlights: Multi-role + single conversation; Plan-and-Execute; Dedicated agent workspace (context engineering, cognitive artifacts); Hybrid constraints (soft + hard); Phase markers (workflow verification); Command-as-role.

Why This Architecture

We are not behind the Skills wave—we are ahead of it. Our design addresses problems that traditional rules-based and skill-based architectures cannot solve.

Orchestration vs Capability

Approach Focus What it solves
Rules Passive constraints by scope Code style, conventions—but cannot role-switch or orchestrate workflow
Skills Capability modules (add-and-use) Extend what the agent can do—but no workflow model, no join points
Ours Orchestration-first, methodology-first How humans and AI collaborate—workflow, role switching, spec-driven execution

We define collaboration workflow; we don't just add capabilities. Command + Rules + Scripts work together: Command defines phases (join points), Rules define aspects, Scripts provide deterministic validation.

Aspect-Oriented Design

Cross-cutting concerns (Gleaning, Wandering, Persona Output, TTS) are woven into the workflow at defined join points—not embedded in core logic. Commands define Phase/Step as join points; Rules define aspects that invoke scripts at those points. Traditional Skill architectures have no workflow model or join points; they cannot achieve this weaving.

Spec-Script Integration

Specification (Command + mdc) drives when and why to call; scripts execute how with deterministic validation. This aligns with "Blueprint First, Model Second" (workflow logic in spec, LLM for bounded tasks) and Formal-LLM (hard constraints via script validation). The spec-script loop—LLM reads spec, runs script, script validates—runs in a single conversation.

Why Cursor

Cursor provides Commands (workflow definition), Rules (aspect definition), and Agent (script execution) in one session. This tight integration enables spec-driven execution and AOP-style weaving.

This binding is intentional. We start with Cursor because:

  • Its Rules, integrated terminal, and workspace model align naturally with our command–rules–scripts–workspace architecture
  • It aggregates state-of-the-art models behind a single subscription
  • Its IDE experience (interface, file tree, workspace semantics) currently leads the space

We prioritize depth on Cursor rather than breadth of platform support. A watered-down, platform-agnostic version would lose the tight spec-script loop that makes our methodology work.

Future ports will be considered only where we can preserve the same methodological guarantees (minimal handoff, HITL, workspace semantics). This is a conscious design choice, not a limitation or oversight.

See cursor-agent-team/_scripts/README.md for script details.

Minimal Handoff in Numbers

Handoff Type Context Cost Effect
State-transfer (MAS) 50–200KB compressed state 10–20% context retention (estimate)
Prompt-swap (ours) 1–3KB rule text Full history preserved

We don't transfer state; we swap masks. The "Writer" knows what the "Planner" discussed because they share the same memory stream.

Research Foundation

This framework is grounded in peer-reviewed research:

Concept Foundation
Intelligence Augmentation Licklider (1960): human-computer symbiosis
Lost in the Middle Liu et al. (2023): context degradation in long sequences
Aspect-Oriented Programming Kiczales et al. (1997): cross-cutting concerns separation
Retrieval-Augmented Planning RaDA, RPG: fresh information before synthesis

For a detailed treatment, see our preprint:

cursor-agent-team: A Multi-Role, Single-Conversation Framework for Human-AI Collaboration
Read the paper (PDF)

Built with cursor-agent-team

We used this framework to write its own academic paper—a form of "dogfooding" that subjects the methodology to its own claims.

  • Five structured plans (PLAN-AA-001 through PLAN-AA-005) executed via /crew and /writer
  • Minimal handoff in action: the writer role retained full context of design discussions
  • Phase markers prevented step-skipping; vocabulary bans were enforced via external grep
  • Result: 4,000-word preprint, 12 pages, ready for arXiv submission

This self-referential implementation experience validates the framework's practical utility.

Direction

See DIRECTION.md for potential future directions.

We focus on methodology depth over feature breadth. No timeline commitments — the project evolves based on real needs.

Version

Current version: v0.10.13. See CHANGELOG.md.

License

GNU General Public License v3.0 (GPL-3.0). See LICENSE.

Author

thiswind@thiswind

About

Cursor AI Agent Team Framework: A three-agent collaboration system for AI-assisted work

Resources

License

Stars

Watchers

Forks

Packages

No packages published