Cursor AI Agent Team Framework

A methodology for human-AI collaboration, implemented as a Cursor framework. Intelligence augmentation — an elite team under your command, with zero handoff and inherent context continuity.

cursor-agent-team is first a methodology and a philosophy of how humans and AI should work together—then a framework that implements it. How we think about AI shapes how we build with it.

What This Is

cursor-agent-team is an architectural reference implementation for single-conversation, multi-role human-AI collaboration.

This IS	This is NOT
Methodology contribution	Commercial product / SaaS
Reference architecture	General-purpose framework
Advanced tool for methodologically-aware individuals/small teams	Enterprise platform
Single-conversation, multi-role paradigm	Multi-agent, multi-instance paradigm

We chose depth over breadth. This is a design blueprint that prioritizes implementation completeness over broad compatibility.

Why cursor-agent-team?

We augment human capability; we don't replace it. Three design pillars:

Multi-role, not multi-agent — One LLM, one conversation. /discuss and /crew share the same context. No agent handoff, no context loss. Like a meeting room where everyone has perfect memory.
Human-in-the-loop by design — You are the conductor. We explore, you decide. We execute, you confirm. "Your command, our execution" — not "set and forget."
Empowerment, not replacement — Democratizing team access: individuals get team-level capability. Cognitive load redistribution: you think strategy, we handle execution details. Frees you from the "details quagmire" for purer thinking.

Target users: Individuals and small teams with methodological awareness—those who think about how they work with AI, not just what they want AI to do. This is not a plug-and-play solution for users seeking immediate productivity gains without conceptual investment.

We believe: AI should augment human judgment, not replace it. Context continuity matters more than agent count. Plans grounded in fresh research beat plans from training data alone. And the human must remain in the loop—as conductor, not spectator.

Core value (formal): Intelligence Augmentation (IA); Democratization of expertise / Capability expansion; Cognitive load redistribution; Human-in-the-loop (HITL); Human-AI Teaming — human as conductor.

Core innovation: Multi-role, single-conversation architecture. Zero handoff → inherent context continuity. Multi-agent systems face context loss at handoff; we avoid it by design. No AI-AI coordination; human orchestrates.

Design philosophy: (1) Intelligence Augmentation — augment, not replace. (2) Human-in-the-loop by design; not set-and-forget. (3) Multi-role, not multi-agent — role as "mask," single conversation. (4) Human-AI teaming — human as conductor.

Design principles: Zero handoff; Plan-and-Execute (planning-execution separation); Constrained generation / specification-driven; Exploration vs exploitation (by role); Retrieval-augmented planning (knowledge cutoff mitigation); Dedicated agent workspace (scratchpad, external memory, staged generation); Common Ground / Mental Model Alignment (persona); Persona Sandboxing.

What it is

A multi-role collaboration framework for Cursor IDE and Qwen Code. One LLM wears different "masks" (commands) in the same conversation. Provides:

Structured workflow: discuss → plan → execute
Specialized roles: Each command has distinct responsibilities
Hard constraint validation: Python scripts validate outputs deterministically before they are committed
Extensible team: Create new roles via /prompt_engineer

Positioning & Related Concepts

Concept	Our approach
Intelligence Augmentation (IA)	We augment human cognitive capability rather than replace it (Licklider's man-computer symbiosis; Springer 2024)
Multi-role vs multi-agent	Multi-agent systems use handoffs; context loss is a critical challenge. We avoid it by design: zero handoff, one conversation
Human-AI teaming	Human as conductor; AI roles are "masks" in the same meeting (National Academies 2022)
Cognitive load redistribution	You focus on strategy; we handle execution details (Cognitive Load Theory)

Compared with	cursor-agent-team
Multi-agent frameworks	No handoff, no context loss
Autonomous agents	Human-in-the-loop, not set-and-forget
Generic AI assistants	Structured roles, workflow enforcement, team metaphor

Positioning in the Landscape

We occupy a specific niche: single-conversation, multi-role, context-preserving collaboration.

Approach	Representative	Key Difference
Multi-Agent Handoff	Google ADK, Microsoft AutoGen	They optimize handoff; we eliminate it
Role-Playing MAS	ChatCollab, SupportPlay	Multi-instance, multi-conversation; we stay single-instance
Single-Model Multi-Ability	CALM	Model-level unification; we focus on workflow orchestration
Cursor Ecosystem	cursor-agents, cursor-rules templates	Engineering practice; we add methodology depth

Evaluation context: Among single-conversation, multi-role, context-heavy approaches, cursor-agent-team is a first-tier architectural reference implementation—designed for methodological exploration, not product deployment.

Quick Start

Tell Cursor Agent:

Install cursor-agent-team from https://github.com/thiswind/cursor-agent-team.git as a submodule and run the install script.

Then type /discuss to start.

For manual installation or Qwen Code, see Installation.

Core Roles

Role	Command	Description
Discussion Partner	`/discuss`	Exploration mode — breadth and depth, no execution. Research-first planning: automatically searches for latest academic and industry research before synthesizing plans (Retrieval-augmented planning; knowledge cutoff mitigation).
Crew Member	`/crew`	Execution mode — strict adherence to plan as specification. Plan-and-Execute architecture; constrained generation. Exploitation mode.
Prompt Engineer	`/prompt_engineer`	Creates and maintains new roles (commands)

Research-first planning — Plans should not come from LLM training data alone. Training data has a knowledge cutoff; plans synthesized from it can be outdated or wrong. We design /discuss to search for latest academic and industry research before synthesizing plans (retrieval-augmented planning). Fresh context, then synthesis—a methodological stance, not just a feature.

Workflow

/discuss → [Explore & Plan] → /crew → [Execute] → Done
                ↓
         /prompt_engineer → [Create New Role] → Use New Command

Plan: Use /discuss to explore ideas and generate execution plans
Execute: Use /crew to execute the plans
Expand: Use /prompt_engineer to create new roles when needed
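
The plan-then-execute ordering above can be pictured with a small sketch. It is an illustration only, not the framework's code: the `Conversation` class, its methods, and the plan ID `PLAN-X-001` are hypothetical names chosen for this example. It shows one shared history that both roles append to, and an execution step that refuses to run without a recorded plan.

```python
from dataclasses import dataclass, field


@dataclass
class Conversation:
    """One shared history; roles are swapped, the history is not. (Hypothetical model.)"""
    history: list = field(default_factory=list)
    plans: dict = field(default_factory=dict)

    def discuss(self, plan_id: str, plan_text: str) -> None:
        # /discuss explores and records a plan; it never executes anything.
        self.history.append(f"[discuss] drafted {plan_id}")
        self.plans[plan_id] = plan_text

    def crew(self, plan_id: str) -> str:
        # /crew treats the stored plan as its specification and will not improvise one.
        if plan_id not in self.plans:
            raise LookupError(f"no plan {plan_id!r}; run /discuss first")
        self.history.append(f"[crew] executed {plan_id}")
        return self.plans[plan_id]


session = Conversation()
session.discuss("PLAN-X-001", "rename module, then update imports")
executed = session.crew("PLAN-X-001")
print(session.history)
```

Because both roles write to the same `history`, the execution step sees everything the planning step recorded, which is the property the workflow relies on.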

Installation

Cursor IDE — Tell Cursor Agent to install, or run manually:

git submodule add -f https://github.com/thiswind/cursor-agent-team.git cursor-agent-team
./cursor-agent-team/install.sh

Update: git submodule update --remote cursor-agent-team && ./cursor-agent-team/install.sh

Qwen Code:

git submodule add -f https://github.com/thiswind/cursor-agent-team.git cursor-agent-team
./cursor-agent-team/install_qwen.sh

Update: git submodule update --remote cursor-agent-team && ./cursor-agent-team/install_qwen.sh

Note: The workspace at cursor-agent-team/ai_workspace/ is shared between both platforms.

Features

Core

Agent Workspace — Dedicated persistent workspace for agents. Agents can write scripts, take notes, save intermediate results from searches and research. Aligns with scratchpad reasoning and external memory research; enables staged refinement for higher output quality than direct generation. See ai_workspace/README.md.

Persona System (v0.8.0+) — Script-driven persona integration with Persona Sandboxing: the persona expresses at the Output Layer; the Work Layer (code, analysis, reasoning) runs in a clean context. This prevents style contamination from affecting technical accuracy. Based on persona-spec.

Communication requires synchronization of mental models. Persona provides warmth and rapport that increase human affinity and trust, improving coordination efficiency between human leaders and AI teams—not for companionship, but for more effective human-machine collaboration.

python cursor-agent-team/_scripts/persona_output.py --check
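
The Work Layer / Output Layer split behind Persona Sandboxing can be sketched roughly as follows. This is a minimal illustration under assumptions, not the behavior of `persona_output.py`: the functions `work_layer` and `output_layer` and the sample persona are hypothetical. The point is that the persona only wraps a finished result and never touches the values.

```python
def work_layer(numbers: list) -> dict:
    # Clean context: no persona text is visible here, only the task.
    return {"count": len(numbers), "total": sum(numbers)}


def output_layer(result: dict, persona: dict) -> str:
    # The persona decorates the finished result; it cannot alter the numbers.
    body = f"count={result['count']}, total={result['total']}"
    return f"{persona['greeting']} {body} {persona['sign_off']}"


# Hypothetical persona, for illustration only.
persona = {"greeting": "Good news, captain:", "sign_off": "Standing by."}

result = work_layer([3, 4, 5])
message = output_layer(result, persona)
print(message)
```

Swapping the persona changes the wording of `message` but leaves `result` identical, which is one way to read "style contamination" being kept away from technical accuracy.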

Inspiration Capital (v0.7.0+) — Scatter card collection for sparking creativity.

python ai_workspace/inspiration_capital/scripts/create_card.py --source "Source" --trigger "Trigger"
python ai_workspace/inspiration_capital/scripts/draw_cards.py --count 3

See ai_workspace/inspiration_capital/README.md for details.

Extended

Text-to-Speech (macOS): Voice feedback via the say command, activated when the user asks for it (for example, "read to me"). Command: python cursor-agent-team/_scripts/tts_speak.py --check
Social Media: Integration with Moltbook. See .cursor/rules/social_media_policy.mdc
Spec-Kit Translator: Converts plans to spec-kit format. Example: /spec_translator PLAN-B-001

Technical Architecture

Hybrid architecture: LLM soft constraints (prompt rules) + script hard constraints (Python). Critical operations use deterministic scripts to validate outputs before committing.

┌─────────────────────────────────────────────────┐
│                    LLM Layer                     │
│   (Soft Constraints: Prompt rules)              │
└────────────────────┬────────────────────────────┘
                     │ Calls
                     ▼
┌─────────────────────────────────────────────────┐
│                  Script Layer                    │
│   (Hard Constraints: Python scripts)             │
│   - validate_topic_tree.py  - preflight_check.py │
│   - cleanup_ai_workspace.py                     │
└─────────────────────────────────────────────────┘
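
As a rough sketch of what a hard constraint in the Script Layer might look like, the example below validates a small tree and commits it only when no violations are found. It is not the logic of `validate_topic_tree.py`, whose actual rules are not described in this document: the node shape (`id`, `parent`), the three rules, and the names `check_tree` and `commit_if_valid` are all assumptions made for illustration.

```python
def check_tree(nodes: list) -> list:
    """Return a list of violations; an empty list means the tree may be committed."""
    errors = []
    seen = set()
    for node in nodes:
        if node["id"] in seen:
            errors.append(f"duplicate id: {node['id']}")
        seen.add(node["id"])
    roots = [n["id"] for n in nodes if n["parent"] is None]
    if len(roots) != 1:
        errors.append(f"expected exactly one root, found {len(roots)}")
    for node in nodes:
        if node["parent"] is not None and node["parent"] not in seen:
            errors.append(f"{node['id']}: unknown parent {node['parent']}")
    return errors


def commit_if_valid(nodes: list, store: list) -> list:
    """Commit only when validation passes; otherwise hand the errors back."""
    errors = check_tree(nodes)
    if not errors:
        store.extend(nodes)
    return errors


store = []
good = [{"id": "root", "parent": None}, {"id": "roles", "parent": "root"}]
bad = [{"id": "root", "parent": None}, {"id": "orphan", "parent": "missing"}]
print(commit_if_valid(good, store))  # []
print(commit_if_valid(bad, store))   # ['orphan: unknown parent missing']
print(len(store))                    # 2
```

The same input always yields the same verdict, which is what separates this layer from prompt rules that the model may or may not follow.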

Architecture highlights: Multi-role + single conversation; Plan-and-Execute; Dedicated agent workspace (context engineering, cognitive artifacts); Hybrid constraints (soft + hard); Phase markers (workflow verification); Command-as-role.

Why This Architecture

We are not behind the Skills wave—we are ahead of it. Our design addresses problems that traditional rules-based and skill-based architectures cannot solve.

Orchestration vs Capability

Approach	Focus	What it solves
Rules	Passive constraints by scope	Code style, conventions—but cannot role-switch or orchestrate workflow
Skills	Capability modules (add-and-use)	Extend what the agent can do—but no workflow model, no join points
Ours	Orchestration-first, methodology-first	How humans and AI collaborate—workflow, role switching, spec-driven execution

We define collaboration workflow; we don't just add capabilities. Command + Rules + Scripts work together: Command defines phases (join points), Rules define aspects, Scripts provide deterministic validation.

Aspect-Oriented Design

Cross-cutting concerns (Gleaning, Wandering, Persona Output, TTS) are woven into the workflow at defined join points—not embedded in core logic. Commands define Phase/Step as join points; Rules define aspects that invoke scripts at those points. Traditional Skill architectures have no workflow model or join points; they cannot achieve this weaving.
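
The weaving idea can be illustrated with a small registry of join points. This is a sketch under assumptions rather than the framework's mechanism (which is expressed in Commands and Rules, not Python): the `aspect` decorator, `run_phase`, and the join point name `after:respond` are hypothetical. Core logic stays unaware of the aspects attached to it.

```python
from collections import defaultdict

# Join point name -> aspects registered to run there.
_aspects = defaultdict(list)


def aspect(join_point: str):
    """Register a function as an aspect for one join point."""
    def register(func):
        _aspects[join_point].append(func)
        return func
    return register


def run_phase(name: str, core, ctx: dict) -> None:
    """Run the core logic, then every aspect woven in at 'after:<name>'."""
    core(ctx)
    for func in _aspects[f"after:{name}"]:
        func(ctx)


@aspect("after:respond")
def persona_output(ctx: dict) -> None:
    ctx["log"].append("persona output applied")


@aspect("after:respond")
def text_to_speech(ctx: dict) -> None:
    # Only fires when the user asked for voice feedback.
    if ctx.get("read_aloud"):
        ctx["log"].append("spoken aloud")


def respond(ctx: dict) -> None:
    # Core logic: contains no reference to persona or TTS.
    ctx["log"].append("core answer written")


ctx = {"log": [], "read_aloud": True}
run_phase("respond", respond, ctx)
print(ctx["log"])
```

Adding or removing an aspect means registering or unregistering a function; `respond` itself does not change.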

Spec-Script Integration

Specification (Command + mdc) drives when and why to call; scripts execute how with deterministic validation. This aligns with "Blueprint First, Model Second" (workflow logic in spec, LLM for bounded tasks) and Formal-LLM (hard constraints via script validation). The spec-script loop—LLM reads spec, runs script, script validates—runs in a single conversation.

Why Cursor

Cursor provides Commands (workflow definition), Rules (aspect definition), and Agent (script execution) in one session. This tight integration enables spec-driven execution and AOP-style weaving.

This binding is intentional. We start with Cursor because:

Its Rules, integrated terminal, and workspace model align naturally with our command–rules–scripts–workspace architecture
It aggregates state-of-the-art models behind a single subscription
Its IDE experience (interface, file tree, workspace semantics) currently leads the space

We prioritize depth on Cursor rather than breadth of platform support. A watered-down, platform-agnostic version would lose the tight spec-script loop that makes our methodology work.

Future ports will be considered only where we can preserve the same methodological guarantees (minimal handoff, HITL, workspace semantics). This is a conscious design choice, not a limitation or oversight.

See cursor-agent-team/_scripts/README.md for script details.

Minimal Handoff in Numbers

Handoff Type	Context Cost	Effect
State-transfer (MAS)	50–200KB compressed state	10–20% context retention (estimate)
Prompt-swap (ours)	1–3KB rule text	Full history preserved

We don't transfer state; we swap masks. The "Writer" knows what the "Planner" discussed because they share the same memory stream.
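
To make the table's figures concrete, the arithmetic below multiplies each per-switch cost by a number of role switches. The per-switch ranges come from the table; the session length of 10 switches and the helper `total_cost_kb` are assumptions for illustration. The calculation covers only the text injected at each switch and says nothing about how much context is retained.

```python
STATE_TRANSFER_KB = (50, 200)  # per handoff, from the table
PROMPT_SWAP_KB = (1, 3)        # per role switch, from the table


def total_cost_kb(per_switch_kb: tuple, switches: int) -> tuple:
    """Scale a (low, high) per-switch cost by the number of switches."""
    low, high = per_switch_kb
    return low * switches, high * switches


switches = 10  # assumed session length, for illustration
print(total_cost_kb(STATE_TRANSFER_KB, switches))  # (500, 2000)
print(total_cost_kb(PROMPT_SWAP_KB, switches))     # (10, 30)
```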

Research Foundation

This framework is grounded in peer-reviewed research:

Concept	Foundation
Intelligence Augmentation	Licklider (1960): man-computer symbiosis
Lost in the Middle	Liu et al. (2023): models use information placed in the middle of long contexts less reliably than information at the start or end
Aspect-Oriented Programming	Kiczales et al. (1997): cross-cutting concerns separation
Retrieval-Augmented Planning	RaDA, RPG: fresh information before synthesis

For a detailed treatment, see our preprint:

cursor-agent-team: A Multi-Role, Single-Conversation Framework for Human-AI Collaboration
Read the paper (PDF)

Built with cursor-agent-team

We used this framework to write its own academic paper—a form of "dogfooding" that subjects the methodology to its own claims.

Five structured plans (PLAN-AA-001 through PLAN-AA-005) executed via /crew and /writer
Minimal handoff in action: the writer role retained full context of design discussions
Phase markers prevented step-skipping; vocabulary bans were enforced via external grep
Result: 4,000-word preprint, 12 pages, ready for arXiv submission
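
The two checks mentioned in the list above, phase markers and vocabulary bans, might look something like the sketch below when done outside the LLM. The project reports using external grep for the vocabulary check; this Python version is a stand-in, and the ban list, the marker format (`PHASE 1` and so on), and both function names are hypothetical.

```python
import re

BANNED = ["revolutionary", "game-changing"]          # hypothetical ban list
REQUIRED_PHASES = ["PHASE 1", "PHASE 2", "PHASE 3"]  # hypothetical marker format


def banned_hits(text: str) -> list:
    """Return banned terms found as whole words, case-insensitively."""
    return [word for word in BANNED
            if re.search(rf"\b{re.escape(word)}\b", text, re.IGNORECASE)]


def phases_in_order(text: str) -> bool:
    """True only if every marker is present and appears in the required order."""
    positions = [text.find(marker) for marker in REQUIRED_PHASES]
    return -1 not in positions and positions == sorted(positions)


draft = "PHASE 1 outline\nPHASE 3 polish\nA revolutionary result."
print(banned_hits(draft))      # ['revolutionary']
print(phases_in_order(draft))  # False, because PHASE 2 is missing
```

Both checks are plain string operations, so a skipped phase or a banned word is caught regardless of what the model claims to have done.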

This self-referential use is a single case, but it offers initial evidence of the framework's practical utility.

Direction

See DIRECTION.md for potential future directions.

We focus on methodology depth over feature breadth. No timeline commitments — the project evolves based on real needs.

Version

Current version: v0.10.13. See CHANGELOG.md.

License

GNU General Public License v3.0 (GPL-3.0). See LICENSE.

Author

thiswind — @thiswind
