ai-memory

Persistent AI memory for any project. Drop a .ai/ directory into any codebase — Claude Code, Cursor, Windsurf, or any other AI tool reads it at session start and carries knowledge forward.

```sh
npm install @radix-ai/ai-memory
npx @radix-ai/ai-memory init
```

The problem

AI coding assistants forget everything between sessions. Decisions made last week, bugs already fixed, patterns that work — gone on every restart. You re-explain the stack, re-discover the same issues, watch the AI repeat the same mistakes.

What this does

ai-memory gives any AI tool persistent memory by maintaining a structured .ai/ directory in your project. The AI reads it at session start. A local MCP server gives it structured tools to search and write memory. A governance layer enforces project rules in code, not prompts.


Install

The install command sets up three things: context loading (the AI reads .ai/ at session start), MCP server (structured memory tools), and skills (slash commands like /mem-compound).

| Tool | Install | Skills |
|---|---|---|
| Cursor | `/add-plugin ai-memory` | `/mem-compound`, `/mem-session-close`, `/mem-validate`, `/mem-init` |
| Claude Code | `/plugin install ai-memory` | Same slash commands via plugin |
| Antigravity | `npx @radix-ai/ai-memory install --to antigravity` | Ask the AI: "run the compound protocol" |
| VS Code + Copilot | `npx @radix-ai/ai-memory install --to copilot` | Paste SKILL.md content into chat |
| Any other tool | Paste the bootstrap instruction into the system prompt | Paste SKILL.md content |

CLI only

```sh
npx @radix-ai/ai-memory init            # scaffold .ai/
npx @radix-ai/ai-memory install --to cursor  # install for your tool
```

Quick start

`install` runs `init` automatically if `.ai/` is missing, so you can run the two commands in either order.

```sh
# 1. Install for your tool (scaffolds .ai/ if needed)
npx @radix-ai/ai-memory install --to cursor    # or claude-code, antigravity, copilot

# 2. Fill in what your project is
# Edit .ai/IDENTITY.md (what the project is) and .ai/PROJECT_STATUS.md (current focus)

# 3. Restart your AI tool (or start a new chat)

# 4. At the end of a session with real learning:
/mem-compound
```

Or run init first to scaffold .ai/ before installing.

Verify

```sh
npx @radix-ai/ai-memory verify
```

Checks: .ai/ structure, bootstrap installed, MCP configured, harness validity, rule coverage, memory index populated.

Or manually in a session: ask "Call search_memory with query 'test'" (confirms MCP). Then "What does .ai/IDENTITY.md say?" (confirms context loading).

Troubleshooting

| Issue | What to do |
|---|---|
| `'ai-memory' is not recognized` | You're in the ai-memory dev repo. Run `npm run build`, then `node dist/cli/index.js init` and `node dist/cli/index.js install --to cursor`. For published users, use `npx @radix-ai/ai-memory`; the package is pre-built. |
| MCP tools not available | `install` scaffolds `.ai/` if missing. Restart Cursor (or start a new chat) after install. |
| Cursor not picking up MCP | Cursor reads `.cursor/mcp.json`. The install writes it for Cursor. If you added it manually, ensure the config is in `.cursor/mcp.json` (not `.mcp.json` at the project root). |
| `Cannot find module '.ai/mcp-launcher.cjs'` | Run `npx @radix-ai/ai-memory install --to cursor` first; it creates `.ai/` and the launcher. |
| `'ai-memory' is not recognized` (Windows) | The launcher uses `cmd /c` on Windows. If it still fails, delete `.cursor/mcp.json` and run `install --to cursor` again to refresh the launcher. |
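For orientation, a manual Cursor config would look roughly like the sketch below. This is an assumption, not the installer's verbatim output — only the file location (`.cursor/mcp.json`) and the launcher path (`.ai/mcp-launcher.cjs`) come from the table above; the generated file may differ.

```json
{
  "mcpServers": {
    "ai-memory": {
      "command": "node",
      "args": [".ai/mcp-launcher.cjs"]
    }
  }
}
```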

What gets created

```text
.ai/
├── IDENTITY.md             — What this project is; constraints for the AI
├── PROJECT_STATUS.md       — Current focus, open questions, what's working
├── memory/
│   ├── decisions.md        — Architectural decisions [P0/P1/P2 tagged]
│   ├── patterns.md         — Reusable patterns and anti-patterns
│   ├── debugging.md        — Non-obvious bugs with root cause and fix
│   ├── improvements.md     — Incremental improvements over time
│   └── memory-index.md     — Auto-generated priority index
├── agents/                 — Agent methodology files
├── skills/                 — Project domain knowledge
├── toolbox/                — General tech knowledge
├── rules/                  — Behavioral constraints
├── sessions/
│   ├── open-items.md       — Live registry of open tasks
│   └── archive/
│       └── thread-archive.md
└── reference/
    └── PROJECT.md          — Architecture (loaded on demand only)
```
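Memory files hold priority-tagged entries. As a purely hypothetical illustration (the entry format is the one shown under Governance below), a `decisions.md` entry might read:

```markdown
### [P1] Use feature flags for risky rollouts

**Context:** Two incidents traced back to big-bang releases.
**Decision:** Gate new surfaces behind flags; remove each flag after two stable releases.
```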

Add governance, evals, docs schema, and ACP with --full:

```sh
npx @radix-ai/ai-memory init --full
```

Creates: acp/, docs-schema.json, rules/doc-placement.md, agents/docs-manager.md.


Skills

| Skill | When to use |
|---|---|
| `/mem-compound` | End of any session with real learning |
| `/mem-session-close` | End of a short/exploratory session |
| `/mem-init` | First-time project setup |
| `/mem-validate` | Before a risky change (Full tier) |
| `/mem-auto-review` | Automated PR review (Bugbot, CI, automations) |
| `/browser` | Browser automation (screenshots, navigate, interact) |
| `/screen-capture` | Desktop/app window screenshot for vision analysis |
| `/desktop-automation` | Desktop UI automation (mouse, keyboard, OCR) |

How to invoke skills

Skills are stored in .ai/skills/ (canonical) with stubs in your tool's skills directory (e.g., .cursor/skills/, .claude/skills/, .agents/skills/ for Antigravity). How you invoke them depends on your tool:

| Tool | How to invoke | Example |
|---|---|---|
| Cursor | Type `/` in chat to see available skills | `/mem-compound` |
| Claude Code | Say "run" followed by the skill name | `run mem-compound` |
| Windsurf / Cline | Ask the agent to follow the skill | "Run the mem-compound protocol" |
| Copilot | Paste the skill content from `.ai/skills/mem-compound/SKILL.md` | Manual |
| Any tool with MCP | The agent can read `.ai/skills/` and follow instructions | "Follow .ai/skills/mem-compound/SKILL.md" |
If your tool doesn't discover skills automatically, just tell the agent: "Read .ai/skills/mem-compound/SKILL.md and follow the steps."

Project-specific compound

`/mem-compound` runs the standard steps (scan, conflict check, update status, archive, sync) plus project-specific doc updates. The skill maps session work to domains (UI, Backend/API, AI/ML, Architecture, Backlog) and updates docs via `get_doc_path` and `validate_doc_placement`. Open items may be recorded broadly or by category, but work must be broken down into atomic tasks that fit RALPH loops and don't conflict when agents run in parallel. With `init --full`, `.ai/docs-schema.json` defines canonical paths and naming (SCREAMING_SNAKE by default).
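The schema file's exact shape isn't documented here, so the following is only an illustrative guess. What's confirmed above is that each doc type resolves to a canonical path and a naming pattern (SCREAMING_SNAKE by default); the field names in this sketch are assumptions.

```json
{
  "naming": "SCREAMING_SNAKE",
  "docTypes": {
    "architecture": { "path": "docs/architecture/", "pattern": "*_DESIGN.md" },
    "backlog": { "path": "docs/backlog/", "pattern": "*_TASKS.md" }
  }
}
```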


CLI

```sh
ai-memory init [--full] [--download-model]  # Scaffold .ai/. Use --download-model to pre-fetch hybrid search model (~23MB).
ai-memory install --to <tool>    # Bootstrap for cursor, claude-code, antigravity, copilot
ai-memory mcp                    # Start MCP server (stdio)
ai-memory mcp --http --port 3100 # Start MCP server (HTTP, for cloud agents)
ai-memory validate               # Validate all .ai/ files
ai-memory validate-docs          # Validate doc placement against .ai/docs-schema.json (staged or --paths)
# Pre-commit: add `ai-memory validate-docs` to validate new docs before commit (e.g. via husky)
ai-memory index                  # Regenerate memory-index.md from memory files
ai-memory fmt                    # Auto-format YAML frontmatter
ai-memory eval [--json]          # Memory health report
ai-memory prune [--dry-run]      # Review stale entries
ai-memory generate-harness       # Compile rule set from [P0] entries
ai-memory verify [--json]        # Verify full installation chain (.ai/, bootstrap, MCP, harness)

# Extensibility
ai-memory agent create <name>    # Scaffold a new agent
ai-memory skill create <name>    # Scaffold a new skill
ai-memory rule create <name>     # Scaffold a new rule
ai-memory eval add <name>        # Add a custom eval metric
```
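For the pre-commit hook mentioned above, a minimal husky setup could look like this (the hook contents are an example, not something the package generates):

```sh
# .husky/pre-commit: validate doc placement for staged files before every commit
npx @radix-ai/ai-memory validate-docs
```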

Configuration

| Env var | Default | Description |
|---|---|---|
| `AI_DIR` | `./.ai` | Path to the `.ai/` directory |
| `AI_SEARCH` | `hybrid` | Search mode: `keyword` (TF only, no model), `semantic` (vector only), or `hybrid` (keyword + semantic + RRF). Use `keyword` for faster startup or constrained environments. |
| `AI_SEARCH_WASM` | unset | Set to `1` to prefer the WASM backend (onnxruntime-web) over native. Use on Windows when onnxruntime-node fails. Slower but cross-platform. |
| `AI_MODEL_PATH` | unset | Local path to the embedding model for air-gapped/CI use. When set, disables remote model downloads. The model must be in HuggingFace format under this path. |

Platform note: Semantic/hybrid search uses onnxruntime-node on Linux/macOS. On Windows, onnxruntime-node may be unavailable; set AI_SEARCH=keyword for keyword-only, or AI_SEARCH_WASM=1 to try WASM.
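For example, to run the MCP server in constrained environments (POSIX shell syntax shown; on Windows, set the variable via PowerShell's `$env:` instead):

```sh
# Keyword-only search: no model download, fastest startup
AI_SEARCH=keyword npx @radix-ai/ai-memory mcp

# Windows fallback: prefer the WASM backend over onnxruntime-node
AI_SEARCH_WASM=1 npx @radix-ai/ai-memory mcp
```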


Governance (Full tier)

Tag decisions [P0] and add a constraint_pattern to enforce them in code:

````markdown
### [P0] No external auth libraries

**Context:** ...
**Decision:** Use the internal auth module only.

```yaml
constraint_pattern:
  type: ast
  language: typescript
  pattern: "import $_ from '$LIB'"
  where:
    LIB:
      regex: "passport|jsonwebtoken|bcrypt"
  path: "src/**/*.ts"
```
````

Then:

```sh
ai-memory generate-harness    # Compile .ai/temp/harness.json
```

validate_context in /mem-compound will hard-block any commit that violates a [P0] rule.

Harness features

- Path filtering: rules apply only to files matching `path` (minimatch glob). Example: `path: "src/**/*.ts"` skips `tests/`.
- AST constraints: use `where` to filter by meta-variable regex (e.g. path traversal checks).
- Deletion-aware regex: set `scope: "deletions"` to catch removal of protected content (e.g. `[P0]` markers); see the sketch below.
- Stability Certificate: on success, returns an audit log of rules checked; on failure, a detailed violation report.
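A deletion-aware rule might be written like the sketch below. This is a guess: only `scope: "deletions"` and the fields in the governance example above are confirmed, and the regex-style `type` here is an assumption.

```yaml
constraint_pattern:
  type: regex              # assumed type name for non-AST patterns
  pattern: "\\[P0\\]"      # protect [P0] markers from being removed
  scope: "deletions"       # match against deleted lines in the diff
  path: ".ai/memory/**/*.md"
```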

Multi-agent & iterative loops

RALPH loops (iterative self-improvement)

PROJECT_STATUS.md is writable by default (writable: true in frontmatter). This means:

  1. Agent reads PROJECT_STATUS.md, picks a task, does work
  2. Agent updates PROJECT_STATUS.md with what it learned
  3. Agent exits (or session ends)
  4. Next iteration reads the updated PROJECT_STATUS.md
  5. Natural convergence through iteration

This follows the autoresearch pattern and the Ralph Wiggum approach: the status file on disk is the shared state.
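A driver for such a loop can be as simple as a shell `while`. This is illustrative only: the `claude -p` non-interactive invocation is one option, and any agent CLI works.

```sh
# Each iteration is a fresh session; PROJECT_STATUS.md on disk is the only
# shared state carried between iterations.
while true; do
  claude -p "Read .ai/PROJECT_STATUS.md, pick one open task, complete it,
then update PROJECT_STATUS.md with what you learned."
done
```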

Concurrent agents (cloud agents, worktrees, background tasks)

`commit_memory` uses claim-based locking (see the conceptual sketch below):

- Before writing, the agent acquires a claim on the target path
- If another agent holds an active claim (5-minute TTL), the write is rejected
- Claims auto-expire, so crashed agents don't permanently lock files
- Each write includes a `session_id` header for traceability

This works in:

- Cursor Cloud Agents: `.ai/` is in the git clone; the MCP server starts per agent
- Claude Code worktrees: `.ai/` is copied to the worktree; run `/mem-compound` before exit to persist
- Claude Code sandbox: MCP runs outside the sandbox boundary, so all memory tools work
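Conceptually, the claim check described above behaves like this TypeScript sketch. It is not the package's implementation; the names and in-memory storage are assumptions, and only the TTL, rejection, and expiry semantics come from the list above.

```typescript
interface Claim {
  sessionId: string;
  expiresAt: number; // epoch milliseconds
}

const CLAIM_TTL_MS = 5 * 60 * 1000; // the 5-minute TTL described above
const claims = new Map<string, Claim>(); // keyed by target path

function acquireClaim(path: string, sessionId: string): boolean {
  const existing = claims.get(path);
  const now = Date.now();
  // An expired claim is treated as free, so a crashed agent never locks a file forever.
  if (existing && existing.expiresAt > now && existing.sessionId !== sessionId) {
    return false; // another agent holds an active claim: reject the write
  }
  claims.set(path, { sessionId, expiresAt: now + CLAIM_TTL_MS });
  return true;
}
```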

Immutability model

| File | Default | Control |
|---|---|---|
| `IDENTITY.md` | Immutable | Set `writable: true` in frontmatter to allow AI writes |
| `PROJECT_STATUS.md` | Writable | Set `writable: false` in frontmatter to lock |
| `toolbox/`, `acp/`, `rules/` | Always immutable | Structural; no override |
| Everything else | Writable | Via the `commit_memory` tool |
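The frontmatter toggle is a single key. For example, to lock `PROJECT_STATUS.md` (assuming no other frontmatter keys are required):

```markdown
---
writable: false   # lock this file against AI writes
---
```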

MCP server

The MCP server starts automatically via .mcp.json. Tools exposed:

| Tool | What it does |
|---|---|
| `search_memory` | Hybrid search across `.ai/` (keyword + semantic + RRF). Params: `limit`, `include_deprecated` |
| `get_memory` | Summary of a specific topic |
| `commit_memory` | Write to `.ai/` with immutability + claim-based locking |
| `get_open_items` | Return `open-items.md` |
| `prune_memory` | Identify stale entries |
| `get_repo_root` | Git repo root path (for path resolution when the agent runs from a subdirectory) |
| `validate_context` | Check the git diff against `[P0]` rules; returns a Stability Certificate or a violation report |
| `validate_schema` | Validate memory entry frontmatter |
| `generate_harness` | Compile `harness.json` from `[P0]` entries |
| `get_evals` | Return the latest eval report |
| `claim_task` | Claim a task to prevent duplicate work across agents |
| `publish_result` | Publish a task result (success/failure) to the archive |
| `sync_memory` | Git-commit `.ai/` changes (essential for ephemeral environments) |
| `get_doc_path` | Resolve the canonical path for a doc type (use before creating docs) |
| `validate_doc_placement` | Validate a doc path against `.ai/docs-schema.json` |
| `list_doc_types` | List doc types with path and pattern |
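MCP clients issue these calls automatically, but for orientation, a raw `tools/call` request for `search_memory` looks like this (standard MCP JSON-RPC; the argument names follow the table above, and the query string is an example):

```json
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "search_memory",
    "arguments": { "query": "auth decisions", "limit": 5 }
  }
}
```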

HTTP transport (for cloud agents)

```sh
AI_MEMORY_AUTH_TOKEN=secret ai-memory mcp --http --port 3100
```

| Env var | Purpose |
|---|---|
| `AI_MEMORY_AUTH_TOKEN` | Bearer token for HTTP auth (optional; no auth when unset) |
| `AI_MEMORY_CORS_ORIGINS` | Allowed origins, comma-separated (default: `*`) |
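A quick smoke test against the HTTP transport might look like the following. The `/mcp` endpoint path and the `Accept` header are assumptions based on common MCP streamable-HTTP setups, not documented here; check your deployment.

```sh
curl -X POST http://localhost:3100/mcp \
  -H "Authorization: Bearer secret" \
  -H "Content-Type: application/json" \
  -H "Accept: application/json, text/event-stream" \
  -d '{"jsonrpc":"2.0","id":1,"method":"tools/list"}'
```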

Context7 MCP (bundled)

.mcp.json includes the Context7 MCP server for up-to-date docs (Cursor, Claude Code, Windsurf, etc.). Works without an API key; set CONTEXT7_API_KEY for higher rate limits.


Evals

```sh
ai-memory eval
```

Reports: rule coverage, session cadence, frontmatter coverage, open items, deprecated ratio, memory depth, session count, memory freshness, hook coverage, skill discoverability, cloud readiness, automation readiness, integration coverage.

Add custom metrics:

```sh
ai-memory eval add my-metric
# Edit .ai/temp/custom-evals/my-metric.ts
```

ACP (Full tier)

.ai/acp/manifest.json declares this agent's capabilities to ACP-aware orchestrators like acpx. The MCP server is the transport layer; ACP is the identity layer.


Design

- Environment adaptation: spec-driven detection and injection, implemented via `detectEnvironments()` in `src/cli/environment.ts` and `install --capability`.

License

MIT
