Forge

Describe your idea. AI builds it.

Forge is an open-source CLI that turns a markdown spec into a working codebase using a pipeline of specialized LLM agents. No prompt engineering, no copy-pasting — just forge build.

forge new my-app -t web-app
# Edit .forge/spec.md with your idea
forge build

The Problem

Going from idea to working code is slow. You spend hours on boilerplate, wiring up APIs, configuring CI/CD, and fixing the mismatches between frontend and backend that inevitably happen when one person (or one LLM prompt) tries to hold the full picture.

Copy-pasting ChatGPT output into files is tedious and error-prone. Existing code generators produce scaffolds, not working applications. And single-agent LLM tools hit context limits, hallucinate API contracts, and can't self-correct.

What Forge Does Differently

Multiple specialist agents that actually collaborate. A planner picks the right stack. A project manager writes focused instructions for each agent. Backend, frontend, CI/CD, and deploy agents run in parallel. A security agent audits the output. A reviewer catches bugs and sends fixes back to the agent that wrote the code — not a generic fixer.

Structured contracts, not prose. The backend agent outputs machine-readable API contracts (endpoints, request/response schemas, auth requirements). The frontend agent receives these contracts and generates code that matches exactly. No more POST /api/users vs POST /users mismatches.

Real impact:

A markdown spec becomes a working project in one command — API routes, DB models, React frontend, CI/CD pipeline, deployment config, security audit, and code review
Frontend/backend API contracts are validated automatically before the reviewer runs
Independent agents (backend, CI, deploy) run in parallel, cutting build time
Incremental builds (--feature "add dark mode") know what endpoints already exist
Every file write goes through a security firewall — no eval(), no path traversal, no hardcoded secrets

System Design at a Glance

                     .forge/spec.md
                           |
                     +-----------+
                     |  Planner  |  Picks stack: Rails / FastAPI / Go / Phoenix / ...
                     +-----+-----+
                           |
                   +-------+--------+
                   | Project Manager|  Assigns tasks to specialist agents
                   +-------+--------+
                           |
          +----------------+----------------+
          |                |                |         (parallel)
    +-----------+   +-----------+   +-----------+
    |  Backend  |   |    CI     |   |  Deploy   |
    | API routes|   | Actions,  |   | Railway / |
    | DB models |   | Dockerfile|   | Vercel    |
    +-----------+   +-----------+   +-----------+
          |
          | API contracts (structured JSON)
          v
    +-----------+
    | Frontend  |   Reads contracts, generates matching UI
    +-----------+
          |
    +-----------+
    | Security  |   OWASP audit on all code from ArtifactBus
    +-----------+
          |
    +-----------+
    | Reviewer  |   Validates, then routes fixes to original agent
    +-----------+

Key infrastructure:

Layer	What it does
ArtifactBus	Thread-safe shared store. Agents publish typed artifacts (code, decisions, reviews). Consumers query by path, agent, or type. Source of truth during the build.
ContractRegistry	Structured API/model/event contracts extracted from backend output. Frontend gets exact endpoint shapes. Security gets auth coverage checks. Persisted to `.forge/contracts.json` for incremental builds.
Parallel Scheduler	Computes dependency graph from agent assignments. Runs independent tasks concurrently (ThreadPoolExecutor). Same-agent tasks stay sequential.
A2A Protocol	Google's Agent-to-Agent spec. In-process by default (no HTTP). Run `forge agents start` to expose each as a standalone HTTP server.
Agentic Firewall	Every file write checked against path allowlist, blocklist, and content patterns. Audit log at `.forge/firewall_audit.log`.
Providers	Anthropic (default), OpenAI, Together, Groq, Ollama. Swap with `--provider` flag.

Quick Start

1. Install

pip install forge-ai                # classic mode (minimal deps)
pip install "forge-ai[adk]"        # multi-agent parallel mode (recommended)

Or from source:

git clone https://github.com/nebullii/forge
cd forge
pip install -e ".[adk]"

2. Set your API key

export ANTHROPIC_API_KEY=sk-ant-...
# OR
export OPENAI_API_KEY=sk-...
# OR
export TOGETHER_API_KEY=...
# OR run Ollama locally (no key needed): ollama serve

3. Create a project

forge new my-app -t web-app       # from template
# OR
forge new my-app                  # interactive picker

4. Write your spec

Edit .forge/spec.md — describe what you want in plain English:

cd my-app
nano .forge/spec.md          # or: vim, code, open — any text editor

Example spec:

# Project: Task Tracker

## What
A web app for managing personal tasks with user accounts.

## Features
- Email + password login (JWT)
- Create, complete, and delete tasks
- Dashboard showing pending vs completed counts

## Stack
React + Vite, FastAPI, SQLite

## Non-goals
No team features. No notifications.

5. Build

forge build                       # classic mode (sequential, works with any provider)
forge build --adk                 # multi-agent mode (parallel, 7 specialists)
forge build -p openai             # use a specific provider
forge build -v                    # verbose output (see agent tool calls)

6. Run locally

forge dev                         # auto-detects project type, starts dev server

7. Add features incrementally

forge build --feature "add dark mode toggle"
forge build --feature "add JWT refresh tokens"

Contracts from the previous build are loaded automatically so agents know what endpoints already exist.

System Design

High-Level Architecture

┌─────────────────────────────────────────────────────────────────┐
│  User                                                           │
│    forge build                                                  │
└────────────────────────┬────────────────────────────────────────┘
                         │
┌────────────────────────▼────────────────────────────────────────┐
│  CLI  (src/cli.py)                                              │
│    Reads .forge/spec.md + .forge/rules.md                       │
│    Scans for suspicious patterns                                │
│    Delegates to BuildOrchestrator                               │
└────────────────────────┬────────────────────────────────────────┘
                         │
┌────────────────────────▼────────────────────────────────────────┐
│  BuildOrchestrator  (src/orchestrator.py)                       │
│    Manages build phases, task state, and file writes            │
│    Enforces AgenticFirewall on every write                      │
│    Persists state to .forge/build-state.yaml after each task    │
│                                                                 │
│    classic mode              ADK mode (--adk)                   │
│    ────────────              ─────────────────                  │
│    PlannerAgent              ForgeADKOrchestrator               │
│    ProjectManagerAgent       (Google ADK LlmAgent)              │
│    CoderAgent × N tasks      7 specialized agents               │
│    ReviewerAgent             via A2A protocol                   │
└────────────────────────┬────────────────────────────────────────┘
                         │
        ┌────────────────┼────────────────┐
        ▼                ▼                ▼
  Provider Layer    Agent Layer      State Layer
  (src/providers/)  (src/agents/)    (src/state.py)
  Anthropic         BaseAgent        BuildState
  OpenAI-compat     8 agents         TaskState
  Ollama            A2A hooks        YAML persistence

Classic Mode Pipeline

Dependency-aware parallel pipeline. Tasks assigned to independent agents run concurrently; same-agent tasks run sequentially. Falls back to fully sequential when all tasks target the same agent.

.forge/spec.md
      │
      ▼
┌─────────────────┐
│  PlannerAgent   │  Analyzes spec → chooses stack → ordered task list
└────────┬────────┘
         │  {decisions, tasks[]}
         ▼
┌─────────────────────┐
│ ProjectManagerAgent │  Assigns each task to the right specialized agent
│                     │  Generates a self-contained prompt per task
└────────┬────────────┘
         │  tasks[].agent + tasks[].prompt
         ▼
┌──────────────────────────────────────────────────┐
│  Parallel Scheduler (src/scheduler.py)           │
│                                                  │
│  Builds dependency graph from agent assignments: │
│    coder → {backend, ci, deploy} → frontend → …  │
│                                                  │
│  Independent tasks run in ThreadPoolExecutor:    │
│    agent = route(task.agent)                     │
│    response = agent.invoke(task.prompt)          │
│    files → firewall → disk + ArtifactBus        │
│    backend response → ContractRegistry           │
│    frontend gets contracts + backend code         │
└────────┬─────────────────────────────────────────┘
         │
         ▼
┌────────────────────────────────────────┐
│  ReviewerAgent                        │
│    Reads all code from ArtifactBus    │
│    Contract validation (auto)         │
│    Error → rework by original agent   │
│    (one retry per file, then done)    │
└────────────────────────────────────────┘

ADK Mode Pipeline

Seven specialized agents coordinated by a Google ADK LlmAgent orchestrator. Independent agents run in parallel via a thread pool.

.forge/spec.md
      │
      ▼
┌────────────────────────────────────────────────────────────────┐
│  ForgeADKOrchestrator  (Google ADK LlmAgent)                   │
│                                                                │
│  The orchestrator is an LLM that decides which tools to call.  │
│  Each tool is an A2A call to a specialized agent.              │
└──────────┬─────────────────────────────────────────────────────┘
           │
           │  Step 1 [sequential]
           ▼
    ┌──────────────┐
    │ PlannerAgent │  spec → task plan + tech stack decisions
    └──────┬───────┘
           │
           │  Step 2 [sequential]
           ▼
    ┌─────────────────────┐
    │ ProjectManagerAgent │  plan → per-task agent assignments + prompts
    └──────┬──────────────┘
           │
           │  Step 3 [parallel — ThreadPoolExecutor]
           ├──────────────────────┬───────────────────┐
           ▼                      ▼                   ▼
    ┌─────────────┐      ┌──────────────┐    ┌──────────────┐
    │ BackendAgent│      │   CIAgent    │    │  DeployAgent │
    │ API, DB,    │      │ CI workflows,│    │ Railway /    │
    │ services    │      │ Dockerfile   │    │ Render /     │
    └──────┬──────┘      └──────────────┘    │ Vercel       │
           │                                 └──────────────┘
           │  Step 4 [sequential — needs backend API contracts]
           ▼
    ┌───────────────┐
    │ FrontendAgent │  React/Svelte/Vue UI matching backend endpoints
    └──────┬────────┘
           │
           │  Step 5 [sequential — audits all code]
           ▼
    ┌───────────────┐
    │ SecurityAgent │  OWASP audit, secret detection, patch generation
    └──────┬────────┘
           │
           │  Step 6 [sequential — reviews everything]
           ▼
    ┌───────────────┐
    │ ReviewerAgent │  Cross-file correctness, contract validation
    └──────┬────────┘
           │
           │  Step 7 [conditional — rework errors]
           ▼
    rework_file(agent, path, issue, ...)
           │  Sends each error back to its original agent
           │  One retry per file, then done
           └────────────────────────────────────────────

How agents share data (ArtifactBus + ContractRegistry):

Backend publishes CodeArtifact + structured ApiEndpointContract / DataModelContract
Frontend receives the full contracts and relevant backend code (not just filenames)
Security and Reviewer read all code content from the bus
After backend, contract validation runs automatically: "does the frontend call endpoints that exist?" — mismatches are flagged to the reviewer

Why this ordering is parallel-safe: Backend, CI, and Deploy all depend only on the planner's decisions. They don't read each other's output, so they can safely run concurrently. Frontend needs the backend's API contracts. Security audits the application code. Reviewer sees everything last. All shared state goes through the thread-safe ArtifactBus (RLock-protected).

A2A Protocol

In ADK mode, the orchestrator talks to each agent using Google's Agent-to-Agent (A2A) open protocol.

Orchestrator
    │
    │  A2AClient.send_task(Task)
    ▼
Agent
    │
    │  handle_a2a_task(task) → TaskResult
    │
    ├── Task.message.parts     — prompt text
    ├── Task.context           — decisions, spec, backend files, etc.
    │
    └── TaskResult.artifacts
          ├── type="text"      — raw LLM response
          └── type="files"     — generated (path, content) pairs

By default, all agents run in-process — no network overhead, no server management. Run forge agents start to expose each agent as a real HTTP server on its own port (useful for distributed builds or debugging individual agents).

Collaboration Layer

Agents don't just run in sequence — they share structured data through a collaboration layer that ensures frontend/backend contract alignment, thread-safe parallel execution, and targeted feedback when the reviewer finds issues.

┌──────────────────────────────────────────────────────────────────────────┐
│                        ArtifactBus (thread-safe)                        │
│                                                                         │
│  CodeArtifact ──────── DecisionArtifact ──────── ReviewArtifact         │
│  (path, content,       (key, value,              (passed, issues,       │
│   producer, version)    producer, reasoning)       target_path)          │
│                                                                         │
│  ContractArtifact ──── BuildLogArtifact ──────── ReworkRequestArtifact  │
│  (type, data,          (message, level,           (target_path,          │
│   producer)             producer)                  original_agent, issue) │
└────────────────────────────────┬─────────────────────────────────────────┘
                                 │ publish() / query()
     ┌───────────────────────────┼───────────────────────────┐
     │                           │                           │
     ▼                           ▼                           ▼
 Planner                    Backend                    Frontend
 publishes                  publishes                  reads contracts +
 DecisionArtifacts          CodeArtifacts +            backend code from bus,
                            contracts to registry      generates matching UI

ArtifactBus (src/collaboration/artifact_bus.py) — Thread-safe store for all generated content. Backed by threading.RLock. Agents publish artifacts; consumers query by path, task, agent, or type. The bus is the source of truth — not files on disk or tuple lists.

ContractRegistry (src/collaboration/contracts.py) — Structured API/model/event contracts extracted from backend output. The frontend agent gets exact endpoint shapes (method, path, request/response schema, auth). The security agent gets pre-validated auth coverage. The reviewer gets automated frontend-vs-backend mismatch detection.

Contract extraction works two ways:

Explicit — the backend agent outputs a contracts JSON block (preferred)
Fallback — regex extraction from FastAPI/Flask/Express/Rails route decorators

Contracts persist to .forge/contracts.json after each build, so incremental builds (forge build --feature "...") know what endpoints already exist.

Feedback Loop

When the reviewer or security agent finds issues, fixes route back to the original specialist — not a generic coder:

ReviewerAgent
    │  "app/routes/users.py has SQL injection"
    │
    ▼  lookup: who wrote this file?
ArtifactBus.latest("app/routes/users.py")
    │  → producer_agent = "backend"
    │
    ▼  route fix to backend specialist
BackendAgent.invoke("Fix this SQL injection in ...")
    │
    ▼  publish fixed version (version + 1)
ArtifactBus.publish(CodeArtifact(version=2, ...))

Capped at one retry per file to prevent infinite loops. Tracked via ReworkRequestArtifact for observability.

Parallel Task Scheduling

Classic mode now runs independent tasks concurrently when the plan spans multiple agent types. The scheduler computes a dependency graph from agent assignments:

Task dependencies (computed automatically):
─────────────────────────────────────────
  coder tasks    → no dependencies (run first)
  backend tasks  → wait for coder
  ci tasks       → wait for coder         ┐
  deploy tasks   → wait for coder         ├── run in parallel
  backend tasks  → (also parallel w/ ci)  ┘
  frontend tasks → wait for backend
  security tasks → wait for backend + frontend

Same-agent tasks run sequentially (in plan order). Cross-agent tasks respect the dependency rules above. Falls back to sequential when all tasks use the same agent.

Layer Map

src/
  cli.py                    — Entry point. All forge commands.
  orchestrator.py           — Build pipeline: both modes, parallel scheduler.
  scheduler.py              — Dependency-aware parallel task scheduler.
  config.py                 — Provider config (~/.forge/config.yaml).
  state.py                  — Resumable build state (schema-versioned YAML).
  context.py                — Token-budgeted project context assembly.

  collaboration/
    models.py               — Typed artifact models (CodeArtifact, ReviewArtifact, ...).
    artifact_bus.py         — Thread-safe shared artifact store (RLock-backed).
    contracts.py            — ContractRegistry + extraction + persistence.

  providers/
    base.py                 — BaseProvider ABC + exponential backoff retry.
    anthropic.py            — Anthropic Claude.
    openai_compat.py        — OpenAI / Together / Groq / any OpenAI-compatible.
    ollama.py               — Local Ollama models.

  agents/
    base.py                 — BaseAgent: invoke, extract_files, write_files, A2A hooks.
    planner.py              — Spec → task plan. Framework-agnostic stack selection.
    project_manager.py      — Plan → per-task agent assignments and prompts.
    coder.py                — General-purpose file generation (classic mode fallback).
    reviewer.py             — Code validation, severity classification, auto-fix.
    backend.py              — API routes, DB models, service layer + contract output.
    frontend.py             — React/Svelte/Vue components, contract-aware API integration.
    security_agent.py       — OWASP Top 10 audit, secret detection, patch generation.
    ci_cd.py                — GitHub Actions, Dockerfile, docker-compose.
    deploy.py               — Railway / Render / Vercel / Fly.io configs.

  a2a/
    types.py                — Pydantic models: Task, TaskResult, Artifact, AgentCard.
    client.py               — A2AClient: in-process or HTTP transport.
    server.py               — FastAPI A2A server factory (per agent).

  adk/
    llm_bridge.py           — Wraps BaseProvider as Google ADK BaseLlm.
    agent_runner.py         — ADKAgentRunner: bridges LlmAgent ↔ A2A protocol.
    orchestrator_agent.py   — Root ADK LlmAgent + tool routing.
    tools.py                — Tool functions + BuildContext (bus + registry + state).

  security/
    firewall.py             — AgenticFirewall: policy enforcement + audit log.

Build Modes

Classic Mode

forge build

Three-phase pipeline with dependency-aware parallel execution. Works with any LLM provider including local Ollama models. Independent tasks (e.g., backend + CI + deploy) run concurrently when the plan spans multiple agent types.

When to use: Standard projects, limited API budget, local models, or when you want the simplest setup with no extra dependencies.

ADK Mode

forge build --adk

Seven specialized agents coordinated by a Google ADK orchestrator. Backend, CI, and Deploy run in parallel. Each agent has a focused system prompt for its domain. Requires the adk extras package.

When to use: Complex projects with distinct frontend/backend/infra concerns, when build speed matters, or when you want domain-specialized code generation.

Incremental Mode

forge build --feature "add user authentication with JWT"
forge build --feature "add dark mode toggle to the settings page"

Plans only the tasks needed for the new feature. Reads the existing project files as context so the new code integrates correctly with what's already there.

Agent Reference

PlannerAgent

Analyzes the spec and produces a structured build plan.

Selects the technology stack based on the project type — no hardcoded defaults
Chooses between Rails, Django, FastAPI, Go, Phoenix, HTMX, React, SvelteKit, etc.
Breaks the project into 3-8 focused tasks ordered by dependency
Outputs YAML: decisions (stack, architecture, reasoning) + tasks[]

Stack selection logic:

Project type	Likely stack
Single-page tool / utility	Plain HTML + CSS + JS (one file)
Full-stack web app (CRUD, admin, forms)	Rails, Django, Laravel
Real-time features (chat, live updates)	Phoenix (Elixir)
Complex interactive SPA	React + Vite, SvelteKit
Simple web app, light interactivity	HTMX + FastAPI/Flask
API-only backend	FastAPI, Go (Gin/Chi), Rust (Axum)
CLI tool	Python (Click/Typer), Go (cobra)
Data pipeline or ML	Python + Pandas/SQLAlchemy
Static site or docs	Plain HTML, 11ty, Hugo

Not supported yet: Native mobile apps (iOS/Android). Forge can't generate Xcode projects or Gradle builds. For mobile-like experiences, use a PWA (Progressive Web App).

ProjectManagerAgent

Sits between the Planner and the executor. Takes the raw task list and enriches each task with:

agent — which specialized agent should execute this task
prompt — a self-contained, context-rich prompt for that agent including stack details, API contracts from upstream tasks, and exact file targets
contracts — the API interfaces this task exposes (consumed by dependent tasks)

Agent routing:

Task type	Assigned agent
Project setup, config files, utilities	`coder`
API routes, DB models, service layer	`backend`
UI components, pages, state management	`frontend`
GitHub Actions, Dockerfile	`ci`
Railway / Render / Vercel configs	`deploy`
Security audits	`security`

BackendAgent

Generates the full backend implementation.

REST API routes with proper error handling (correct HTTP status codes)
Database models using the framework's conventions (ActiveRecord, Django ORM, raw SQL)
Service layer — business logic separated from route handlers
Auth: session cookies for full-stack apps, JWT for API-only
CORS middleware when a frontend is present
Environment variable config — never hardcoded secrets

FrontendAgent

Generates the full frontend implementation.

React components in plain JavaScript (no TypeScript unless spec requires it)
Tailwind CSS for styling — no component libraries
React Router for SPA routing
fetch() with useState/useEffect for data — no React Query, Axios, or Redux
Matches backend API contracts exactly (same endpoints, same field names)
Loading states, error states, and empty states in every component

SecurityAgent

Audits all generated code for security issues.

OWASP Top 10 checks (injection, auth bypass, XSS, IDOR, etc.)
Hardcoded secret detection
Insecure direct object reference patterns
Missing input validation
Returns patched files when issues are found (not just a report)

CIAgent

Generates CI/CD infrastructure.

GitHub Actions workflow: install, lint, test on every push and PR
Single-stage Dockerfile for production
.dockerignore to exclude dev files, secrets, and version control
docker-compose.yml only when the stack includes Redis or multiple services
Does not generate a separate deploy workflow — Railway, Render, and Vercel auto-deploy from GitHub on push

DeployAgent

Generates deployment configuration.

Railway (railway.toml, Procfile)
Render (render.yaml)
Vercel (vercel.json)
Fly.io (fly.toml)
Target platform read from .forge/deploy.md

ReviewerAgent

Final validation pass over all generated files. Reads code from the ArtifactBus (not disk) so it sees the exact content every agent produced.

Broken imports and missing dependencies
API contract mismatches between frontend and backend (auto-detected via ContractRegistry)
Incomplete implementations (placeholders, missing error handling)
Severity classification: error (auto-fix attempted) vs warning
Targeted rework: errors route back to the specialist that wrote the file (backend bug → backend agent, not a generic coder). One retry per file.

Security Model

Every file write goes through AgenticFirewall before touching disk. The firewall reads its policy from .forge/firewall_policy.json.

What the firewall enforces

Path blocklist — agents cannot write to sensitive paths regardless of content:

.env, .env.local, .env.production, .ssh/, .aws/, .gnupg/,
.kube/, .git/, .npmrc, .pypirc, config/secrets.json,
/etc/, /var/, /private/

All other paths within the project root are allowed. This lets agents generate any project structure (e.g., MicAmplifier/, backend/, my-app/src/) without hitting false positives from a rigid allowlist.

Content pattern scanning — rejects files containing:

eval(), exec(), os.system(), subprocess.run(), __import__,
getattr(), setattr(), importlib.*

Shell scripts, CI configs, Dockerfiles, and Makefiles are exempt from pattern scanning — these files legitimately use subprocess calls and shell commands.

Audit log — every file write decision (permitted or denied) is logged to .forge/firewall_audit.log with timestamp and reason.

Spec safety

Before any agent sees the spec or rules files, the CLI scans them for patterns that suggest prompt injection or data exfiltration attempts: URLs, curl/wget commands, references to credentials, exfiltrate, pastebin, etc. Matches trigger a warning.

Templates

Template	Stack	Use case
`web-app`	React + FastAPI + SQLite + Tailwind	General-purpose web application
`api-only`	FastAPI + Pydantic + SQLite	REST API with no frontend
`ai-app`	React + FastAPI + OpenAI/Anthropic SDK	LLM-powered applications
`chrome-ext`	Manifest V3 + vanilla JS	Browser extensions
`cli-tool`	Click/Typer + Rich	Command-line tools
`data-viz`	Streamlit or Plotly/Recharts	Dashboards and data exploration
`slack-bot`	Python + slack-bolt	Slack integrations
`discord-bot`	Python + discord.py	Discord bots

forge new my-app -t web-app        # Create from template
forge templates                    # List all templates
forge new my-app                   # Interactive template picker

Each template ships with:

.forge/spec.md — example spec for that project type
.forge/rules.md — opinionated build constraints
.forge/deploy.md — deployment target configuration

CLI Reference

Project setup

forge new <name>                   # Create project (interactive template picker)
forge new <name> -t web-app        # Create with specific template
forge init                         # Add .forge/ to an existing project
forge templates                    # List available templates

Build

forge build                        # Classic mode: Plan → Build → Review
forge build --adk                  # ADK mode: 7 specialized agents in parallel
forge build -p anthropic           # Use a specific provider
forge build -f "add feature X"     # Incremental: add a feature to existing project
forge build --no-review            # Skip the review phase
forge status                       # Show current build progress and task list

Agents (ADK distributed mode)

forge agents start                 # Start all agents as HTTP servers (ports 8101-8108)
forge agents status                # Show running agents and PIDs
forge agents stop                  # Gracefully stop all agents

Agent port assignments:

Agent	Port
PlannerAgent	8101
BackendAgent	8102
FrontendAgent	8103
SecurityAgent	8104
CIAgent	8105
DeployAgent	8106
ReviewerAgent	8107
ProjectManagerAgent	8108

Development and config

forge dev                          # Auto-detect and start local dev server
forge dev --port 3000              # Custom port
forge config init                  # Create ~/.forge/config.yaml
forge config show                  # Show active configuration
forge config path                  # Print config file location
forge publish                      # Push project to GitHub

Configuration

Forge reads provider configuration from ~/.forge/config.yaml. Created automatically on first run, or manually with forge config init.

providers:
  - name: anthropic
    api_key: ${ANTHROPIC_API_KEY}
    model: claude-sonnet-4-20250514

  - name: openai
    api_key: ${OPENAI_API_KEY}
    model: gpt-4o

  - name: together
    api_key: ${TOGETHER_API_KEY}
    model: meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo

  - name: ollama
    base_url: http://localhost:11434
    model: llama3.1

Forge tries providers in order and uses the first one with valid credentials. Override with the --provider flag or -p shorthand.

Firewall policy

Each project has its own .forge/firewall_policy.json. Override defaults by editing it:

{
  "blocked_paths": [
    "^\\.env$",
    "\\.env\\.local",
    "\\.ssh/.*",
    "\\.git/.*"
  ],
  "blocked_patterns": [
    "eval\\(",
    "exec\\(",
    "os\\.system\\(",
    "__import__"
  ],
  "shell_blocked_patterns": [
    "curl\\s+.*\\|\\s*(?:sh|bash)",
    "chmod\\s+777"
  ]
}

Project directory layout

my-project/
  .forge/
    spec.md              — Project description (you write this)
    rules.md             — Build constraints (template defaults, editable)
    deploy.md            — Deployment target configuration
    firewall_policy.json — AgenticFirewall rules (editable)
    build-state.yaml     — Persisted build state (auto-generated, do not edit)
    decisions.md         — Tech stack decisions from PlannerAgent
    contracts.json       — Persisted API/model contracts (for incremental builds)
    review.yaml          — ReviewerAgent output
    firewall_audit.log   — All file write decisions with timestamps
    agent_pids.yaml      — Running agent PIDs when using distributed mode
  <generated project files>

Installation

From PyPI

pip install forge-ai               # Classic mode only (pyyaml only)
pip install "forge-ai[build]"      # Classic mode + all LLM provider SDKs
pip install "forge-ai[adk]"        # ADK multi-agent mode
pip install "forge-ai[build,adk]"  # Everything

From source

git clone https://github.com/nebullii/forge
cd forge
pip install -e ".[build,adk]"

Provider setup

Provider	Environment variable	Notes
Anthropic	`ANTHROPIC_API_KEY`	Recommended. Claude Sonnet is the default model.
OpenAI	`OPENAI_API_KEY`	GPT-4o default.
Together AI	`TOGETHER_API_KEY`	Cost-effective for longer builds.
Groq	`GROQ_API_KEY`	Fast inference.
Ollama	—	Run `ollama serve` locally, no key needed.

Contributing

Forge is MIT licensed. Contributions are welcome.

git clone https://github.com/nebullii/forge
cd forge
pip install -e ".[dev,build,adk]"
pytest

How the codebase is organized

src/providers/ — add a new LLM provider by subclassing BaseProvider
src/agents/ — add a new specialized agent by subclassing BaseAgent
src/collaboration/ — artifact bus, contract registry, typed models
src/scheduler.py — dependency graph and parallel task execution
templates/ — add a new project template with a .forge/ directory
src/adk/tools.py — add a new orchestrator tool for ADK mode

Design principles

Provider agnostic — swap Anthropic, OpenAI, or local Ollama with a flag
Two modes — classic (parallel, dependency-aware) or ADK (LLM-orchestrated, 7 agents)
Agents collaborate — shared artifact bus + structured contracts, not just file lists
A2A compatible — every agent is a standalone A2A service
Resumable — build state persisted after every task, resume on interrupt
Zero-trust writes — Agentic Firewall validates every file before disk
Thread-safe — all shared state behind RLock; safe for parallel execution
Minimal core — classic mode requires only pyyaml; all extras are opt-in

License

MIT. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
docs		docs
src		src
templates		templates
tests		tests
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
setup.sh		setup.sh

Folders and files

Latest commit

History

Repository files navigation

Forge

The Problem

What Forge Does Differently

System Design at a Glance

Quick Start

1. Install

2. Set your API key

3. Create a project

4. Write your spec

5. Build

6. Run locally

7. Add features incrementally

Contents

System Design

High-Level Architecture

Classic Mode Pipeline

ADK Mode Pipeline

A2A Protocol

Collaboration Layer

Feedback Loop

Parallel Task Scheduling

Layer Map

Build Modes

Classic Mode

ADK Mode

Incremental Mode

Agent Reference

PlannerAgent

ProjectManagerAgent

BackendAgent

FrontendAgent

SecurityAgent

CIAgent

DeployAgent

ReviewerAgent

Security Model

What the firewall enforces

Spec safety

Templates

CLI Reference

Project setup

Build

Agents (ADK distributed mode)

Development and config

Configuration

Firewall policy

Project directory layout

Installation

From PyPI

From source

Provider setup

Contributing

How the codebase is organized

Design principles

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages