
VM0 Architecture

Overview

VM0 is a platform for running AI agent workflows in isolated sandbox environments. The platform consists of three core subsystems:

  1. Compute: Sandbox execution (Firecracker microVMs)
  2. Storage: User data persistence (Cloudflare R2)
  3. Orchestration: Job queue and runner coordination (PostgreSQL)

High-Level Architecture

Execution Flow:

User CLI/API Request
  ↓
Web API (Next.js)
  ↓
Runner Executor (job queue)
  ↓
Compute Layer (Firecracker microVM)
  ↓ (downloads from)
Storage Layer (R2)
  ↓ (reports via webhooks)
Web API
  ↓
User receives results

System Architecture

Compute

The compute layer executes agent workflows in isolated sandbox environments.

Execution Backend: Firecracker

  • Self-hosted microVMs on bare metal Linux
  • Hardware-level isolation via KVM
  • 3-5 second boot time
  • Network namespace isolation per VM
  • Jobs queued in runner_job_queue, runners poll and execute

Storage

The storage layer persists user data (volumes, artifacts, session state) in Cloudflare R2.

Storage Types

Volumes: Read-only data mounted at specified paths

  • Examples: Code repositories, dependencies, reference data
  • Defined in vm0.yaml

Artifacts: Read-write working directory

  • Agent output, modified files, generated assets
  • Versioned after each run
  • Used for checkpoints and resume
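A vm0.yaml declaring both storage types might look roughly like this (the field names and `vm0://` source syntax are illustrative assumptions; only the volume/artifact split and the read-only vs. read-write semantics come from this document):

```yaml
# vm0.yaml (sketch; field names are assumptions, not the real schema)
volumes:
  deps:
    source: vm0://my-org/node-modules   # read-only: repo, deps, reference data
    mount: /workspace/node_modules
artifacts:
  workdir:
    mount: /workspace/out               # read-write: versioned after each run
```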

Data Flow

Upload:

CLI → tar.gz archive → presigned PUT URL → R2
Database records: storage_id, version_id, s3_key
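Since archives are content-addressed by SHA-256 (see Storage Format below), the S3 key can be derived from the archive bytes before upload. A minimal sketch, assuming a `storages/<sha256>.tar.gz` key layout (the prefix and suffix are assumptions; the doc only states that keys are content-addressed):

```typescript
import { createHash } from "node:crypto";

// Derive a content-addressed S3 key for a tar.gz archive.
// "storages/<sha256>.tar.gz" is an assumed layout; the doc only
// states that keys are content-addressed by SHA-256.
function s3KeyFor(archive: Buffer): string {
  const digest = createHash("sha256").update(archive).digest("hex");
  return `storages/${digest}.tar.gz`;
}
```

Identical archives map to the same key, so re-uploading unchanged data is a no-op at the storage layer.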

Download:

Server → presigned GET URL (1h expiration)
  ↓
Storage manifest JSON → Sandbox
  ↓
Sandbox downloads directly from R2 (no API proxy)
  ↓
Extracts to mount paths
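The storage manifest handed to the sandbox might look roughly like this (the shape and field names are illustrative assumptions; the doc specifies only that it is JSON containing presigned URLs and mount paths):

```json
{
  "storages": [
    {
      "type": "volume",
      "mountPath": "/workspace/node_modules",
      "url": "https://<account>.r2.cloudflarestorage.com/<bucket>/<key>?X-Amz-Expires=3600",
      "sha256": "<archive digest>"
    }
  ]
}
```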

Orchestration

The orchestration layer coordinates job execution between web API and runners.

Job Notification:

  • Push: Ably realtime notifications for instant job pickup (~100-200ms)
  • Fallback: Polling every 30s catches missed notifications
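The push-plus-polling pickup can be sketched as follows. An `EventEmitter` stands in for the Ably channel, and `pollForJobs`/`claim` are hypothetical stand-ins for the real runner API; only the push-first, poll-as-fallback structure comes from the doc:

```typescript
import { EventEmitter } from "node:events";

// Sketch of push pickup with a polling fallback. The channel,
// pollForJobs, and claim names are illustrative, not the runner's API.
function startPickup(
  channel: EventEmitter,
  pollForJobs: () => string[],
  claim: (runId: string) => void,
  pollMs = 30_000,
) {
  channel.on("job", (msg: { runId: string }) => claim(msg.runId)); // instant push
  const timer = setInterval(() => {
    for (const runId of pollForJobs()) claim(runId); // catches missed pushes
  }, pollMs);
  return () => clearInterval(timer); // stop handle
}
```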

Runner Behavior:

  1. Subscribe to Ably channel runner-group:{org}/{name}
  2. Receive job notification: { runId }
  3. Claim job atomically via /api/runners/jobs/{id}/claim (sets claimed_at)
  4. Execute in Firecracker VM
  5. Report completion via webhook
  6. Job deleted from queue
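Step 3 must be atomic so that two runners in the same group never execute one job. An in-memory model of the claim-once semantics (the real system does this in PostgreSQL by setting `claimed_at`, e.g. `UPDATE ... SET claimed_at = now() WHERE id = $1 AND claimed_at IS NULL`):

```typescript
// In-memory model of the atomic claim: a job is claimable exactly once.
// The Map stands in for the claimed_at column in runner_job_queue.
const claimedAt = new Map<string, number>();

function claimJob(runId: string): boolean {
  if (claimedAt.has(runId)) return false; // already claimed by another runner
  claimedAt.set(runId, Date.now());       // equivalent to setting claimed_at
  return true;
}
```

A runner that loses the race simply drops the notification; the winning runner proceeds to execution.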

Runner Groups

Format: {org}/{name}

  • Official: vm0/* (e.g., vm0/production) - VM0-managed runners
  • User: {org-slug}/* (e.g., my-team/private) - Self-hosted runners

Authentication:

  • Official runners: HMAC signature using OFFICIAL_RUNNER_SECRET
  • User runners: JWT bearer token with userId claim
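The official-runner HMAC scheme can be sketched like this. Only the use of an HMAC keyed with OFFICIAL_RUNNER_SECRET comes from the doc; the signed payload, digest algorithm, and header layout are assumptions:

```typescript
import { createHmac, timingSafeEqual } from "node:crypto";

// Sketch of HMAC request signing for official runners.
// SHA-256 and hex encoding are assumptions.
function sign(secret: string, body: string): string {
  return createHmac("sha256", secret).update(body).digest("hex");
}

function verify(secret: string, body: string, signature: string): boolean {
  const expected = Buffer.from(sign(secret, body), "hex");
  const given = Buffer.from(signature, "hex");
  // timingSafeEqual avoids leaking the signature via comparison timing
  return expected.length === given.length && timingSafeEqual(expected, given);
}
```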

Infrastructure

Firecracker Sandbox Backend

Firecracker is an open-source VMM (Virtual Machine Monitor) developed by AWS that creates lightweight microVMs using Linux KVM.

Infrastructure Requirements

Hardware:

  • Bare metal Linux server
  • KVM support: /dev/kvm device
  • Cannot run on most cloud VMs (nested virtualization is typically unavailable or unsupported)

Software:

  • Firecracker v1.14.1 binary
  • Linux kernel v6.1.155 (for microVM)
  • Node.js 24.x, pnpm, pm2
  • mitmproxy (network observability)
  • debootstrap (rootfs build only)

Architecture

Runner Application: Rust application in crates/runner/

VM Configuration:

# runner.yaml
firecracker:
  binary: /usr/local/bin/firecracker
  kernel: /opt/firecracker/vmlinux

sandbox:
  vcpu: 2
  memory_mb: 2048
  max_concurrent: 1

Storage Architecture

Shared Read-Only Base:

  • ext4 rootfs (~500MB-1GB)
  • Content-addressed: /var/lib/vm0-runner/rootfs/{hash}/rootfs.ext4
  • Shared across all VMs via nbd-cow
  • Built via debootstrap + chroot in build-rootfs.sh

Per-VM Copy-on-Write (nbd-cow):

  • Userspace NBD-based COW backed by sparse file
  • Device: /dev/nbdN (writable block device)
  • Reads of unmodified blocks go to base image, writes captured in COW file
  • Enables instant boot without rootfs copy
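The COW read/write path above reduces to a simple rule: reads of unmodified blocks fall through to the shared base, writes land in a per-VM overlay. A toy model of those semantics (real nbd-cow exposes a `/dev/nbdN` block device; this only illustrates the dispatch logic):

```typescript
// Toy model of nbd-cow semantics: the base image is never modified;
// writes go to a per-VM overlay, reads prefer the overlay.
class CowDevice {
  private overlay = new Map<number, Uint8Array>();
  constructor(private base: (block: number) => Uint8Array) {}

  read(block: number): Uint8Array {
    return this.overlay.get(block) ?? this.base(block); // overlay hit or base
  }
  write(block: number, data: Uint8Array): void {
    this.overlay.set(block, data); // captured in the COW file, base untouched
  }
}
```

Because the overlay starts empty (a sparse file), a new VM boots instantly against the shared rootfs with no copy.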

Network Architecture

Isolation: Each VM in separate network namespace via pre-warmed namespace pool

Namespace Pool: Pre-allocated network namespaces for fast VM startup

  • Each namespace gets a unique veth pair
  • Namespace side: veth0 (e.g., 10.200.0.2)
  • Host side: vm0-ve-{pool}-{index} (e.g., vm0-ve-00-00)
  • Pool supports up to 64 pools × 256 namespaces

IP Allocation: 10.200.0.0/16 subnets

  • Guest fixed IP: 192.168.241.2 (same across VMs, isolated by namespace)
  • NAT/MASQUERADE: Guest traffic routed through namespace to external network
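The host-side veth naming can be sketched from the pool layout above. The doc shows names like vm0-ve-00-00 for a 64-pool × 256-namespace layout; two hex digits per field is an assumption that fits both ranges:

```typescript
// Derive the host-side veth name for a pooled namespace.
// Two-hex-digit fields are an assumption (64 pools -> 00..3f,
// 256 namespaces -> 00..ff); the doc only shows vm0-ve-00-00.
function hostVethName(pool: number, index: number): string {
  if (pool < 0 || pool >= 64 || index < 0 || index >= 256) {
    throw new RangeError("pool/index out of range");
  }
  const hex = (n: number) => n.toString(16).padStart(2, "0");
  return `vm0-ve-${hex(pool)}-${hex(index)}`;
}
```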

HTTP Proxy: mitmproxy (dynamically allocated port)

  • Intercepts all HTTP/HTTPS traffic
  • Logs requests/responses to per-run JSONL files
  • CA certificate injected into VM trust store
  • Proxy registry: {base_dir}/proxy-registry.json (flock-based coordination)

Execution Flow

  1. Runner receives job via Ably push (or 30s polling fallback)
  2. Creates Firecracker VM (3-5s boot)
  3. Vsock connection to guest agent
  4. Upload scripts, configure DNS, install proxy CA
  5. Preflight check (curl to heartbeat endpoint)
  6. Download storages from R2
  7. Start agent CLI in background
  8. Webhook reports progress
  9. VM terminated on completion

Cloudflare R2 Object Storage

Cloudflare R2 is S3-compatible object storage with zero egress fees.

Configuration

  • Endpoint: https://{R2_ACCOUNT_ID}.r2.cloudflarestorage.com
  • Bucket: R2_USER_STORAGES_BUCKET_NAME
  • SDK: @aws-sdk/client-s3 with S3-compatible API
  • Region: Auto (global)

Storage Format

  • Archives: tar.gz compressed
  • S3 keys: Content-addressed by SHA-256 hash
  • Presigned URLs: 1-hour expiration for GET/PUT

Direct Download

Sandboxes download directly from R2 (no proxy through VM0 API):

  1. VM0 API generates presigned GET URLs
  2. Storage manifest JSON uploaded to sandbox
  3. Sandbox's download.mjs script fetches from R2
  4. Parallel downloads for multiple archives
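The parallel-download step can be sketched as below. `fetchArchive` and `extract` are hypothetical stand-ins for the helpers inside download.mjs; only the fetch-then-extract-per-entry, all-in-parallel structure comes from the doc:

```typescript
// Sketch of the parallel-download step in download.mjs: fetch every
// archive in the manifest concurrently, then extract each to its
// mount path. fetchArchive and extract are illustrative stand-ins.
interface StorageEntry { url: string; mountPath: string; }

async function downloadAll(
  entries: StorageEntry[],
  fetchArchive: (url: string) => Promise<Uint8Array>,
  extract: (data: Uint8Array, dest: string) => Promise<void>,
): Promise<void> {
  await Promise.all(
    entries.map(async (e) => extract(await fetchArchive(e.url), e.mountPath)),
  );
}
```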
