nextlevelbuilder
diff --git a/‎advanced/knowledge-graph.md‎
Lines changed: 224 additions & 5 deletions b/‎advanced/knowledge-graph.md‎
Lines changed: 224 additions & 5 deletions
diff --git a/‎advanced/scheduling-cron.md‎
Lines changed: 2 additions & 1 deletion b/‎advanced/scheduling-cron.md‎
Lines changed: 2 additions & 1 deletion
diff --git a/‎agents/context-files.md‎
Lines changed: 3 additions & 1 deletion b/‎agents/context-files.md‎
Lines changed: 3 additions & 1 deletion
diff --git a/‎channels/telegram.md‎
Lines changed: 17 additions & 1 deletion b/‎channels/telegram.md‎
Lines changed: 17 additions & 1 deletion
diff --git a/‎channels/whatsapp.md‎
Lines changed: 3 additions & 1 deletion b/‎channels/whatsapp.md‎
Lines changed: 3 additions & 1 deletion
diff --git a/‎deployment/upgrading.md‎
Lines changed: 3 additions & 2 deletions b/‎deployment/upgrading.md‎
Lines changed: 3 additions & 2 deletions
diff --git a/‎providers/openai.md‎
Lines changed: 7 additions & 1 deletion b/‎providers/openai.md‎
Lines changed: 7 additions & 1 deletion
@@ -28,6 +28,36 @@ Each entity and relation has a **confidence score** (0.0–1.0). Only items at o
 - Descriptions are one sentence maximum
 - Temperature 0.0 for deterministic results
 
+### Extract API
+
+Trigger extraction manually via the REST API:
+
+```bash
+POST /v1/agents/{agentID}/kg/extract
+Content-Type: application/json
+Authorization: Bearer <token>
+
+{
+  "text": "Conversation text to extract from...",
+  "user_id": "user-123",
+  "provider": "anthropic",
+  "model": "claude-sonnet-4-20250514",
+  "min_confidence": 0.75
+}
+```
+
+Response:
+```json
+{
+  "entities": 5,
+  "relations": 3,
+  "dedup_merged": 1,
+  "dedup_flagged": 0
+}
+```
+
+After extraction, inline dedup runs automatically on newly upserted entities — near-certain duplicates are merged immediately, possible duplicates are flagged for review.
+
 ### Relation types
 
 The extractor uses a fixed set of relation types:
@@ -73,15 +103,48 @@ After extraction, GoClaw automatically checks new entities for duplicates using
 
 **Flagged candidates** are stored in `kg_dedup_candidates` with status `pending`. You can list, dismiss, or manually merge them via the API.
 
-### Bulk duplicate scan
+### Dedup Management Workflow
+
+**1. Scan for duplicates** — Run a full scan across all entities:
+
+```bash
+POST /v1/agents/{agentID}/kg/dedup/scan
+Content-Type: application/json
+
+{"threshold": 0.90, "limit": 100}
+```
+
+Useful after bulk imports or initial onboarding. Results are added to the review queue.
+
+**2. Review candidates:**
 
-You can trigger a full scan across all entities:
+```bash
+GET /v1/agents/{agentID}/kg/dedup?user_id=xxx
+```
+
+Returns `DedupCandidate[]` with fields: `entity_a`, `entity_b`, `similarity`, `status`.
+
+**3. Merge:**
 
 ```bash
-POST /v1/agents/{agentID}/kg/scan-duplicates
+POST /v1/agents/{agentID}/kg/merge
+Content-Type: application/json
+
+{"target_id": "john-doe-uuid", "source_id": "j-doe-uuid"}
 ```
 
-This runs a self-join similarity scan and adds candidates to the review queue. Useful after bulk imports or initial onboarding.
+Re-points all relations from `source_id` to `target_id`, then deletes the source entity.
+
+**4. Dismiss:**
+
+```bash
+POST /v1/agents/{agentID}/kg/dedup/dismiss
+Content-Type: application/json
+
+{"candidate_id": "candidate-uuid"}
+```
+
+Marks the pair as not-duplicate — it won't appear in future review queues.
 
 ---
 
@@ -96,6 +159,16 @@ This runs a self-join similarity scan and adds candidates to the review queue. U
 | `entity_id` | string | Start point for relationship traversal |
 | `max_depth` | int | Traversal depth (default 2, max 3) |
 
+### 3-Tier Search Fallback
+
+The tool uses a 3-tier fallback strategy to ensure results are always returned:
+
+1. **Traversal** (when `entity_id` provided) — BFS outgoing traversal up to `max_depth`, returns up to 20 results with path info and relation types
+2. **Direct connections** (fallback if traversal returns nothing) — Bidirectional 1-hop relations, capped at 10
+3. **Text search** (fallback if no connections) — Full-text search on entity names/descriptions, returns up to 10 results with their relations (5 per entity)
+
+When all three tiers return nothing, the tool returns the top 10 existing entities as hints so the model knows what's available in the graph.
+
 ### Search modes
 
 **Text search** — Find entities by name or keyword:
@@ -119,6 +192,75 @@ Results include entity names, types, descriptions, depth, traversal path, and th
 
 ---
 
+## REST API Reference
+
+All endpoints require authentication (`Authorization: Bearer <token>`). Add `?user_id=<id>` to scope results to a specific user.
+
+| Method | Path | Description |
+|--------|------|-------------|
+| `GET` | `/v1/agents/{agentID}/kg/entities` | List or search entities |
+| `GET` | `/v1/agents/{agentID}/kg/entities/{entityID}` | Get entity with its relations |
+| `POST` | `/v1/agents/{agentID}/kg/entities` | Upsert entity |
+| `DELETE` | `/v1/agents/{agentID}/kg/entities/{entityID}` | Delete entity (cascades relations) |
+| `POST` | `/v1/agents/{agentID}/kg/traverse` | Traverse the graph from an entity |
+| `POST` | `/v1/agents/{agentID}/kg/extract` | LLM-powered extraction from text |
+| `GET` | `/v1/agents/{agentID}/kg/stats` | Graph statistics |
+| `GET` | `/v1/agents/{agentID}/kg/graph` | Full graph for visualization |
+| `POST` | `/v1/agents/{agentID}/kg/dedup/scan` | Scan for duplicate candidates |
+| `GET` | `/v1/agents/{agentID}/kg/dedup` | List dedup candidates |
+| `POST` | `/v1/agents/{agentID}/kg/merge` | Merge two entities |
+| `POST` | `/v1/agents/{agentID}/kg/dedup/dismiss` | Dismiss a dedup candidate |
+
+---
+
+## Data Model
+
+### Entity
+
+```json
+{
+  "id": "uuid",
+  "agent_id": "agent-uuid",
+  "user_id": "optional-user-id",
+  "external_id": "john-doe",
+  "name": "John Doe",
+  "entity_type": "person",
+  "description": "Backend engineer on the platform team",
+  "properties": {"team": "platform"},
+  "source_id": "optional-source-ref",
+  "confidence": 0.95,
+  "created_at": 1711900000,
+  "updated_at": 1711900000
+}
+```
+
+| Field | Description |
+|-------|-------------|
+| `external_id` | Human-readable slug (e.g., `john-doe`). Used for upsert dedup. |
+| `properties` | Arbitrary key-value metadata from extraction |
+| `source_id` | Optional reference to the source conversation or document |
+| `confidence` | Extraction confidence (0.0–1.0); surviving entity in merges keeps the higher value |
+
+### Relation
+
+```json
+{
+  "id": "uuid",
+  "agent_id": "agent-uuid",
+  "user_id": "optional-user-id",
+  "source_entity_id": "john-doe-uuid",
+  "relation_type": "works_on",
+  "target_entity_id": "project-alpha-uuid",
+  "confidence": 0.9,
+  "properties": {},
+  "created_at": 1711900000
+}
+```
+
+Relations are directional: `source --relation_type--> target`. Deleting an entity cascades and removes all its relations.
+
+---
+
 ## Entity Types
 
 | Type | Examples |
@@ -133,6 +275,83 @@ Results include entity names, types, descriptions, depth, traversal path, and th
 
 ---
 
+## Graph Statistics & Visualization
+
+### Statistics
+
+```bash
+GET /v1/agents/{agentID}/kg/stats?user_id=xxx
+```
+
+```json
+{
+  "entity_count": 42,
+  "relation_count": 87,
+  "entity_types": {
+    "person": 15,
+    "project": 8,
+    "concept": 12,
+    "task": 7
+  }
+}
+```
+
+### Full Graph for Visualization
+
+```bash
+GET /v1/agents/{agentID}/kg/graph?user_id=xxx&limit=200
+```
+
+Returns all entities and relations suitable for rendering in a graph UI. Default limit is 200 entities; relations are capped at 3× the entity limit.
+
+The web dashboard renders the graph using **ReactFlow** with **D3 Force Simulation** (`d3-force`) for automatic node positioning:
+
+- **Force layout** — `forceSimulation` computes node positions using link distance, charge repulsion (`forceManyBody`), centering (`forceCenter`), and collision avoidance (`forceCollide`). Forces scale by node count (tighter for small graphs, spread for large).
+- **Node sizing by type** — Each entity type has a different mass (organization=8, project=6, person=4, etc.), so hub entities naturally sit at the center.
+- **Degree centrality** — When entities exceed the display limit (50), the graph keeps the most-connected hub nodes. Nodes with ≥4 connections get a glow highlight.
+- **Interactive selection** — Clicking a node highlights its connected edges with labels, dims unrelated edges, and opens the entity detail dialog.
+- **Theme support** — Dual-theme color palette (dark/light) with per-entity-type colors. Theme changes update colors without re-running the layout.
+- **Performance** — Node components are `memo`-ized, layout runs in `setTimeout(0)` to avoid blocking, and edge updates use `useTransition` for responsive interaction.
+
+---
+
+## Shared Knowledge Graph
+
+By default, the knowledge graph is scoped per agent **and** per user — each user builds their own graph. When `share_knowledge_graph` is enabled in the agent's workspace sharing config, the graph becomes agent-level (shared across all users):
+
+```yaml
+workspace_sharing:
+  share_knowledge_graph: true
+```
+
+In shared mode, `user_id` is ignored for all KG operations — entities and relations from all users are stored and queried together. This is useful for team agents where everyone should see the same entity graph.
+
+> **Note:** `share_knowledge_graph` is independent of `share_memory`. You can share memory without sharing the graph, or vice versa.
+
+---
+
+## Automatic Extraction on Memory Write
+
+When an agent writes to its memory files (e.g., `MEMORY.md` or files under `memory/`), GoClaw automatically triggers KG extraction on the written content. This happens via the `MemoryInterceptor`, which calls the configured LLM to extract entities and relations from the new memory text.
+
+This means agents continuously build their knowledge graph as they learn — no manual `/kg/extract` calls needed for normal conversations. The extract API is available for bulk imports or external integrations.
+
+---
+
+## Confidence Pruning
+
+Remove low-confidence entities and relations in bulk using `PruneByConfidence`:
+
+```bash
+# Internal service call — prunes items below threshold
+# Returns count of pruned entities and relations
+PruneByConfidence(agentID, userID, minConfidence)
+```
+
+This is useful after bulk imports where many low-confidence items accumulate. Items with `confidence < minConfidence` are deleted; their relations cascade automatically.
+
+---
+
 ## Example
 
 After several conversations about a project, an agent's knowledge graph might contain:
@@ -159,4 +378,4 @@ An agent can then answer questions like *"Who is working on Project Alpha?"* by
 - [Memory System](/memory-system) — Vector-based long-term memory
 - [Sessions & History](/sessions-and-history) — Conversation storage
 
-<!-- goclaw-source: e7afa832 | updated: 2026-03-30 -->
+<!-- goclaw-source: a47d7f9f | updated: 2026-03-31 -->
@@ -103,6 +103,7 @@ goclaw cron delete <jobId>
 | `schedule.expr` | string | 5-field cron expression (for `cron`) |
 | `schedule.tz` | string | IANA timezone for cron expressions; omit to use the gateway default timezone |
 | `message` | string | Text the agent receives as its input |
+| `stateless` | bool | Run without session history — saves tokens for simple scheduled tasks. Default `false` |
 | `deliver` | bool | `true` = deliver result to a channel; `false` = agent processes silently. Auto-defaults to `true` when the job is created from a real channel (Telegram, etc.) |
 | `channel` | string | Target channel: `telegram`, `discord`, etc. Auto-filled from context when `deliver` is `true` |
 | `to` | string | Chat ID or recipient identifier. Auto-filled from context when `deliver` is `true` |
@@ -317,4 +318,4 @@ When a session's conversation history exceeds **60% of the context window**, the
 - [Skills](/skills) — inject domain knowledge so scheduled agents are more effective
 - [Sandbox](/sandbox) — isolate code execution during scheduled agent runs
 
-<!-- goclaw-source: 941a965 | updated: 2026-03-19 -->
+<!-- goclaw-source: a47d7f9f | updated: 2026-03-31 -->
@@ -125,6 +125,8 @@ _(Domain-specific knowledge goes here: coding standards, image generation techni
 **Open agent:** Per-user (generated on first chat)
 **Predefined agent:** Agent-level (optionally generated via LLM summoning)
 
+> **Auto-sync:** When you rename an agent, the `Name:` field in IDENTITY.md is automatically updated to match. Other fields remain unchanged.
+
 ### TOOLS.md
 
 **Purpose:** Local tool notes. Camera names, SSH hosts, TTS voice preferences, device nicknames.
@@ -372,4 +374,4 @@ FAQ bot creation with summoning:
 - [Summoning & Bootstrap](/summoning-bootstrap) — how SOUL.md and IDENTITY.md are LLM-generated
 - [Creating Agents](/creating-agents) — step-by-step agent creation
 
-<!-- goclaw-source: 57754a5 | updated: 2026-03-23 -->
+<!-- goclaw-source: a47d7f9f | updated: 2026-03-31 -->
@@ -143,6 +143,22 @@ flowchart TD
     BUFFER --> NEXT["Next mention:<br/>history included"]
 ```
 
+### Group Message Annotation
+
+In group chats, each message is prefixed with a `[From:]` annotation so the agent knows who is speaking:
+
+```
+[From: @username (Display Name)]
+Message content here
+```
+
+The label format depends on available user data:
+- Username + display name: `@username (Display Name)`
+- Username only: `@username`
+- Display name only: `Display Name`
+
+This annotation is also added to DM messages for consistent sender identification.
+
 ### Group Concurrency
 
 Group sessions support up to **3 concurrent agent runs**. When this limit is reached, additional messages are queued. This applies to all group and forum topic contexts.
@@ -275,4 +291,4 @@ Each Telegram instance maintains an isolated HTTP transport — no shared connec
 - [Browser Pairing](/channel-browser-pairing) — Pairing flow
 - [Sessions & History](/sessions-and-history) — Conversation history
 
-<!-- goclaw-source: 0dab087f | updated: 2026-03-26 -->
+<!-- goclaw-source: a47d7f9f | updated: 2026-03-31 -->
@@ -79,6 +79,8 @@ Bridge detects group chats via `@g.us` suffix in chat ID:
 
 Policies apply accordingly (DM policy for DMs, group policy for groups).
 
+In group chats, messages include a `[From:]` annotation with the sender's display name, allowing the agent to distinguish between participants.
+
 ### Message Format
 
 Messages are JSON objects:
@@ -142,4 +144,4 @@ isGroup := strings.HasSuffix(chatID, "@g.us")
 - [Larksuite](/channel-feishu) — Larksuite integration
 - [Browser Pairing](/channel-browser-pairing) — Pairing flow
 
-<!-- goclaw-source: 57754a5 | updated: 2026-03-18 -->
+<!-- goclaw-source: a47d7f9f | updated: 2026-03-31 -->
@@ -9,7 +9,7 @@ A GoClaw upgrade has two parts:
 1. **SQL migrations** — schema changes applied by `golang-migrate` (idempotent, versioned)
 2. **Data hooks** — optional Go-based data transformations that run after schema migrations (e.g. backfilling a new column)
 
-The `./goclaw upgrade` command handles both in the correct order. It is safe to run multiple times — it is fully idempotent. The current required schema version is **32**.
+The `./goclaw upgrade` command handles both in the correct order. It is safe to run multiple times — it is fully idempotent. The current required schema version is **33**.
 
 ```mermaid
 graph LR
@@ -225,6 +225,7 @@ These five migrations are auto-applied on startup when upgrading to v2.x. No man
 | 030 | Adds GIN indexes on `spans.metadata` (partial, `span_type = 'llm_call'`) and `sessions.metadata` JSONB columns for query performance |
 | 031 | Adds `tsv tsvector` generated column + GIN index to `kg_entities` for full-text search; creates `kg_dedup_candidates` table for entity deduplication review |
 | 032 | Creates `secure_cli_user_credentials` for per-user CLI credential injection; adds `contact_type` column to `channel_contacts` |
+| 033 | Cron payload columns | Promotes `stateless`, `deliver`, `deliver_channel`, `deliver_to`, `wake_heartbeat` from `payload` JSONB to dedicated columns on `cron_jobs` |
 
 ### Breaking Changes in v2.x
 
@@ -277,4 +278,4 @@ Before each upgrade, check the release notes for:
 - [Database Setup](/deploy-database) — PostgreSQL and pgvector setup
 - [Observability](/deploy-observability) — monitor your gateway post-upgrade
 
-<!-- goclaw-source: e7afa832 | updated: 2026-03-30 -->
+<!-- goclaw-source: a47d7f9f | updated: 2026-03-31 -->
@@ -95,10 +95,16 @@ OpenAI function calling works out of the box. GoClaw converts internal tool defi
 | `HTTP 400` on o-series | Unsupported parameter | Avoid setting `temperature` with o-series models |
 | Vision not working | Model doesn't support images | Use gpt-4o or gpt-4o-mini |
 
+### Developer Role (GPT-4o+)
+
+For native OpenAI endpoints (`api.openai.com`), GoClaw automatically maps the `system` role to `developer` when sending requests. The `developer` role has higher instruction priority than `system` for GPT-4o and newer models.
+
+This mapping only applies to native OpenAI infrastructure. Other OpenAI-compatible backends (Azure OpenAI, proxies, Qwen, DeepSeek, etc.) continue to use the standard `system` role.
+
 ## What's Next
 
 - [OpenRouter](/provider-openrouter) — access 100+ models through one API key
 - [Anthropic](/provider-anthropic) — native Claude integration
 - [Overview](/providers-overview) — provider architecture and retry logic
 
-<!-- goclaw-source: 57754a5 | updated: 2026-03-18 -->
+<!-- goclaw-source: a47d7f9f | updated: 2026-03-31 -->