From c72c821a9bf86db2d49bc58373218beaaa16a694 Mon Sep 17 00:00:00 2001
From: Federico Kamelhar <federico.kamelhar@oracle.com>
Date: Thu, 30 Apr 2026 23:05:33 -0400
Subject: [PATCH] =?UTF-8?q?docs:=20rewrite=20README=20=E2=80=94=20content-?=
 =?UTF-8?q?first,=20drop=20GIFs,=20link=20to=20website?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The previous README leaned on three large GIFs (`hero.gif`,
`oracle_26ai/demo.gif`, `build-an-agent.gif`) and stale exact-count
test badges (``2,987 unit / 330+ integration``). The website at
oracle-samples.github.io/locus is now the authoritative content
source for screenshots / diagrams / GIFs; this README's job is the
30-second orientation that points the reader there.

Changes
-------
- Drop all three demo GIFs.
- Keep the two informative SVG diagrams (``agent-loop.svg``,
  ``architecture.svg``).
- Trim badges to status-style only — Python version, license, mypy
  strict, ruff clean, OCI day-0. No more brittle test counts.
- Replace the "Architecture / One run end-to-end / Quick start /
  Capabilities" sequence with a tighter flow:
    1. One-line pitch + bash install.
    2. Hello-agent code block (kept verbatim — it's the load-bearing
       teaser).
    3. ``What you get`` table — every row links to the matching
       concept page on the website.
    4. ``The agent loop`` (one SVG + 4-bullet narration + link).
    5. ``Architecture`` (one SVG, 1-line caption).
    6. ``Multi-agent`` table (six in-process patterns + A2A).
    7. ``Quick start`` (12-line scheduling agent).
    8. ``Tutorials`` table grouped by track + 3 demo links.
    9. ``Repo layout`` text tree.
   10. ``Contributing`` with the new ``hatch run check`` gate.
   11. ``Citing GSAR`` BibTeX block.
   12. ``License`` one-liner.
- README dropped from 502 → 292 lines.
- Tutorial counter says ``39 progressive tutorials`` (was 38; matches
  current state after !100 / PR #19).
- ``Six coordination patterns plus A2A`` framing throughout (matches
  the alignment landed in PR #7).

Drive-by
--------
- New ``Citing GSAR`` section with BibTeX block pointing at
  ``arXiv:2604.23366`` so production users / academics have a clean
  citation handle.
- Documentation row links go to website concept pages, not local
  files — readers landing on GitHub now find their way to the
  curated website content rather than ad-hoc markdown.

Signed-off-by: Federico Kamelhar <federico.kamelhar@oracle.com>
---
 README.md | 600 ++++++++++++++++++------------------------------------
 1 file changed, 195 insertions(+), 405 deletions(-)
diff --git a/README.md b/README.md
index 5c70259..b630cbf 100644
--- a/README.md
+++ b/README.md
@@ -3,197 +3,157 @@
 </p>
 
 <p align="center">
-  <img src="https://img.shields.io/badge/Python-3.11+-blue.svg" alt="Python 3.11+">
-  <img src="https://img.shields.io/badge/License-UPL--1.0-green.svg" alt="License">
-  <img src="https://img.shields.io/badge/tests-2987%20unit%20%2F%20330%2B%20integration-brightgreen.svg" alt="Tests">
-  <img src="https://img.shields.io/badge/mypy-strict-brightgreen.svg" alt="mypy">
-  <img src="https://img.shields.io/badge/ruff-clean-brightgreen.svg" alt="ruff">
-  <img src="https://img.shields.io/badge/OCI%20GenAI-day%200-orange.svg" alt="OCI GenAI day-0">
-</p>
-
-<p align="center">
-  <strong>Build agents that finish.</strong>
+  <strong>Build AI workflows that actually ship.</strong><br>
+  Oracle Generative AI · Multi-Agent · Reasoning · Orchestrator SDK.
 </p>
 
 <p align="center">
-  Retrieve · reason · remember · recover.<br>
-  Idempotent tools so they don't double-charge. Reflexion so they don't loop on a wrong premise.<br>
-  Durable memory so they survive restarts. Eval so you can prove they shipped.
-</p>
-
-<p align="center">
-  Built on OCI GenAI · Oracle 26ai · OCI Object Storage. Day-0 model support.<br>
-  <strong>2,987</strong> unit tests + <strong>330+</strong> live integration tests on every commit.
+  <img src="https://img.shields.io/badge/Python-3.11%E2%80%933.14-blue.svg" alt="Python 3.11–3.14">
+  <img src="https://img.shields.io/badge/License-UPL--1.0-green.svg" alt="License">
+  <img src="https://img.shields.io/badge/mypy-strict-brightgreen.svg" alt="mypy strict">
+  <img src="https://img.shields.io/badge/ruff-clean-brightgreen.svg" alt="ruff clean">
+  <img src="https://img.shields.io/badge/OCI%20GenAI-day%200-orange.svg" alt="OCI GenAI day-0">
 </p>
 
 <p align="center">
-  <a href="#architecture">Architecture</a> ·
-  <a href="#one-run-end-to-end">A run, end to end</a> ·
-  <a href="#quick-start">Quick start</a> ·
-  <a href="#capabilities">Capabilities</a> ·
-  <a href="examples/">39 tutorials</a> ·
+  <a href="https://oracle-samples.github.io/locus/">Documentation</a> ·
+  <a href="https://oracle-samples.github.io/locus/concepts/agent-loop/">Architecture</a> ·
+  <a href="https://oracle-samples.github.io/locus/concepts/multi-agent/">Multi-agent</a> ·
+  <a href="https://oracle-samples.github.io/locus/concepts/gsar/">GSAR</a> ·
+  <a href="examples/">Tutorials</a> ·
   <a href="CONTRIBUTING.md">Contributing</a>
 </p>
 
-<p align="center">
-  <img src="docs/img/hero.gif" alt="Locus + Oracle 26ai end-to-end: skill, RAG, Reflexion, idempotent write, durable checkpoint — one agent.run()." width="100%">
-</p>
-
 ---
 
-## Architecture
-
-<p align="center">
-  <img src="docs/img/architecture.svg" alt="Locus architecture: every layer of the agent stack — reasoning, multi-agent, tools, hooks, streaming, models, RAG, memory, eval — native, on one runtime." width="100%">
-</p>
-
-Ten layers, one runtime. The diagram is the source of truth for what locus ships.
+Spin up a **swarm** of specialists. Hand a conversation off across an
+**escalation desk**. Run an **orchestrator** of experts in parallel.
+Wire up a **state graph** that loops until confident. Mesh agents
+**across processes** with A2A. Or just ship one self-correcting agent
+that knows when to stop.
 
-## One run, end to end
-
-A real `agent.run_sync()` against Oracle 26ai. The agent loads a **skill**
-from disk, **retrieves** from a native VECTOR index, **reasons** in a
-Reflexion loop, calls an **idempotent** write tool, and **checkpoints** to
-OCI Object Storage so the conversation resumes from `thread_id` on the next
-process. Same agent, five services, no glue.
-
-<p align="center">
-  <img src="docs/img/sequence-26ai.svg" alt="Sequence diagram: locus loads a researcher skill, retrieves from Oracle 26ai, runs Reflexion, fires an idempotent email tool, and checkpoints to OCI Object Storage — fifteen messages across six surfaces in a single agent run." width="100%">
-</p>
-
-The diagram is the design. The runnable program is at
-[`examples/demos/oracle_26ai/`](examples/demos/oracle_26ai/) — `demo.py`
-plus a one-shot `setup_corpus.py` that ingests five sample documents into
-Oracle 26ai. Run against the live free-tier ADB it produces:
-
-![Locus + Oracle 26ai end-to-end run.](examples/demos/oracle_26ai/demo.gif)
-
-```text
- AGENT REPLY
- Sent a 2-sentence HNSW summary citing the top three corpus hits
- ("hnsw", "embeddings", "ivf") to me@org.com.
-
- TRACE
-   1. skills(skill_name='researcher')
-   2. search_corpus(topic='HNSW')        ← Oracle 26ai VECTOR similarity
-   3. email_report(to='me@org.com')      ← @tool(idempotent=True)
-
- iterations: 4   tools: 3   email body sends: 1   checkpoint persisted: ✅
-```
-
-## What you get
-
-| | |
-|---|---|
-| **🧠 Reasoning** | `reflexion=True` (self-evaluate) and `grounding=True` (LLM-as-judge claim verification) on `Agent(...)`. `CausalChain` is a separate graph builder for explicit cause-effect chains. **GSAR** typed-grounding layer (`locus.reasoning.gsar`) for the regulated / safety-critical case — four-way claim partition, evidence-typed score, three-tier `{proceed, regenerate, replan}` decision (`arXiv:2604.23366`). |
-| **🤝 Multi-agent** | Pipelines · Orchestrator + Specialists · Swarm · Handoff · StateGraph · Functional — six in-process patterns sharing one event type, plus **A2A** for cross-process meshes. |
-| **🛡 Idempotent tools** | `@tool(idempotent=True)` — the ReAct loop dedupes repeat calls. The model can't double-charge, double-book, or double-page. |
-| **💾 Durable memory** | Four native checkpointers (OCI Object Storage, in-memory, file, HTTP) plus five storage backends (PostgreSQL, OpenSearch, Redis, SQLite, Oracle 26ai) auto-wrapped via `StorageBackendAdapter` or the `*_checkpointer()` factories. |
-| **🔎 RAG on your data** | Seven vector stores, OCI Cohere + OpenAI embeddings, multimodal (PDF text + OCR, image OCR, audio transcription). Oracle 26ai is the day-1 native target. |
-| **🧩 Skills + Playbooks** | AgentSkills.io filesystem-first skills + declarative YAML/Python playbooks with a `PlaybookEnforcer`. |
-| **📡 Streaming + Server** | Typed events for `match`-statement consumers · SSE · drop-in FastAPI `AgentServer` with `thread_id` persistence (scoped to the bearer principal so two API keys can't read each other's threads). |
-| **🪝 Hooks** | `LoggingHook` / `StructuredLoggingHook` · `TelemetryHook` (OpenTelemetry) · `ModelRetryHook` · `GuardrailsHook` + `ContentFilterHook` · `SteeringHook` (LLM-as-judge tool approval). |
-| **🪙 MCP both ways** | `MCPClient` consumes external Anthropic-spec MCP servers. `LocusMCPServer` exposes locus tools as MCP. Round-trip. |
-| **🌐 Multi-modal providers** | `Agent(web_search=..., web_fetch=..., image_generator=..., speech_provider=...)` auto-registers a matching `@tool`. Built-in `HTTPXWebFetcher` + OpenAI search-preview / DALL-E / TTS+Whisper; bring your own via the four one-method Protocols. |
-| **📊 Evaluation** | `EvalCase` / `EvalRunner` / `EvalReport` — regression suites, custom evaluators, pass / score / duration reporting. |
-| **🛂 Termination algebra** | Eight composable stop conditions on `Agent(termination=...)`: `MaxIterations \| TextMention("DONE") & ConfidenceMet(0.9)` is real Python (`__or__` / `__and__` overloads). |
-| **🧰 Models** | OCI GenAI native (V1 + SDK transport, 90+ models, day-0) · OpenAI · Anthropic · Ollama. One auth surface for OCI: profile, session token, instance / resource principal. |
-
-## Quick start
+Six multi-agent shapes plus A2A. One Oracle-native runtime. Every
+model on OCI the day it lands.
 
 ```bash
 pip install "locus[oci]"
-export OCI_PROFILE=DEFAULT   # any profile in ~/.oci/config
 ```
 
-A scheduling agent in 12 lines. The model uses the built-in date tool to resolve
-"next Friday", then calls a write tool that's `@tool(idempotent=True)` — so even if
-the LLM retries mid-iteration, only one meeting ships:
+## Hello, agent
 
 ```python
-from locus import Agent, tool
-from locus.tools.builtins import get_today_date
+from locus import Agent
+from locus.tools.decorator import tool
+from locus.memory.backends import OCIBucketBackend
+from locus.core.termination import MaxIterations, ToolCalled, ConfidenceMet
+
+@tool
+def search_flights(origin: str, destination: str, date: str) -> list[dict]:
+    """Search the GDS for available flights."""
+    return gds.search(origin, destination, date)
 
 @tool(idempotent=True)
-def book_meeting(date: str, attendees: list[str]) -> dict:
-    """Book a meeting. Idempotent — re-fires return the cached event."""
-    return calendar.book(date, attendees)        # your real calendar call
+def book_flight(flight_id: str, customer_id: str) -> dict:
+    """Book a flight. Re-fires return the cached receipt — never double-charge."""
+    return billing.charge_and_book(flight_id, customer_id)
 
 agent = Agent(
-    model="oci:openai.gpt-5.5",                  # any OCI GenAI model ID
-    tools=[get_today_date, book_meeting],
-    system_prompt="You are a scheduling assistant.",
+    model="oci:openai.gpt-5.5",
+    tools=[search_flights, book_flight],
+    system_prompt="You are a travel concierge. Find a flight, then book it.",
+    reflexion=True,                                      # self-correct mid-run
+    checkpointer=OCIBucketBackend(                       # survive every restart
+        bucket="locus-threads",
+        namespace="<your-namespace>",
+    ),
+    termination=(
+        ToolCalled("book_flight") & ConfidenceMet(0.9)
+    ) | MaxIterations(8),
 )
 
-print(agent.run_sync(
-    "Book a 30-min sync next Friday with alice@ and bob@."
-).message)
-# → "Booked a 30-min sync for next Friday, 2026-05-01, with alice@ and bob@.
-#    Event ID: evt-001."
+result = agent.run_sync(
+    "Book a flight from JFK to NRT on 2026-05-04 for customer C-42.",
+    thread_id="th-c42-jfk-nrt",                          # resumable conversation
+)
+print(result.message)
+# → Booked AA-181 (JFK→NRT, 2026-05-04). Confirmation BK-58291.
 ```
 
-Three iterations, two tool calls. No Project OCID, no `Saver` adapter, no
+That's the whole interface: `model=`, `tools=`, plus the knobs you
+need. No graph editor. No YAML DAG. No `Saver` adapter. No
 `dict[str, Any]` state.
 
-![Build an agent in your editor, then run it.](examples/demos/build-an-agent.gif)
-
-> The GIF runs [`examples/demos/agent_quickstart.py`](examples/demos/agent_quickstart.py)
-> — a different three-tool program against `oci:openai.gpt-5.5` showing the trace.
+## What you get
 
----
+The full surface in one table. Each row links to its concept page in
+the [documentation](https://oracle-samples.github.io/locus/).
 
-## Capabilities
+| | |
+|---|---|
+| **[🧠 Reasoning](https://oracle-samples.github.io/locus/concepts/reasoning/)** | `reflexion=True` (self-evaluate), `grounding=True` (LLM-as-judge claim verification), `CausalChain` for explicit cause-effect graphs. **[GSAR](https://oracle-samples.github.io/locus/concepts/gsar/)** typed-grounding layer for safety-critical pipelines — four-way claim partition + three-tier `{proceed, regenerate, replan}` decision ([`arXiv:2604.23366`](https://arxiv.org/abs/2604.23366)). |
+| **[🤝 Multi-agent](https://oracle-samples.github.io/locus/concepts/multi-agent/)** | Composition · Orchestrator + Specialists · Swarm · Handoff · StateGraph · Functional. Six in-process patterns sharing one event type, plus **A2A** for cross-process meshes. |
+| **[🛡 Idempotent tools](https://oracle-samples.github.io/locus/concepts/idempotency/)** | `@tool(idempotent=True)` — the ReAct loop dedupes repeat calls. The model can't double-charge, double-book, or double-page. |
+| **[💾 Durable memory](https://oracle-samples.github.io/locus/concepts/checkpointers/)** | Four native checkpointers (OCI Object Storage, in-memory, file, HTTP) plus five storage backends (PostgreSQL, OpenSearch, Redis, SQLite, Oracle 26ai). One `BaseCheckpointer` Protocol — no adapter layer. |
+| **[🔎 RAG on your data](https://oracle-samples.github.io/locus/concepts/rag/)** | Seven vector stores, OCI Cohere + OpenAI embeddings, multimodal (PDF text + OCR, image OCR, audio transcription). Oracle 26ai is the day-1 native target. |
+| **[🧩 Skills + Playbooks](https://oracle-samples.github.io/locus/concepts/skills/)** | AgentSkills.io filesystem-first skills + declarative YAML/Python playbooks with a `PlaybookEnforcer` that validates tool calls against step constraints. |
+| **[📡 Streaming + Server](https://oracle-samples.github.io/locus/concepts/server/)** | Typed events for `match`-statement consumers · SSE · drop-in FastAPI `AgentServer` with `thread_id` persistence (scoped to the bearer principal so two API keys can't read each other's threads). |
+| **[🪝 Hooks](https://oracle-samples.github.io/locus/concepts/hooks/)** | `LoggingHook` / `StructuredLoggingHook` · `TelemetryHook` (OpenTelemetry) · `ModelRetryHook` · `GuardrailsHook` + `ContentFilterHook` · `SteeringHook` (LLM-as-judge tool approval). |
+| **[🪙 MCP](https://oracle-samples.github.io/locus/concepts/mcp/)** | `MCPClient` consumes external Anthropic-spec MCP servers. `LocusMCPServer` exposes locus tools as MCP. Round-trip. |
+| **[🌐 Multi-modal providers](https://oracle-samples.github.io/locus/concepts/multi-modal-providers/)** | `Agent(web_search=…, web_fetch=…, image_generator=…, speech_provider=…)` auto-registers a matching `@tool`. Built-in `HTTPXWebFetcher` + OpenAI search-preview / DALL-E / TTS+Whisper; bring your own via the four one-method Protocols. |
+| **[📊 Evaluation](https://oracle-samples.github.io/locus/concepts/evaluation/)** | `EvalCase` / `EvalRunner` / `EvalReport` — regression suites, custom evaluators, pass / score / duration reporting. |
+| **[🛂 Termination algebra](https://oracle-samples.github.io/locus/concepts/termination/)** | Eight composable stop conditions on `Agent(termination=…)`: `MaxIterations \| TextMention("DONE") & ConfidenceMet(0.9)` is real Python (`__or__` / `__and__` overloads). |
+| **[🧰 Models](https://oracle-samples.github.io/locus/concepts/models/)** | OCI GenAI native (V1 + SDK transport, 90+ models, day-0) · OpenAI · Anthropic · Ollama. One auth surface for OCI: profile, session token, instance / resource principal. |
+
+## The agent loop
+
+Every locus agent runs the same four-node loop —
+**Think → Execute → Reflect → Terminate** — with one router deciding
+transitions and one immutable state value flowing through.
 
-### Memory & checkpointing — 4 native + 5 storage-backed
+<p align="center">
+  <img src="docs/img/agent-loop.svg" alt="Locus agent loop: Think → Execute → Reflect → Terminate, with idempotent dedupe at Execute, Reflexion / Grounding / Causal at Reflect, and composable termination algebra at Terminate." width="100%">
+</p>
 
-The checkpointer is a first-class `Agent` argument. Four backends are
-direct `BaseCheckpointer` subclasses — pass them straight to `Agent`.
-The other five expose a simpler dict-shaped storage interface and ship
-with `*_checkpointer()` factories that wrap them with
-`StorageBackendAdapter`:
+- **Think** streams the model's reasoning + the next action.
+- **Execute** runs tool calls in parallel; tools tagged
+  `@tool(idempotent=True)` are deduped against the run's history so
+  retries return the cached receipt instead of re-firing the body.
+- **Reflect** runs Reflexion / Grounding / Causal on cadence, on tool
+  error, or when loop-detection trips — the router routes its
+  judgement back into the next Think.
+- **Terminate?** Typed stop conditions composable with `|` and `&`.
+  Inspect, unit-test, audit; termination is just data.
 
-| Backend | When you use it | How to construct |
-|---|---|---|
-| **OCI Object Storage** *(native)* | Cloud-native; lifecycle policies handle retention | `OCIBucketBackend(bucket_name=..., namespace=...)` |
-| **In-memory** *(native)* | Unit tests | `MemoryCheckpointer()` |
-| **File** *(native)* | Local dev, deterministic tests | `FileCheckpointer(directory="./checkpoints")` |
-| **HTTP** *(native)* | Delegate to a custom checkpoint service | `HTTPCheckpointer(base_url=...)` |
-| **Oracle 26ai** *(storage)* | Your durable store *is* your DB; JSON columns, vacuum, full-text | `oracle_checkpointer(...)` |
-| **PostgreSQL** *(storage)* | Already running PG (often alongside `pgvector` for RAG) | `postgresql_checkpointer(dsn=...)` |
-| **OpenSearch** *(storage)* | Search-stack-native; metadata queries by index | `opensearch_checkpointer(...)` |
-| **Redis** *(storage)* | Hot conversations, low latency, TTL semantics | `redis_checkpointer(url=...)` |
-| **SQLite** *(storage)* | Single-process, embedded | `sqlite_checkpointer(path=...)` |
+Every node emits a typed, **write-protected** event. The same stream
+powers SSE in `AgentServer`, the OpenTelemetry telemetry hook, the
+structured logging hook, and your `async for event in agent.run(…)`
+consumer.
 
-```python
-from locus.memory.backends.oci_bucket import OCIBucketBackend
+[Read the full architecture →](https://oracle-samples.github.io/locus/concepts/agent-loop/)
 
-agent = Agent(
-    model="oci:openai.gpt-5.5",
-    checkpointer=OCIBucketBackend(bucket_name="my-app", namespace="ns"),
-)
+## Architecture
 
-# Different process, different worker — same conversation:
-await agent.run("Continue where we left off.", thread_id="user-42")
-```
+<p align="center">
+  <img src="docs/img/architecture.svg" alt="Locus architecture: ten layers — reasoning, multi-agent, tools, hooks, streaming, models, RAG, memory, eval — native, on one runtime." width="100%">
+</p>
 
-Source: [`src/locus/memory/`](src/locus/memory/) ·
-concept doc: [`docs/concepts/checkpointers.md`](docs/concepts/checkpointers.md).
+Ten layers, one runtime. The diagram is the source of truth for what
+locus ships.
 
-### Multi-agent — six in-process patterns plus A2A
+## Multi-agent — six in-process patterns plus A2A
 
-Locus does not pick a single multi-agent metaphor. Different problems want
-different shapes — locus ships six in-process patterns and **A2A** for
-cross-process meshes, all sharing the same `Agent` and event types:
+Different problems want different shapes. All six in-process patterns
+plus A2A share the same `Agent` class and the same event taxonomy.
 
-| Pattern | What it's for | Where it lives |
+| Pattern | When | Source |
 |---|---|---|
-| **Pipeline** (Sequential / Parallel) | Linear chains; fan-out + merge | [`src/locus/agent/composition.py`](src/locus/agent/composition.py) |
-| **Orchestrator + Specialist** | Router decides which expert handles each sub-task | [`src/locus/multiagent/orchestrator.py`](src/locus/multiagent/orchestrator.py) |
-| **Swarm** | Peer-to-peer task queue with `SharedContext` | [`src/locus/multiagent/swarm.py`](src/locus/multiagent/swarm.py) |
-| **Handoff** | Explicit role transfers carrying conversation history | [`src/locus/multiagent/handoff.py`](src/locus/multiagent/handoff.py) |
-| **StateGraph** | DAG with cycles, conditional edges, subgraphs | [`src/locus/multiagent/graph.py`](src/locus/multiagent/graph.py) |
-| **Functional** | `Send` / `SendBatch` for map/reduce | [`src/locus/multiagent/functional.py`](src/locus/multiagent/functional.py) |
-| **A2A protocol** | Cross-runtime messaging via `AgentCard` | [`src/locus/a2a/`](src/locus/a2a/) |
+| **Composition** (Sequential / Parallel / Loop) | linear chains; fan-out + merge; revise-until-confidence | [`agent/composition.py`](src/locus/agent/composition.py) |
+| **Orchestrator + Specialists** | one router decides which expert handles each sub-task | [`multiagent/orchestrator.py`](src/locus/multiagent/orchestrator.py) |
+| **Swarm** | open-ended research; peer-to-peer; shared context | [`multiagent/swarm.py`](src/locus/multiagent/swarm.py) |
+| **Handoff** | escalation desks; conversation moves with full history | [`multiagent/handoff.py`](src/locus/multiagent/handoff.py) |
+| **StateGraph** | explicit DAG with cycles, conditional edges, subgraphs | [`multiagent/graph.py`](src/locus/multiagent/graph.py) |
+| **Functional** | map / reduce over agents; asyncio-native composition | [`multiagent/functional.py`](src/locus/multiagent/functional.py) |
+| **A2A** | cross-process / cross-runtime; capability discovery | [`a2a/protocol.py`](src/locus/a2a/protocol.py) |
 
 ```python
 from locus import Agent
@@ -201,302 +161,132 @@ from locus.agent import SequentialPipeline
 
 researcher = Agent(model=model, system_prompt="Find three key facts.")
 critic     = Agent(model=model, system_prompt="Find flaws in the previous output.")
-writer     = Agent(model=model, system_prompt="Compose a one-paragraph brief.")
+finalizer  = Agent(model=model, system_prompt="Synthesize a one-paragraph answer.")
 
-result = await SequentialPipeline(agents=[researcher, critic, writer]).run(
-    "Vector databases."
-)
+result = await SequentialPipeline(researcher, critic, finalizer).run("…")
 ```
 
-### RAG — 8 vector stores, multimodal corpus
-
-```python
-from locus.rag import RAGRetriever, OCIEmbeddings, OracleVectorStore
+[All multi-agent patterns →](https://oracle-samples.github.io/locus/concepts/multi-agent/)
 
-retriever = RAGRetriever(
-    embedder=OCIEmbeddings(model_id="cohere.embed-english-v3.0"),
-    store=OracleVectorStore(dsn="mydb_high", user="ADMIN", password=..., dimension=1024),
-)
-await retriever.add_file("manual.pdf")     # PDF text + image OCR + audio transcription
-results = await retriever.retrieve("How do I rotate API keys?", limit=5)
+## Quick start
 
-agent = Agent(model=..., tools=[retriever.as_tool()])
+```bash
+pip install "locus[oci]"
+export OCI_PROFILE=DEFAULT   # any profile in ~/.oci/config
 ```
 
-| Surface | Implementations |
-|---|---|
-| **Vector stores** | Oracle 26ai (native `VECTOR`) · OpenSearch · Qdrant · Pinecone · pgvector · Chroma · in-memory |
-| **Embeddings** | Cohere on OCI GenAI · OpenAI |
-| **Multimodal** | PDF text extraction + OCR · image OCR + caption · audio transcription |
-| **Retrieval** | Cosine / dot / Euclidean · top-k · metadata filtering · spotlight injection-safe |
+A scheduling agent in 12 lines. The model uses the built-in date tool
+to resolve "next Friday", then calls a write tool that's
+`@tool(idempotent=True)` — so even if the LLM retries mid-iteration,
+only one meeting ships:
 
-Source: [`src/locus/rag/`](src/locus/rag/).
+```python
+from locus import Agent, tool
+from locus.tools.builtins import get_today_date
 
-### Reasoning — agents that self-correct
+@tool(idempotent=True)
+def book_meeting(date: str, attendees: list[str]) -> dict:
+    """Book a meeting. Idempotent — re-fires return the cached event."""
+    return calendar.book(date, attendees)
 
-```python
 agent = Agent(
     model="oci:openai.gpt-5.5",
-    tools=[search, summarize, validate_claim],
-    reflexion=True,        # self-evaluate per turn
+    tools=[get_today_date, book_meeting],
+    system_prompt="You are a scheduling assistant.",
 )
-```
-
-Three reasoning modules:
-
-- **Reflexion** ([Shinn et al., 2023](https://arxiv.org/abs/2303.11366)) — the
-  agent evaluates its own last step *before* stacking another tool call on top
-  of a wrong premise. First-class on `Agent`: pass `reflexion=True` (or a
-  `ReflexionConfig`). Configure confidence thresholds, diminishing-returns
-  detection, per-iteration cadence.
-- **Grounding** — LLM-as-judge claim verification: every factual statement
-  the agent emits is checked against retrieved context. First-class on
-  `Agent`: pass `grounding=True` (or a `GroundingConfig`).
-- **Causal** — explicit cause-effect chains so you can audit *why* the agent
-  did what it did. Available as a standalone graph builder (`CausalChain`);
-  call its API from a tool or hook to attach nodes/edges as the agent
-  observes facts. Not currently wired through an `Agent` kwarg.
-
-Source: [`src/locus/reasoning/`](src/locus/reasoning/).
-
-### Tools — idempotent, MCP both ways, executor-aware
-
-```python
-@tool(idempotent=True)
-def transfer(from_acct: str, to_acct: str, amount: float) -> dict: ...
-```
 
-- **`@tool`** auto-derives a JSON schema from your typed Python function
-  signature — the model sees a contract, not a docstring.
-- **`@tool(idempotent=True)`** dedupes repeat calls with identical arguments inside
-  a single run — eliminates the model-double-fires-a-write-tool class of bug.
-- **MCP** works in both directions:
-  - `MCPClient` consumes external MCP servers — hook any MCP-published tool into
-    your agent.
-  - `LocusMCPServer` exposes your locus tools as an MCP server so other agents can
-    consume yours.
-- **Executors** — `SequentialExecutor`, `ConcurrentExecutor`, `CircuitBreakerExecutor`
-  for parallel / fault-tolerant tool execution.
-
-### Hooks — observability, guardrails, steering
-
-Built-in hook providers, plus your own. Hooks fire on
-`before / after × invocation × tool × model` and `iteration_start / iteration_end`:
-
-- **`LoggingHook`** / **`StructuredLoggingHook`** — agent + tool traces.
-- **`TelemetryHook`** / **`NoOpTelemetryHook`** — counters, latencies,
-  OpenTelemetry-compatible.
-- **`ModelRetryHook`** — retry on transient model failures.
-- **`GuardrailsHook`** + **`ContentFilterHook`** — PII / SQL / XSS /
-  command-injection regex policies.
-- **`SteeringHook`** — LLM-as-judge tool approval. The agent's about to call
-  `send_email`? A second model gets to vote.
-
-Source: [`src/locus/hooks/`](src/locus/hooks/).
-
-### Streaming + server
-
-```python
-from locus.core.events import ThinkEvent, ToolStartEvent, TerminateEvent
-
-async for event in agent.run("Plan a trip to Paris."):
-    match event:
-        case ThinkEvent(reasoning=r):         print(f"💭 {r}")
-        case ToolStartEvent(tool_name=n):     print(f"🔧 {n}")
-        case TerminateEvent(final_message=m): print(f"✅ {m}")
-```
-
-Typed events stream as the agent runs. For HTTP streaming over SSE,
-locus ships a reference [`AgentServer`](src/locus/server/) (FastAPI) —
-drop in your agent factory and you get `/invoke`, `/stream`, plus
-`GET /threads/{id}` and `DELETE /threads/{id}` for thread management.
-The client picks the `thread_id` in the request body; the server
-prefixes it with the bearer principal hash before persisting, so two
-API keys sharing one server can't read each other's threads.
-
-### Skills + Playbooks
-
-- **Skills** ([AgentSkills.io](https://agentskills.io) spec) — filesystem-first
-  capability disclosure. Drop a `SKILL.md` plus supporting files in a directory,
-  point your agent at it, the model picks up a new capability progressively.
-- **Playbooks** — declarative step-by-step execution. Loader supports YAML,
-  JSON, or Python. For workflows where you want a deterministic agent path with
-  a `PlaybookEnforcer` validating each step.
-
-Source: [`src/locus/skills/`](src/locus/skills/) ·
-[`src/locus/playbooks/`](src/locus/playbooks/).
-
-### Evaluation harness
-
-```python
-from locus.evaluation import EvalCase, EvalRunner
-
-cases = [
-    EvalCase(
-        name="basic-arithmetic",
-        prompt="What is 2+2?",
-        expected_output_contains=["4"],
-    ),
-    # ...
-]
-runner = EvalRunner(agent=agent)
-report = await runner.run(cases)
-print(report.passed, report.total_cases, report.avg_score, report.total_duration_ms)
-```
-
-Run regression suites against your agent. Match on
-`expected_output_contains` / `expected_output_not_contains` / `expected_tools`
-or pass a custom evaluator. Source:
-[`src/locus/evaluation/`](src/locus/evaluation/).
-
-### Termination algebra
-
-```python
-from locus.core.termination import MaxIterations, ToolCalled, ConfidenceMet
-
-stop = MaxIterations(10) | (ToolCalled("send_report") & ConfidenceMet(0.9))
-agent = Agent(..., termination=stop)
+print(agent.run_sync(
+    "Book a 30-min sync next Friday with alice@ and bob@."
+).message)
+# → "Booked a 30-min sync for next Friday, 2026-05-01, with alice@ and bob@.
+#    Event ID: evt-001."
 ```
 
-Eight composable stop conditions — `MaxIterations`, `TokenLimit`,
-`TimeLimit`, `TextMention`, `ToolCalled`, `ConfidenceMet`, `NoToolCalls`,
-`CustomCondition` — plus `__or__` (`|`) and `__and__` (`&`) operator
-overloads on every condition. Source:
-[`src/locus/core/termination.py`](src/locus/core/termination.py).
-
-### Models
-
-| Provider | Transports | Notes |
-|---|---|---|
-| **OCI GenAI** | V1 (`/openai/v1`, real SSE) + SDK | 90+ models, day-0 support, no Project OCID required |
-| **OpenAI** | `chat/completions` | Native, including reasoning families (gpt-4o, gpt-4.1, gpt-5*, o-series) |
-| **Anthropic** | Messages API | Claude 4.x, prompt caching aware |
-| **Ollama** | Local HTTP | For air-gapped / single-laptop dev |
-
-OCI auth surface: config profile (laptops/CI), session token, instance principal
-(OCI VMs / OKE), resource principal (OCI Functions). Same surface for V1 and SDK
-transports.
-
----
-
-## Installation extras
-
-```bash
-# Core (no model providers, no storage)
-pip install locus
+[Full quickstart →](https://oracle-samples.github.io/locus/how-to/quickstart/)
 
-# Model providers
-pip install "locus[openai]"
-pip install "locus[anthropic]"
-pip install "locus[ollama]"
-pip install "locus[oci]"
+## Tutorials
 
-# Storage backends
-pip install "locus[sqlite]"
-pip install "locus[redis]"
-pip install "locus[postgresql]"
-pip install "locus[opensearch]"
+[`examples/`](examples/) has 39 progressive tutorials, each a single
+runnable file. The full set runs end-to-end in CI on every commit;
+each tutorial is a working program against a real model.
 
-# Bundles
-pip install "locus[models]"        # all LLM providers
-pip install "locus[checkpoints]"   # all storage backends
-pip install "locus[all]"           # everything
-```
-
-## More examples
+| Track | Highlights |
+|---|---|
+| **Foundations** | [`01_basic_agent`](examples/tutorial_01_basic_agent.py) · [`05_agent_hooks`](examples/tutorial_05_agent_hooks.py) · [`07_state_management`](examples/tutorial_07_state_management.py) |
+| **Tools & MCP** | [`12_mcp_integration`](examples/tutorial_12_mcp_integration.py) · [`38_multimodal_providers`](examples/tutorial_38_multimodal_providers.py) |
+| **Reasoning** | [`14_reasoning_patterns`](examples/tutorial_14_reasoning_patterns.py) · [`39_gsar_typed_grounding`](examples/tutorial_39_gsar_typed_grounding.py) |
+| **Multi-agent** | [`11_swarm_multiagent`](examples/tutorial_11_swarm_multiagent.py) · [`16_agent_handoff`](examples/tutorial_16_agent_handoff.py) · [`17_orchestrator_pattern`](examples/tutorial_17_orchestrator_pattern.py) · [`34_a2a_protocol`](examples/tutorial_34_a2a_protocol.py) |
+| **RAG** | [`22_rag_basics`](examples/tutorial_22_rag_basics.py) · [`24_rag_agents`](examples/tutorial_24_rag_agents.py) |
+| **Production** | [`19_guardrails_security`](examples/tutorial_19_guardrails_security.py) · [`20_checkpoint_backends`](examples/tutorial_20_checkpoint_backends.py) · [`28_agent_server`](examples/tutorial_28_agent_server.py) · [`37_termination`](examples/tutorial_37_termination.py) |
 
-[`examples/`](examples/) has 39 progressive tutorials, each a single runnable
-file. Highlights:
+End-to-end demos:
 
-- [`tutorial_01_basic_agent.py`](examples/tutorial_01_basic_agent.py) — start here
-- [`tutorial_05_agent_hooks.py`](examples/tutorial_05_agent_hooks.py) — hook system
-- [`tutorial_11_swarm_multiagent.py`](examples/tutorial_11_swarm_multiagent.py) — swarm
-- [`tutorial_14_reasoning_patterns.py`](examples/tutorial_14_reasoning_patterns.py) — reflexion / grounding / causal
-- [`tutorial_16_agent_handoff.py`](examples/tutorial_16_agent_handoff.py) — multi-agent handoff
-- [`tutorial_17_orchestrator_pattern.py`](examples/tutorial_17_orchestrator_pattern.py) — orchestrator + specialists
-- [`tutorial_22_rag_basics.py`](examples/tutorial_22_rag_basics.py) — RAG over a vector store
-- [`tutorial_27_hooks_advanced.py`](examples/tutorial_27_hooks_advanced.py) — guardrails + steering
-- [`tutorial_34_a2a_protocol.py`](examples/tutorial_34_a2a_protocol.py) — Agent-to-Agent protocol
-- [`tutorial_38_multimodal_providers.py`](examples/tutorial_38_multimodal_providers.py) — web search, fetch, image, speech providers
-- [`tutorial_39_gsar_typed_grounding.py`](examples/tutorial_39_gsar_typed_grounding.py) — GSAR typed-grounding layer (`arXiv:2604.23366`)
+- [`examples/demos/po_approval/`](examples/demos/po_approval) — three agents (Procurement / Compliance / Approval Officer) debate a vendor PO against a live Oracle 26ai catalogue. Idempotent writes. Human consent gate.
+- [`examples/demos/oracle_26ai/`](examples/demos/oracle_26ai) — full Oracle stack: OCI GenAI + Oracle 26ai vectors + skills + Reflexion + idempotent submit + checkpoints to OCI Object Storage.
+- [`examples/demos/trip_team/`](examples/demos/trip_team) — same multi-agent shape on a Tokyo travel corpus.
 
 ## Repo layout
 
-```
+```text
 src/locus/
 ├── agent/          Agent runtime, config, composition pipelines
 ├── core/           AgentState, Message, events, termination algebra
 ├── loop/           ReAct nodes (Think, Execute, Reflect)
 ├── memory/         BaseCheckpointer + 9 backends
 ├── models/         Provider registry + OCI native, OpenAI, Anthropic, Ollama
-├── tools/          @tool decorator, registry, builtins, executors, schema
-├── hooks/          Hook events, registry, 5 built-ins
-├── streaming/      AsyncIterator events, SSE, console handler
-├── reasoning/      Reflexion, grounding, causal analysis
-├── rag/            8 vector stores, embeddings, multimodal retrieval
-├── multiagent/     Swarm, orchestrator, handoff, graph, functional pipelines
-├── skills/         AgentSkills.io progressive disclosure
-├── playbooks/      Declarative step-by-step execution
-├── evaluation/     EvalCase, EvalRunner, EvalReport
-├── integrations/   MCP (fastmcp) — both directions
-├── server/         FastAPI HTTP wrapper (reference app)
-└── a2a/            Agent-to-Agent protocol
+├── multiagent/     Composition, Orchestrator+Specialist, Swarm, Handoff, Graph, Functional
+├── a2a/            Cross-process Agent-to-Agent protocol
+├── reasoning/      Reflexion, Grounding, Causal, GSAR (typed grounding)
+├── rag/            Embeddings + 7 vector stores + retrievers
+├── providers/      Multi-modal: web search, web fetch, image, speech
+├── tools/          @tool decorator, registry, builtins, executors
+├── hooks/          Logging, telemetry, retry, guardrails, steering
+├── skills/         AgentSkills.io filesystem-first capability disclosure
+├── playbooks/      Declarative step plans + PlaybookEnforcer
+├── server/         FastAPI AgentServer with thread persistence
+├── eval/           EvalCase + EvalRunner + EvalReport
+└── integrations/   MCP (client + server)
+
+tests/
+├── unit/           Deterministic, no external deps. Runs in CI on every PR.
+└── integration/    Live LLM / OCI / Oracle 26ai. Gated on credentials.
 ```
 
-## Testing
+## Contributing
+
+See [CONTRIBUTING.md](CONTRIBUTING.md). Quick start:
 
 ```bash
-hatch run test          # 2987 unit tests, no services required (~6 s)
-hatch run typecheck     # mypy strict
-hatch run lint          # ruff + format check
-hatch run all           # everything
+git clone https://github.com/oracle-samples/locus.git
+cd locus
+pip install -e ".[dev,all]"
+hatch run check        # ruff format-check + ruff lint + mypy
+hatch run test         # unit tests across the supported Python matrix
+pre-commit install     # auto-run gitleaks, EOL, ruff, mypy on commit
 ```
 
-Integration tests live in [`tests/integration/`](tests/integration/) and skip
-cleanly when their service isn't available — see
-[`tests/integration/conftest.py`](tests/integration/conftest.py) for the env-var
-matrix and [`TESTING_LOCAL.md`](TESTING_LOCAL.md) for the full local setup
-(Docker, Oracle 26ai wallet, OCI bucket, OpenSearch, Redis, PG, Qdrant).
+Every PR runs through:
 
-## Trusted in production
+- **format-check + ruff + mypy** (Python 3.11 + 3.14)
+- **unit tests** (Python 3.11 / 3.12 / 3.13 / 3.14 matrix)
+- **pre-commit** (gitleaks, EOL, whitespace, doc8, markdownlint, YAML format, codespell, ruff, ruff-format)
+- **DCO sign-off** (`git commit -s`)
 
-Locus powers internal agentic workloads at Oracle. Every commit runs the full
-test matrix against real OCI GenAI, Oracle 26ai, OCI Object Storage, OpenSearch,
-Redis, and PostgreSQL — not mocks.
+## Citing GSAR
 
-If you're already on OCI, locus is the SDK that was *built on* the same
-primitives you're already paying for.
+If you use the GSAR layer (typed grounding) in research or production
+write-ups, please cite the paper:
 
-## Contributing
-
-See [`CONTRIBUTING.md`](CONTRIBUTING.md) — it's the long version. Short version:
-
-1. Sign the [Oracle Contributor Agreement](https://oca.opensource.oracle.com).
-2. Branch from `main`. Use [Conventional Commits](https://conventionalcommits.org).
-3. `hatch run all` must pass.
-4. Open a merge request.
-
-We treat new model providers, new checkpointer / RAG backends, hooks, evaluators,
-docs, and tests as first-class contributions.
-
-## Security
-
-See [`SECURITY.md`](SECURITY.md) for vulnerability reporting.
-
-Built-in: error-message sanitization (strips credentials, paths, OCIDs),
-tool-argument validation against declared schemas, SQL identifier validation
-in DB backends, write-protected hook events, and optional LLM-powered steering
-for real-time tool approval.
+```bibtex
+@article{kamelhar2026gsar,
+  title  = {GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs},
+  author = {Kamelhar, Federico A.},
+  journal = {arXiv preprint arXiv:2604.23366},
+  year   = {2026},
+}
+```
 
 ## License
 
-Copyright (c) 2025, 2026 Oracle and/or its affiliates. Released under the
-[Universal Permissive License v1.0](LICENSE).
-
-## Links
-
-- [How-to: OCI GenAI models](docs/how-to/oci-models.md)
-- [Oracle 26ai vector search](https://docs.oracle.com/en/database/oracle/oracle-database/23/vecse/)
-- [OCI GenAI documentation](https://docs.oracle.com/en-us/iaas/Content/generative-ai/home.htm)
-- [AgentSkills.io specification](https://agentskills.io)
-- [Oracle Contributor Agreement](https://oca.opensource.oracle.com)
+[Universal Permissive License v1.0](LICENSE.txt). Built inside Oracle.
+Used in production. Open to everyone.