Sovereign Edge

A privacy-first personal AI system — five expert agents, live data grounding, edge-deployed on a Jetson Orin.

Architecture · Experts · Setup · Configuration · Development · Contributing

Demo

Demo coming soon — see Verifying the Deployment for what to expect.

What is Sovereign Edge?

Sovereign Edge is an always-on personal intelligence system that runs five specialized AI agents on a Jetson Orin Nano (or any Linux ARM64/x86 host). It wires live data sources — arXiv, HuggingFace papers, Jina web search, Bible API — into a LangGraph multi-agent pipeline, then routes generation through a free-tier cloud LLM gateway with automatic failover.

Every message is classified in <10ms by an ONNX-quantized DistilBERT router before a single token is generated. PII is detected and forced to local inference. Cloud API keys are free-tier only. A scheduled morning pipeline delivers actionable briefs before your workday starts.

This is a single-user, single-owner system. Access is gated to one Telegram chat ID. All design decisions optimize for privacy, low cost, and perpetual operation on constrained hardware.

Why

Cloud AI services see everything you ask them. Sovereign Edge keeps your conversations, decisions, and personal data entirely on your own hardware — a Jetson Orin Nano running 5 specialized LangGraph agents. No data leaves your network. Multi-provider LLM routing (4 cloud providers + local Ollama) means you get the best model for each task without vendor lock-in. Built with production infrastructure: SOPS+Age encryption, systemd services, ONNX intent routing under 10ms, structured observability.

Expert Agents

Expert	Domain	Live Data Source	Morning Brief
Spiritual	Scripture study, prayer, devotionals	bible-api.com (KJV)	05:15 — daily devotional
Career	Job search, resume coaching, market intel	Jina web search	06:00 + 18:00 rescan
Intelligence	AI/ML research synthesis, trend monitoring	arXiv, HuggingFace Daily Papers	05:30 — digest
Creative	Writing, content strategy, social media	Jina web search	07:00 — content prompt
Goals	Personal goal tracking, daily check-ins	SQLite goal store	07:30 — top 3 + action

Each expert runs a LangGraph subgraph — a multi-node pipeline with live data retrieval, LLM synthesis, and structured output validation via instructor + Pydantic. If LangGraph is unavailable, each falls back to a direct LLMGateway.complete() call gracefully.

Architecture

graph TD
    A["Telegram / Discord<br/>(owner-only)"] --> B

    subgraph router ["Intent Router · &lt;10ms"]
        B["Embedding similarity<br/>(Ollama)"] --> C["ONNX DistilBERT<br/>(INT8, 6-class)"]
        C --> D["Keyword fallback"]
        D --> PII["PII check → force LOCAL"]
    end

    PII --> E

    subgraph orch ["LangGraph Orchestrator"]
        E["Director<br/>(query planning)"] --> F["Expert Dispatch"]
    end

    F --> S["Spiritual<br/>Bible API"]
    F --> CA["Career<br/>Jina Search"]
    F --> IN["Intelligence<br/>arXiv · HF Papers"]
    F --> CR["Creative<br/>Jina Search"]
    F --> G["Goals<br/>SQLite"]

    S & CA & IN & CR & G --> LLM

    subgraph llm ["LLM Gateway · LiteLLM"]
        LLM["Groq → Gemini → Mistral<br/>→ Ollama (local fallback)"]
    end

    LLM --> MEM

    subgraph mem ["Memory"]
        MEM["Episodic (Mem0)<br/>Semantic (LanceDB)<br/>Procedural (SQLite)"]
    end

    MEM --> OUT["Telegram response"]

    subgraph sched ["APScheduler · 7 cron jobs"]
        SCH["Morning pipeline<br/>05:00 – 18:00 CT"]
    end

    SCH --> orch

Request lifecycle in brief

Message in — auth check (owner chat ID only), rate limit (2s gap), 2000-char cap
Route — ONNX DistilBERT classifies intent in <10ms; PII forces local routing
Plan — Director LLM decides which experts to invoke and in what order
Execute — Each expert runs its LangGraph subgraph (live data fetch → LLM synthesis)
Memory — Result persisted to Mem0 episodic store + SQLite skill library
Respond — Formatted reply streamed back to Telegram

See Architecture for the full request lifecycle, HITL flow, memory tiers, and design decisions.

Morning Pipeline

Delivered automatically each day at the configured wake time (SE_MORNING_WAKE_HOUR, default 05:00 in SE_TIMEZONE):

Time	Brief	Expert
05:00	Health check — all experts validated	System
05:15	Daily devotional with live scripture	Spiritual
05:30	AI/ML digest — arXiv + HuggingFace papers	Intelligence
06:00	Job market scan — live DFW listings	Career
07:00	Daily creative content prompt	Creative
07:30	Top 3 urgent goals + one concrete action	Goals
18:00	Evening career rescan	Career

Tech Stack

View full stack

Layer	Technology	Notes
Runtime	Python 3.11+, asyncio	Fully async I/O throughout
Package management	uv workspace	Monorepo with per-package `pyproject.toml`
Agent orchestration	LangGraph `StateGraph`	Subgraphs per expert + director graph
LLM routing	LiteLLM 1.82.6	Pinned — 1.82.7+ had supply chain issues
Structured output	instructor + Pydantic	All expert responses validated at schema level
Cloud LLMs	Groq, Gemini, Mistral	Free-tier only; automatic failover
Local inference	Ollama (`qwen3:0.6b`)	Fallback when all cloud providers fail
Embeddings	Ollama (`qwen3-embedding:0.6b`)	Tier 1 intent routing + semantic search
Intent classification	ONNX DistilBERT INT8	<10ms on Jetson CPU; keyword fallback
Vector store	LanceDB	Embedded, no server process, ~300 MB
Conversation memory	SQLite (WAL mode)	Skill patterns + conversation history
Episodic memory	Mem0	Semantic recall across sessions (optional)
Observability	structlog + SQLite trace store	JSON in prod, colorized in dev
Secrets	SOPS + Age encryption	Encrypted secrets committed safely to git
Scheduling	APScheduler `AsyncIOScheduler`	7-job cron pipeline, misfire-tolerant
Interface	python-telegram-bot, discord.py	Owner-only whitelist on both
Deployment	systemd on ARM64/x86 Linux	Jetson Orin Nano, Raspberry Pi, VPS

Quick Start

Prerequisites

Python 3.11+
uv package manager
Ollama running locally
At least one free API key: Groq, Gemini, or Mistral
A Telegram bot token from @BotFather

Install

git clone https://github.com/t-timms/sovereign-edge.git
cd sovereign-edge

# Install all workspace packages
uv sync --all-packages

# Pull local models
ollama pull qwen3:0.6b
ollama pull qwen3-embedding:0.6b

Configure

cp .env.example .env

Minimum required variables:

SE_TELEGRAM_BOT_TOKEN=your_bot_token
SE_TELEGRAM_OWNER_CHAT_ID=your_numeric_chat_id
SE_GROQ_API_KEY=gsk_...          # or any single cloud LLM key
SE_OLLAMA_HOST=http://localhost:11434

Run

uv run python -m telegram_bot

For production deployment on Jetson with systemd and SOPS-encrypted secrets, see Deployment.

Personalization

Sovereign Edge is designed for one person. All personalization happens in .env — no code changes needed.

Career targeting

SE_CAREER_TARGET_LOCATION="Dallas Fort Worth TX"
SE_CAREER_TARGET_ROLES="ML Engineer, AI Engineer, LLM Engineer"
SE_CAREER_DIFFERENTIATORS="GRPO fine-tuning, LangGraph agents, vLLM serving"

Intelligence — repo-aware paper matching

Papers from arXiv are annotated when they match your active projects:

SE_REPO_TOPICS="sovereign-edge:langgraph,mcp,agents; bible-ai:rag,orpo,fine-tuning"

See Configuration for all SE_ variables.

Documentation

Document	Contents
Architecture	Request flow, LLM gateway, memory tiers, HITL, security model
Experts	Capabilities, data sources, and response formats for each agent
Deployment	Jetson setup, systemd service, SOPS secrets, remote deploy
Configuration	All `SE_` environment variables with defaults and descriptions
Development	Local setup, testing, code quality, adding new experts
Contributing	Branch strategy, commit standards, PR checklist, ship workflow
Troubleshooting	Common errors and fixes
Changelog	Version history and release notes
Security	Vulnerability disclosure policy and security model

Full architecture deep-dive: ARCHITECTURE.md

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
.github		.github
agents		agents
docs		docs
evals		evals
packages		packages
prompts		prompts
scripts		scripts
secrets		secrets
services		services
systemd		systemd
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
.sops.yaml		.sops.yaml
ARCHITECTURE.md		ARCHITECTURE.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
Taskfile.yml		Taskfile.yml
docker-compose.jetson.yml		docker-compose.jetson.yml
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sovereign Edge

Demo

What is Sovereign Edge?

Why

Expert Agents

Architecture

Request lifecycle in brief

Morning Pipeline

Tech Stack

Quick Start

Prerequisites

Install

Configure

Run

Personalization

Career targeting

Intelligence — repo-aware paper matching

Documentation

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Sovereign Edge

Demo

What is Sovereign Edge?

Why

Expert Agents

Architecture

Request lifecycle in brief

Morning Pipeline

Tech Stack

Quick Start

Prerequisites

Install

Configure

Run

Personalization

Career targeting

Intelligence — repo-aware paper matching

Documentation

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages