You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Auto-referenced by the dispatcher skill. Last updated: 2026-03-22.
Health: ✅ working | ⚠️ degraded | ❌ broken | 💤 not installed/inactive
Models
Model
Access
Cost
Best For
Limits
Health
Claude Opus 4.6
Claude Code (self)
Paid (primary)
Architecture, complex reasoning, code review
Rate limits apply
✅
Claude Sonnet 4.5
Claude Code (subagent)
Paid
Fast code gen, web research, parallel tasks
Rate limits apply
✅
Claude Haiku
API
Cheap
Classification, triage, routing
Lower capability
✅
Gemini 2.5 Pro
CLI (gemini) / API
Free tier (250K TPM, 1K RPD)
Best quality free, multimodal, 1M context
5-15 RPM
⚠️ (reinstalled 2026-03-22)
Grok (xAI)
API / Web
$175/mo free credits
Contrarian takes, general tasks
Requires data-sharing opt-in
⚠️ (verify key)
Perplexity
Web
Free/Pro
Search-grounded research, citations
No code execution
✅
Ollama (llama3)
Local (ollama)
Free ($0)
Offline inference, privacy-sensitive tasks
4.7GB, GPU-bound
✅
SambaNova
API (DevLoop)
Free (20M tok/day)
Highest-volume batching, R&D
OpenAI-compat
✅
Mistral Large
API (DevLoop)
Free (1B tok/month)
Code (Codestral), queued batch
2 RPM
✅
Cerebras
API (DevLoop)
Free (1M tok/day)
Speed-critical tasks
2,600 tok/s
✅
Groq
API (DevLoop)
Free (~14K req/day)
Real-time lint/review, whisper
30-60 RPM
✅
Mercury 2
API
10M free tokens (one-time)
Fast diffusion inference
OpenAI-compat
✅
OpenRouter
API (DevLoop)
29 free models, 1K RPD
Aggregator/fallback router
Varies by model
⚠️ (verify key)
Token budget routing:
TIER 0 (local): Ollama — $0, your GPU
TIER 1 (free API): SambaNova → Mistral → Cerebras → Groq → Gemini
TIER 2 (cheap): OpenRouter, Grok credits, Mercury 2
TIER 3 (expensive): Claude Code — supervised work only
NO Chinese models (ban in effect — MiniMax correlation, spam risk). DeepSeek/Kimi via OpenRouter allowed for non-sensitive summarization only when explicitly noted.
3D Reconstruction
Tool
Access
Cost
Quality
Speed
Health
VGGT (Meta)
Local (5GB weights needed)
Free
High (multi-view, not hallucination)
Sub-second
⚠️ (weights not yet downloaded)
Trellis
HuggingFace Space / API
Free (quota)
Good
Moderate
✅ (DTR pipeline, 1st fallback)
Hunyuan3D-2.1 (Tencent)
HuggingFace Space / Gradio API
Free
Good (4-view support)
95-120s
✅ (tested in DTR)
Rodin / Hyper3D
fal.ai API
~$0.40/job
High (quad-mesh, 4K PBR, 21K verts)
Moderate
✅ (tested, in api_reconstruct_v2.py)
Tripo AI
fal.ai API
~$0.40/job
Good
Moderate
✅ (DTR pipeline, 4th fallback)
RealityScan (Epic)
Desktop / REST+gRPC API
Free under $1M revenue
Best-in-class (photogrammetry)
Slow (batch)
✅
COLMAP
Local (no-CUDA build)
Free
High (photogrammetry)
Slow
⚠️ (failed at 360p; untested at 1080p — use with U2Net masking)
Blender
Local
Free
N/A (mesh cleanup/import)
Manual
✅
RunPod Serverless
Cloud API
~$0.10/job
Good (Trellis)
Moderate
✅ (DTR pipeline, 2nd fallback)
GCP Spot L4
Cloud API
~$0.12/job
Good (Trellis)
Moderate
✅ (DTR pipeline, 3rd fallback)
DTR fallback chain:Trellis (free HF) → RunPod ($0.10) → GCP ($0.12) → Tripo ($0.40)Rule: All photogrammetry must use U2Net masking.