Fixes prompt cache regression in Claude Code that causes up to 20x cost increase on resumed sessions
Updated Jun 13, 2026 - JavaScript
Enhanced prompt cache and sticky session management plugin for OpenCode, especially effective for AI API relay/gateway setups with stable SHA256-based cache keys.
45% cost reduction measured. The only Claude Code plugin built from CC source analysis — cache expiry prevention, SubTask auto-delegation, zero-cost context restoration, real-time dashboard. Max Plan + API pay-per-use.
Freeze Claude Code's prompt prefix so DeepSeek's automatic cache always hits — alignment proxy + coalescing + keepalive, installable as a CC plugin. Measured 64% cheaper on real Claude Code traffic.
Context-aware caching proxy for Anthropic and OpenAI — maximizes prompt-cache hit rates by anchoring long-lived context into stable boundaries. Drop-in for Claude Code, Codex, Cursor, and custom harnesses.
Local-first memory and prompt-cache layer for Claude Code
Keep your Claude Code prompt cache warm during breaks. Saves up to 92% on cache rebuild costs.
Autonomous loop runner + Beads-powered methodology for AI coding agents. Carve code, not chaos.
Chinese-first project baseline handoff kit for AI-assisted development.
Local-first spend & cache-efficiency dashboard for Claude Code. Reads ~/.claude/projects, tells you exactly what's costing you. No telemetry, no API key.
Zero-dependency OpenAI-compatible API bridge for OpenAI & Anthropic, with prompt injection and an Anthropic prompt-cache keepalive. Bring your own key.
Diagnose your Anthropic prompt cache hit rate and find the exact reason you're missing it.
Cache-aware context surgery proxy for Claude Code — cut dead weight from history without breaking the prompt cache
Local facade and install scripts for OpenClaw third-party OpenAI-compatible Responses providers, with allowlisted routing and reversible setup.
A beautiful local terminal dashboard for Codex prompt cache visibility.
Project canonical Markdown docs into AI-native projections — llms.txt (Howard 2024 spec), llms-full.txt (segmented), AGENTS.md (AAIF/Linux Foundation Dec 2025), CLAUDE.md (Anthropic verbatim, ≤200 lin
Substrate memory: full-KB injection into long-context LLM. Recall as shape activation, not retrieval. Three integration paths (lens plugin / MCP / hook).
Multi-line, mode-switchable status line for Claude Code with prompt-cache TTL, hit rate, burn rate, skill/MCP usage
VS Code extension for claude-code-cache-fix — one-click interceptor activation
Heartbeat Monitor pattern against Claude Code 5-min prompt-cache TTL