l0-cache

A lightweight CLI proxy written in Rust that reduces LLM token consumption by filtering, truncating, and compressing command output. Designed for AI coding assistants (Claude Code, Gemini CLI, Cursor) running on macOS, Linux, and remote servers.

The Problem

AI coding assistants read the full output of every shell command. A single cargo test or git log can produce thousands of lines, consuming tokens that add no value. The relevant information is almost always in the first few lines (headers, command echo) and the last few lines (errors, summary).

The Solution

l0-cache wraps any command and applies a pipeline of universal filters:

command output
  --> ANSI escape stripping
  --> line collapsing (identical & prefix-based) (×N)
  --> unified-diff context collapsing (keep changes, drop unchanged runs)
  --> blank line squeezing
  --> head/tail buffering (30 + 30 lines)
  --> metrics logging
  --> filtered output

Typical savings: 50-80% fewer tokens per command invocation.

How it compares

l0-cache is universal-first: it compresses any command's output with generic, format-aware filters instead of maintaining a parser per tool. That trades some best-case ratio on well-known commands for zero per-tool maintenance and instant support for unknown/proprietary CLIs. Where it stands apart is the intelligence around the filtering.

	l0-cache	rtk	snip	Lean Ctx
Filtering	universal + format-aware (head/tail, collapse, diff)	100+ per-command parsers	127 YAML filters	MCP server + cache
Works on unknown/custom commands	✅	partial	partial	partial
Adaptive auto-tuning (by exit code)	✅	❌	❌	❌
Safety guard (blocks dangerous cmds)	✅	❌	❌	❌
Full-output recovery on failure	✅ `--recover`	✅	❌	✅ (cache)
Transparent hook	Claude Code, Gemini CLI	many	many	MCP
Per-command config	JSON	TOML	YAML	—
Stats: dashboard / `--discover` / JSON	✅	✅	✅	partial
New runtime dependencies	none	none	none (Go)	—

Uniquely ours: failure-backoff / success-decay auto-tuning that reacts to exit codes, a guard that blocks rm -rf / / reverse shells / credential exfiltration / DROP DATABASE, systematic credential redaction in telemetry, and tested hardening (bounded memory, signal/process-group forwarding, OOM caps).

Where the others win: deeper per-command semantic output (e.g. grouped test failures) and a longer list of pre-wired agents. For maximum ratio on a fixed set of known commands, rtk/snip go further; for a safe, adaptive, zero-maintenance proxy that works on everything, that's l0-cache.

Installation

Quick Install (non-interactive)

Install l0-cache locally to ~/.local/bin/ with a single command:

curl -fsSL https://raw.githubusercontent.com/fabriziosalmi/l0-cache/master/install.sh | bash

Interactive Installer

Clone the repository and run the setup script for a guided interactive install:

git clone https://github.com/fabriziosalmi/l0-cache.git
cd l0-cache
./install.sh

Manual Build

cargo build --release
cp target/release/l0-cache /usr/local/bin/

Verify

l0-cache --version
# l0-cache 0.1.0 (abc1234)

Usage

# Filtered output (default: 30 head + 30 tail, threshold 100 lines)
l0-cache cargo test

# Full output, metrics still logged
l0-cache --raw cargo test

# Interactive commands pass through unchanged
l0-cache -i vim file.txt

# Token savings report
l0-cache --stats
l0-cache --stats --since 7d

# Custom head/tail
l0-cache --head 50 --tail 50 cargo build

# More tail lines on error (default: 120)
l0-cache --tail-error 200 cargo test

# Auto-tuning is enabled by default. To disable it:
l0-cache --no-auto cargo test

# Diagnose system installation, shell configuration, and active LLM editors
l0-cache --doctor

# Custom success optimization floor and failure backoff ceiling
l0-cache --auto-floor 15 --auto-ceiling 500 cargo test

# Custom token divisor ratio (e.g. 8 bytes per token)
l0-cache --token-factor 8 cargo test

Options

--raw                Print output verbatim (no head/tail truncation, no collapsing
                     or JSON squashing); ANSI is still stripped and a 1 MB/line and
                     256 MB total OOM cap still apply. Metrics are still logged.
-i, --interactive    Passthrough mode (stdin/stdout/stderr inherited)
--head N             Lines to keep from start (default: 30)
--tail N             Lines to keep from end (default: 30)
--tail-error N       Tail lines on non-zero exit (default: 120)
--threshold N        Only truncate if output exceeds N lines (default: 100)
--only-errors        Keep only lines matching error/warn/fail/panic/exception/etc.
--recover            On a failing command whose output was truncated, save the full
                     output to a temp file and point to it in the banner (so the agent
                     can read the omitted lines without re-running). Off by default.
--idle-timeout N     SIGKILL the command (and its process group) after N seconds with
                     no output (prevents interactive-prompt deadlocks). 0 = off.
--no-auto            Disable adaptive auto-tuning of parameters
--quiet, -q          Suppress l0-cache's own stderr notices (e.g. auto-tuning)
--guard              Force-enable the safety guard (see "Safety Guard" below)
--no-guard           Force-disable the safety guard
--doctor             Diagnose system installation, shell environment, and active LLM editors
--auto-floor N       Floor for success optimization decay (default: 10)
--auto-ceiling N     Ceiling for failure backoff tail expansion (default: 1000)
--token-factor N     Divisor for token estimation (default: 4)
--stats              Show token savings report
--since DURATION     Filter stats/discover (e.g. 7d, 24h, 30m)
--discover           Show an optimization advisory (keep / drop / footprint) from metrics
--json               Output --stats as JSON instead of the dashboard
--cost-per-mtok N    USD per million tokens; when > 0, show cost saved in --stats/--discover
--reset-stats        Delete ALL recorded telemetry (destructive, cannot be undone)
--completions SHELL  Generate shell completions (bash, zsh, fish, elvish, powershell)
--version            Print version with git commit hash

Telemetry Dashboard

l0-cache --stats renders an aggregated savings report — total runs, tokens saved, and per-command efficiency with proportional bars:

┌─ l0-cache TELEMETRY ───────────────────────────────── last 7d ─┐
│ Runs        35                                                 │
│ Saved       12.5k  of 17.4k raw                                │
│ Efficiency   71.8%  █████████████████░░░░░░░                   │
├────────────────────────────────────────────────────────────────┤
│ COMMAND     RUNS   SAVED  EFFIC. IMPACT                        │
│ cargo         15    8.2k   78.5% █████████░░░  ↑ best          │
│ git           12    3.1k   65.3% ████████░░░░                  │
│ npm            8    1.2k   54.2% ██████░░░░░░                  │
└────────────────────────────────────────────────────────────────┘

--doctor shares the same boxed visual language for its health report. Color is emitted only on an interactive terminal — piping or redirecting (or setting NO_COLOR) yields clean, escape-free text; FORCE_COLOR=1 forces it on for CI captures.

Per-command configuration (optional)

There is no config file by default. When you want different head/tail budgets per command — without per-tool parsers — drop a small JSON file at $XDG_CONFIG_HOME/l0-cache/config.json (or ~/.config/l0-cache/config.json):

{
  "defaults": { "recover": true },
  "commands": {
    "cargo":  { "tail_error": 300, "head": 50 },
    "git":    { "head": 10, "tail": 40 },
    "docker": { "head": 10, "tail": 80 }
  }
}

Tunable keys per command: head, tail, tail_error, threshold, only_errors, recover. Commands are matched by resolved name (so sh -c "cargo test" matches cargo). Precedence is explicit CLI flag > config > built-in default, and auto-tuning then adjusts from the resolved base. A missing or malformed file is ignored (with one stderr note), never fatal.

Claude Code Integration (optional)

Normally you (or your AI assistant) prefix a command with l0-cache explicitly. For Claude Code, the bundled claude-hook.sh can do that for you, transparently: it installs a PreToolUse hook that rewrites the simple Bash commands Claude Code runs so they go through l0-cache — the model never has to prefix anything.

It is off by default and designed to stay out of the way:

Conservative — only a single, simple program invocation is ever wrapped. Anything with shell operators (&&, ||, ;, |, redirects, $(...), backticks, &), multiple lines, stateful builtins (cd, export, source, eval, exec, set, …), shell constructs (for/while/if/case), or interactive/TUI/REPL programs (vim, less, ssh, python, psql, …) is passed through untouched. Already-wrapped commands are left as-is.
Fail-safe — if l0-cache or jq is missing, or anything errors, the command runs unchanged. The hook never blocks a command and never sets a permissionDecision, so wrapped commands still go through your normal Claude Code permissions.
Runtime toggle — enable/disable instantly, no restart.

./claude-hook.sh install     # write the wrapper + register the hook (idempotent; needs jq)
./claude-hook.sh enable      # turn it ON  (instant)
./claude-hook.sh disable     # turn it OFF (instant)
./claude-hook.sh status      # show install/enabled state + l0-cache version
./claude-hook.sh uninstall   # remove the hook registration and wrapper

Activation: after install (or any change to settings.json), start a new Claude Code session so the hook is loaded — hooks are read at session startup. The enable/disable toggle then takes effect immediately.

The hook honors $CLAUDE_CONFIG_DIR and $XDG_CONFIG_HOME. It edits Claude Code's settings.json (saving a timestamped backup) and stores its on/off state as an empty toggle file at ~/.config/l0-cache/hook.enabled.

Note

l0-cache is not a persistent cache — it filters output on the fly and does not store results to replay. The only thing written to disk is the metrics log (see Metrics). If a session shows no savings, the hook simply never wrapped a command in it — confirm with l0-cache --stats and ./claude-hook.sh status.

Other agents (Gemini CLI)

Transparent wrapping needs a hook that can rewrite the command. Two agents support that today — Claude Code (PreToolUse) and Gemini CLI (BeforeTool/run_shell_command) — and agent-hook.sh installs the same conservative, fail-safe wrapper for either (it also enables --recover):

./agent-hook.sh install gemini    # or: install claude   (default)
./agent-hook.sh enable            # shared on/off toggle for all installed agents
./agent-hook.sh status gemini

Cursor and most other agents expose a hook that can only allow/deny a command, not rewrite it, so they cannot be wrapped transparently. For those, agent-rules.sh install cursor|cline|copilot|codex drops a project rule telling the model to prefix noisy read-only commands with l0-cache (or agent-rules.sh print to paste it anywhere). This is best-effort (model-dependent), not a hard hook.

Safety Guard

When l0-cache detects it is running inside an AI coding assistant (Claude Code, Gemini CLI, Cursor/VS Code terminals), it enables a best-effort guard that blocks a few obviously destructive commands before they run, exiting with code 126:

recursive force-removal of a critical system path (rm -rf /, /etc, /usr, …), including inside sh -c "…" wrappers and with trailing-slash/glob variants;
reverse shells / socket redirections (/dev/tcp, /dev/udp);
credential exfiltration (curl/wget/nc/ssh touching id_rsa, .env, shadow, …);
DROP DATABASE via psql/mysql/sqlite3/sqlcmd.

Control it explicitly with --guard / --no-guard, or the L0_CACHE_GUARD environment variable (1/true/on to force on, 0/false/off to force off). Precedence: --no-guard → --guard → L0_CACHE_GUARD → auto-detect.

This is a guard rail, not a sandbox. It pattern-matches argv and shell payloads and can be bypassed by a determined caller — do not rely on it as a security boundary. Bypass an intentional command with --no-guard.

Architecture

Single-threaded, synchronous design. Zero async. The only thread ever spawned is an optional output-inactivity watchdog, and only when --idle-timeout is set.

l0-cache <command>
  |
  +-- sh -c '<command> 2>&1'     # merge stderr into stdout
  |
  +-- read_line_lossy()          # UTF-8 lossy, 1MB line cap
  |
  +-- FilterPipeline             # streaming, O(head+tail) memory
  |     |-- strip_ansi()
  |     |-- CollapseLines
  |     |-- WhitespaceSqueeze
  |     +-- HeadTailBuffer
  |
  +-- write_output()             # BrokenPipe-safe
  |
  +-- append_metric()            # JSONL, O_APPEND, 0600 perms
  |
  +-- exit(child_exit_code)      # 128+N for signal-killed processes

Memory Model

Filtered mode: O(head + tail) lines in memory, regardless of output size
Raw mode: capped at 256 MB, then truncation with warning
Line length: capped at 1 MB per line to prevent OOM on binary input

Signal Handling

The captured child runs in its own process group. l0-cache installs SIGINT and SIGTERM handlers that forward the signal to that group, so Ctrl-C and a directed kill <pid> (or timeout, systemd, docker stop) terminate the whole child subtree — not just the sh wrapper — and l0-cache then propagates the child's status.
SIGPIPE: ignored in l0-cache, BrokenPipe handled in code so metrics are logged before exit
Exit codes: POSIX 128+N convention for signal-killed children

Metrics

Each invocation logs a JSON line to ~/.local/share/l0-cache/metrics.jsonl:

{
  "ts": "2024-01-15T10:30:00Z",
  "cmd": "cargo",
  "args": "test --all",
  "bytes_raw": 15000,
  "bytes_final": 3000,
  "tokens_raw": 3750,
  "tokens_final": 750,
  "tokens_saved": 3000,
  "lines_raw": 500,
  "lines_final": 62,
  "truncated": true,
  "strategy": "head_tail",
  "exit_code": 0,
  "duration_ms": 1234,
  "version": "0.1.0"
}

Data directory resolution: $XDG_DATA_HOME/l0-cache/ then $HOME/.local/share/l0-cache/ then /etc/passwd lookup (for containers, cron, systemd).

File permissions are set to 0600. Auto-rotation at 10 MB.

Cross-Platform Support

Environment	Build target	Status
macOS arm64/x86_64	native	Tested
Ubuntu 22.04 / 24.04	`x86_64-unknown-linux-gnu`	Tested
Alpine Linux	`x86_64-unknown-linux-musl`	Static binary
LXC container	same as host	`/etc/passwd` fallback
Proxmox VE	same as host	Works in host and guests
cron / systemd	same as host	`/etc/passwd` fallback
SSH with PTY	same as host	Signals work normally
SSH without PTY	same as host	See known limitations

Cross-Compilation from macOS

# Prerequisites
brew install filosottile/musl-cross/musl-cross
rustup target add x86_64-unknown-linux-gnu
rustup target add x86_64-unknown-linux-musl

# Build
make build-linux    # glibc, dynamic
make build-alpine   # musl, fully static

# Deploy
make deploy-linux HOST=user@server
make deploy-alpine HOST=user@alpine-host

Shell Completions

# Bash
l0-cache --completions bash > /etc/bash_completion.d/l0-cache

# Zsh
l0-cache --completions zsh > ~/.zsh/completions/_l0-cache

# Fish
l0-cache --completions fish > ~/.config/fish/completions/l0-cache.fish

Known Limitations

SSH without PTY: when running ssh host l0-cache cargo build (no -t flag), Ctrl-C may not reach the child process. Use ssh -t host l0-cache cargo build instead.
Binary output: detected on the first 8 KB. If a command produces text followed by binary data after 8 KB, the binary portion is processed as text (with UTF-8 lossy conversion).
Shell requirement: l0-cache requires /bin/sh or /usr/bin/sh for the 2>&1 merge. In distroless containers without a shell, use l0-cache -i for passthrough mode (no stderr merge, no filtering).

Hardening

The following protections are in place for production use across diverse environments:

UTF-8 lossy reads: never drops lines on invalid encoding
Line length cap: 1 MB per line prevents OOM on binary/minified input
Raw mode cap: 256 MB prevents OOM on massive output
SIGPIPE handling: metrics are always logged, even when piped to head
Exit code 128+N: POSIX-correct for signal-killed processes
$HOME fallback: /etc/passwd lookup for containers, cron, systemd
Metrics rotation: auto-rename to .old at 10 MB
File permissions: 0600 on metrics file
Shell check: clear error when /bin/sh is missing

Development

cargo test           # 186 tests (unit + E2E integration)
cargo clippy         # 0 warnings enforced
cargo build --release

License

MIT. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github/workflows		.github/workflows
docs		docs
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
agent-hook.sh		agent-hook.sh
agent-rules.sh		agent-rules.sh
benchmark.sh		benchmark.sh
build.rs		build.rs
claude-hook.sh		claude-hook.sh
install.sh		install.sh
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

l0-cache

The Problem

The Solution

How it compares

Installation

Quick Install (non-interactive)

Interactive Installer

Manual Build

Verify

Usage

Options

Telemetry Dashboard

Per-command configuration (optional)

Claude Code Integration (optional)

Other agents (Gemini CLI)

Safety Guard

Architecture

Memory Model

Signal Handling

Metrics

Cross-Platform Support

Cross-Compilation from macOS

Shell Completions

Known Limitations

Hardening

Development

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

l0-cache

The Problem

The Solution

How it compares

Installation

Quick Install (non-interactive)

Interactive Installer

Manual Build

Verify

Usage

Options

Telemetry Dashboard

Per-command configuration (optional)

Claude Code Integration (optional)

Other agents (Gemini CLI)

Safety Guard

Architecture

Memory Model

Signal Handling

Metrics

Cross-Platform Support

Cross-Compilation from macOS

Shell Completions

Known Limitations

Hardening

Development

License

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages