size escrow off real prompt + output cap with headroom + per-job ceiling by ffaerber · Pull Request #16 · token-for-token/t4t

ffaerber · 2026-05-25T17:58:18Z

Previous maxPayment formula padded by a fixed 1M tokens on each side off
max_tokens (default 1024). That under-budgets long-context requests
(Gemini 2M, GPT-5 long inputs): the provider's honest claimJob reverts on
PaymentTooHigh, the gateway times out, and the provider gets slashed for
an honest client-sized prompt.

New compute_max_payment sizes the escrow off the estimated prompt length
(chars/4 fallback) and the requested or default output cap, each padded
by T4T_ESCROW_HEADROOM_RATIO. Optional T4T_MAX_ESCROW_PER_JOB rejects
oversized requests with HTTP 413 instead of locking that much xBZZ.
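
A minimal sketch of the sizing rule described above, not the PR's actual code. Several details are assumptions made for illustration: the headroom ratio is treated as a multiplier (e.g. 1.25), prices are integers per million tokens, message content is plain strings, and the `EscrowTooLarge` exception and all parameter names are hypothetical.

```python
import math


class EscrowTooLarge(Exception):
    """Hypothetical error the gateway would map to HTTP 413."""


def estimate_prompt_tokens(messages):
    # chars/4 fallback: roughly four characters per token.
    chars = sum(len(str(m.get("content", ""))) for m in messages)
    return math.ceil(chars / 4)


def compute_max_payment(messages, max_tokens, price_in_per_m, price_out_per_m,
                        headroom_ratio=1.25, default_max_tokens=1024,
                        max_escrow_per_job=None):
    """Size the escrow from the prompt estimate and the output cap.

    headroom_ratio stands in for T4T_ESCROW_HEADROOM_RATIO and
    max_escrow_per_job for T4T_MAX_ESCROW_PER_JOB (None = no ceiling).
    """
    prompt_tokens = estimate_prompt_tokens(messages)
    output_cap = max_tokens if max_tokens is not None else default_max_tokens

    # Pad each side by the headroom ratio (assumed multiplicative).
    padded_in = math.ceil(prompt_tokens * headroom_ratio)
    padded_out = math.ceil(output_cap * headroom_ratio)

    # Prices are per million tokens; round the total up, never down.
    scaled = padded_in * price_in_per_m + padded_out * price_out_per_m
    payment = -(-scaled // 1_000_000)

    if max_escrow_per_job is not None and payment > max_escrow_per_job:
        raise EscrowTooLarge(f"escrow {payment} exceeds ceiling {max_escrow_per_job}")
    return payment
```

Raising before any funds are locked is what lets the gateway answer with 413 instead of escrowing an oversized amount.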


…ment

Two defensive changes so an honest provider can't get slashed for an honest workload that overshoots the escrow:

1. Before calling chatCompletion, worker derives the maximum completion tokens the on-chain maxPayment can pay for (given the provider's declared per-million prices and a conservative chars/4 prompt estimate), and lowers req.max_tokens if it's higher (or absent). Any backend that honors max_tokens then physically cannot produce a response whose actualPayment would exceed maxPayment.

2. In the claim path, re-read the on-chain Job to get the authoritative maxPayment (defense against a gateway that tampers with notify.body), then clip actualPayment = min(actual, maxPayment). A backend that ignores max_tokens still claims what it can instead of reverting with PaymentTooHigh and burning to timeoutJob's 3x slash.
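
The two worker-side guards can be sketched as follows. This is an illustration under assumptions, not the worker's real code: the function names are hypothetical, prices are integers per million tokens with a non-zero output price, and the on-chain maxPayment is passed in as a plain integer rather than read from the contract.

```python
import math


def cap_max_tokens(requested, prompt_chars, max_payment,
                   price_in_per_m, price_out_per_m):
    """Return the max_tokens value the worker should send to the backend.

    Assumes price_out_per_m > 0. `requested` may be None (absent).
    """
    # Conservative chars/4 prompt estimate.
    prompt_tokens = math.ceil(prompt_chars / 4)

    # Work in units scaled by 1e6 to stay in integer arithmetic.
    budget = max_payment * 1_000_000 - prompt_tokens * price_in_per_m
    affordable = max(0, budget // price_out_per_m)

    # Lower the cap only if the request is higher or absent.
    if requested is None:
        return affordable
    return min(requested, affordable)


def clip_actual_payment(actual, on_chain_max_payment):
    # on_chain_max_payment must come from re-reading the Job on chain,
    # not from the gateway's notify body.
    return min(actual, on_chain_max_payment)
```

The first guard covers backends that honor max_tokens; the second keeps the claim from reverting when a backend ignores it.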

claude added 2 commits May 23, 2026 09:03


ffaerber wants to merge 2 commits into main from claude/token-limits-proxy-gateway-Mr36B
