Add pre-flight rate limiting with token accounting by TTK95 · Pull Request #2 · TTK95/opencode

TTK95 · 2026-04-21T07:33:42Z

Extends the existing per-provider sliding-window rate limiter with token
windows and a pre-flight check() that blocks requests before they hit
the wire. Stops RWTH (and other) rate-limited providers from surfacing
generic "An internal error occurred" messages after a 429 — the limit
is learned on the first 429, persisted to opencode.json, and on
subsequent runs the pre-flight gate throws a RateLimitError with a
friendly "retry in Ns" message that the retry layer honours via resetAt.

rate-limit.ts: token windows (minute + day), check() gate, recordUsage,
estimateRequestTokens, configure() to seed from opencode.json options,
onRateLimitError persists token limits too
provider.ts fetch wrapper: pre-flight check, RateLimit.configure from
options.rateLimit, tick with token estimate
session/processor.ts: call RateLimit.recordUsage on finish-step so
the pending estimate is replaced with actual usage
session/retry.ts: RateLimitError is retryable; delay() honours resetAt
provider/error.ts: friendly message for 429 responses; mark retryable
session/message-v2.ts: RateLimitError NamedError, converts to APIError
shape in fromError so the SDK type surface is unchanged
config/provider.ts: tokensPerMinute / tokensPerDay schema fields
cli/cmd/stats.ts: RATE LIMITS section reads persisted limits

https://claude.ai/code/session_01S3gS4AgfQQBNHvgZqkuiRa

Extends the existing per-provider sliding-window rate limiter with token windows and a pre-flight check() that blocks requests before they hit the wire. Stops RWTH (and other) rate-limited providers from surfacing generic "An internal error occurred" messages after a 429 — the limit is learned on the first 429, persisted to opencode.json, and on subsequent runs the pre-flight gate throws a RateLimitError with a friendly "retry in Ns" message that the retry layer honours via resetAt. - rate-limit.ts: token windows (minute + day), check() gate, recordUsage, estimateRequestTokens, configure() to seed from opencode.json options, onRateLimitError persists token limits too - provider.ts fetch wrapper: pre-flight check, RateLimit.configure from options.rateLimit, tick with token estimate - session/processor.ts: call RateLimit.recordUsage on finish-step so the pending estimate is replaced with actual usage - session/retry.ts: RateLimitError is retryable; delay() honours resetAt - provider/error.ts: friendly message for 429 responses; mark retryable - session/message-v2.ts: RateLimitError NamedError, converts to APIError shape in fromError so the SDK type surface is unchanged - config/provider.ts: tokensPerMinute / tokensPerDay schema fields - cli/cmd/stats.ts: RATE LIMITS section reads persisted limits https://claude.ai/code/session_01S3gS4AgfQQBNHvgZqkuiRa

TTK95 merged commit ba0bc34 into dev Apr 21, 2026

TTK95 deleted the claude/rwth-api-rate-limits-algv0 branch April 21, 2026 07:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add pre-flight rate limiting with token accounting#2

Add pre-flight rate limiting with token accounting#2
TTK95 merged 1 commit into
devfrom
claude/rwth-api-rate-limits-algv0

TTK95 commented Apr 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

TTK95 commented Apr 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants