@echobt (Contributor) commented Jan 21, 2026

This PR adds rate limiting to the API server, which previously had none.

Changes

  • Added the tower-governor dependency for rate limiting.
  • Configured the rate-limiting middleware in run_server to allow 60 requests per second with a burst size of 10.
  • This mitigates simple DoS attacks and resource exhaustion caused by abuse of the embedding generation endpoint.
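
To make the "60 per second, burst of 10" numbers concrete, here is a minimal standalone sketch of the Generic Cell Rate Algorithm (GCRA), the algorithm implemented by the governor crate that tower-governor builds on. It is an illustrative model only, not the code in this PR and not the crate's implementation. The Gcra type, its fields, and try_acquire are hypothetical names, and time is passed in explicitly so the behaviour is deterministic.

```rust
use std::time::Duration;

/// Simplified GCRA limiter, for illustration only (hypothetical type).
struct Gcra {
    /// Time between replenished cells: 1 s / rate.
    interval: Duration,
    /// How far ahead of schedule a burst may run: (burst - 1) * interval.
    tolerance: Duration,
    /// Theoretical arrival time of the next conforming request.
    tat: Duration,
}

impl Gcra {
    /// Assumes `per_second` and `burst` are both non-zero.
    fn new(per_second: u32, burst: u32) -> Self {
        let interval = Duration::from_secs(1) / per_second;
        Self {
            interval,
            tolerance: interval * (burst - 1),
            tat: Duration::ZERO,
        }
    }

    /// `now` is the time elapsed since the limiter was created.
    /// Returns true if the request is accepted.
    fn try_acquire(&mut self, now: Duration) -> bool {
        let tat = self.tat.max(now);
        if tat.saturating_sub(self.tolerance) > now {
            return false; // too far ahead of schedule: reject
        }
        self.tat = tat + self.interval;
        true
    }
}

fn main() {
    let mut limiter = Gcra::new(60, 10);
    // 15 requests arriving at the same instant: only the burst is accepted.
    let burst = (0..15)
        .filter(|_| limiter.try_acquire(Duration::ZERO))
        .count();
    // 100 ms later, 60/s * 0.1 s = 6 cells have been replenished.
    let later = (0..15)
        .filter(|_| limiter.try_acquire(Duration::from_millis(100)))
        .count();
    // prints "accepted at t=0: 10, accepted at t=100ms: 6"
    println!("accepted at t=0: {burst}, accepted at t=100ms: {later}");
}
```

In this model a client can send 10 requests back to back, after which further requests are accepted at one per 1/60 s. One thing to double-check when mapping these numbers onto the real configuration: as far as I can tell from the tower-governor docs, the builder's per_second argument is the replenish interval in seconds rather than a requests-per-second count, so a 60-per-second limit would need a sub-second period instead.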

Verification

  • Verified that the rate limiting middleware is correctly applied.
  • Ran existing tests to ensure no regressions.

This commit adds rate limiting to the API server using tower-governor. It limits requests to 60 per second with a burst of 10 to prevent DoS attacks and resource exhaustion.