
Conversation


@sujal12344 sujal12344 commented Oct 25, 2025

PR Checklist

Please check if your PR fulfills the following requirements:

Bugs / Features

What is the current behavior?

Currently, VoltAgent has no built-in mechanism to control the frequency of LLM calls and tool executions. This can lead to:

  • Exceeding API rate limits from LLM providers (OpenAI, Anthropic, Google, etc.)
  • Unexpected API throttling or account suspension
  • Uncontrolled costs in production environments
  • Poor resource management in multi-tenant scenarios
  • No way to implement custom usage quotas

Users must implement rate limiting manually in their application code, which is:

  • Error-prone and inconsistent across different agents
  • Difficult to test and maintain
  • Not integrated with VoltAgent's observability features

What is the new behavior?

  1. Configurable Rate Limiting

    • Control LLM calls globally or per provider
    • Limit tool executions individually
    • Support for requests-per-minute constraints
  2. Flexible Strategies

    • Fixed Window Counter (MVP)
    • Option to throw error immediately or delay until quota resets
  3. Multi-Scope Support

    • Global LLM limits
    • Provider-specific limits (OpenAI, Anthropic, etc.)
    • Tool-specific limits
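To illustrate the core mechanism, here is a minimal sketch of a fixed-window counter supporting both `onExceeded` strategies. The class and method names are illustrative, not the PR's actual implementation; only `maxRequestsPerMinute` and `onExceeded: "throw" | "delay"` mirror the configuration discussed in this PR.

```typescript
type OnExceeded = "throw" | "delay";

interface RateLimitOptions {
  maxRequestsPerMinute: number;
  onExceeded: OnExceeded;
}

// Fixed Window Counter: count requests per 60s window; reset when a new
// window begins. Simple, but allows bursts at window boundaries.
class FixedWindowRateLimiter {
  private windowStart = 0;
  private count = 0;
  private static readonly WINDOW_MS = 60_000;

  constructor(private readonly opts: RateLimitOptions) {}

  /** Try to take one slot; true if within quota for the current window. */
  tryAcquire(now: number = Date.now()): boolean {
    if (now - this.windowStart >= FixedWindowRateLimiter.WINDOW_MS) {
      this.windowStart = now; // new window: reset the counter
      this.count = 0;
    }
    if (this.count < this.opts.maxRequestsPerMinute) {
      this.count += 1;
      return true;
    }
    return false;
  }

  /** Milliseconds until the current window resets. */
  msUntilReset(now: number = Date.now()): number {
    return Math.max(
      0,
      this.windowStart + FixedWindowRateLimiter.WINDOW_MS - now,
    );
  }

  /** Apply the configured strategy: throw immediately, or wait for reset. */
  async acquire(): Promise<void> {
    while (!this.tryAcquire()) {
      if (this.opts.onExceeded === "throw") {
        throw new Error("Rate limit exceeded");
      }
      await new Promise((resolve) => setTimeout(resolve, this.msUntilReset()));
    }
  }
}
```

A per-scope setup would keep one such limiter per provider and per tool, checking the relevant limiter before each LLM call or tool execution.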

Example Project (examples/with-rate-limiting/)

  • 6 working examples demonstrating all features:
    1. Basic LLM rate limiting (throw strategy)
    2. Delay strategy (auto-wait behavior)
    3. Provider-specific limits
    4. Tool-specific limits
    5. Combined LLM + tool limits
    6. Monitoring and statistics
  • Ready-to-run with Google Gemini API
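A combined LLM + tool configuration (example 5) might look like the sketch below. The `providers` entry shape (`maxRequestsPerMinute`, `onExceeded`) follows the snippet discussed later in this thread; the `llm` and `tools` keys are assumptions about how global and tool-specific limits could be expressed.

```typescript
interface RateLimitRule {
  maxRequestsPerMinute: number;
  onExceeded: "throw" | "delay";
}

// Hypothetical top-level shape: global LLM limit, per-provider overrides,
// and per-tool limits. Field names beyond `providers` are assumptions.
interface RateLimitsConfig {
  llm?: RateLimitRule;
  providers?: Record<string, RateLimitRule>;
  tools?: Record<string, RateLimitRule>;
}

const rateLimits: RateLimitsConfig = {
  llm: { maxRequestsPerMinute: 10, onExceeded: "delay" },
  providers: {
    google: { maxRequestsPerMinute: 5, onExceeded: "throw" },
  },
  tools: {
    search: { maxRequestsPerMinute: 3, onExceeded: "delay" },
  },
};
```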

Fixes #5

Screenshots of the running project with rate limiting

(Six screenshots of the rate-limiting examples in action, captured Oct 25, 2025.)


changeset-bot bot commented Oct 25, 2025

⚠️ No Changeset found

Latest commit: 8de4af4

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types


@sujal12344 sujal12344 changed the title Feat/rate limit implementation feat/rate limit implementation Oct 25, 2025
Author

sujal12344 commented Oct 25, 2025

Hey @omeraplak @marinoska @necatiozmen , I’ve submitted a PR — #742
Could you please review it and share your feedback when you get a chance?

@sujal12344 sujal12344 mentioned this pull request Oct 25, 2025
@omeraplak
Member

Hey @sujal12344 ,
Thank you! I'll review it soon

@omeraplak
Member

Hey @sujal12344 , thanks a lot for the PR! 🔥

I have a few questions:

  1. Since we define rateLimits on the Agents and each agent has only one model, how does the providers field under rateLimits work?
rateLimits: {
  providers: {
    openai: {
      maxRequestsPerMinute: 5,
      onExceeded: "throw"
    },
    anthropic: {
      maxRequestsPerMinute: 3,
      onExceeded: "delay"
    }
  }
}

Is this meant for future support when dynamic providers are added?

  2. When defining rate limits for tools, is there a way to add IntelliSense support? That way, tool names could be resolved dynamically.
  3. It would be nice if the user could be notified via a hook when a rate limit is reached. Right now, onExceeded returns a decision, but maybe there could also be a hook with the same name?
  4. What do you think about defining a global rate limit through the new VoltAgent() instance itself?
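One way to get the IntelliSense behavior asked about above (a sketch only, not the PR's implementation): infer a type parameter from the agent's `tools` object and key the tool-limit map by it, so tool names autocomplete and typos become compile errors. `createAgent`, `AgentOptions`, and `ToolRateLimit` are hypothetical names.

```typescript
interface ToolRateLimit {
  maxRequestsPerMinute: number;
  onExceeded: "throw" | "delay";
}

// TTools is inferred from the `tools` object, so `rateLimits.tools`
// only accepts keys that are actual tool names.
interface AgentOptions<TTools extends Record<string, unknown>> {
  tools: TTools;
  rateLimits?: {
    tools?: Partial<Record<keyof TTools & string, ToolRateLimit>>;
  };
}

function createAgent<TTools extends Record<string, unknown>>(
  options: AgentOptions<TTools>,
): AgentOptions<TTools> {
  return options; // real construction logic elided
}

const agent = createAgent({
  tools: { search: {}, calculator: {} },
  rateLimits: {
    tools: {
      // "search" and "calculator" autocomplete; any other key is a type error.
      search: { maxRequestsPerMinute: 10, onExceeded: "delay" },
    },
  },
});
```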

