
Conversation

@ishaan-jaff (Contributor) commented Nov 12, 2025

[Feat] Dynamic Rate Limiter - Allow defining a rate limit policy by model + error

Fixes LIT-1389

This PR implements a dynamic rate limit policy that gives fine-grained control over when rate limits are enforced, based on provider-specific error thresholds. When dynamic rate limiting is enabled (rpm_limit_type: "dynamic" or tpm_limit_type: "dynamic"), rate limits are enforced only after a provider's configured error-type thresholds are exceeded.

model_list:
  - model_name: gpt-4
    litellm_params:
      model: openai/gpt-4
  - model_name: claude-3-sonnet
    litellm_params:
      model: bedrock/anthropic.claude-3-sonnet-20240229-v1:0

litellm_settings:
  # Define provider-specific error thresholds
  dynamic_rate_limit_policy:
    openai:
      BadRequestErrorThreshold: 3
      RateLimitErrorThreshold: 5
      TimeoutErrorThreshold: 2
    bedrock:
      ContentPolicyViolationErrorThreshold: 4
      RateLimitErrorThreshold: 10
    azure:
      BadRequestErrorThreshold: 5
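
The enforcement behavior described above can be sketched roughly as follows. This is a minimal illustration of the threshold idea, not the PR's actual implementation; the class and method names are hypothetical:

```python
# Hypothetical sketch: enforce limits only once a provider's error
# counts cross the configured thresholds. Names are illustrative.
from collections import defaultdict


class DynamicRateLimitPolicy:
    """Tracks per-provider error counts and decides when to enforce limits."""

    def __init__(self, policy: dict):
        # policy shape mirrors the config above:
        # {"openai": {"RateLimitErrorThreshold": 5, ...}, ...}
        self.policy = policy
        self.error_counts = defaultdict(lambda: defaultdict(int))

    def track_failure(self, provider: str, error_type: str) -> None:
        self.error_counts[provider][error_type] += 1

    def should_enforce(self, provider: str) -> bool:
        """True once any configured threshold for this provider is reached."""
        thresholds = self.policy.get(provider, {})
        for name, threshold in thresholds.items():
            # "RateLimitErrorThreshold" -> "RateLimitError"
            error_type = name.removesuffix("Threshold")
            if self.error_counts[provider][error_type] >= threshold:
                return True
        return False


policy = DynamicRateLimitPolicy({"openai": {"RateLimitErrorThreshold": 5}})
for _ in range(5):
    policy.track_failure("openai", "RateLimitError")
print(policy.should_enforce("openai"))   # True: threshold of 5 reached
print(policy.should_enforce("bedrock"))  # False: no thresholds configured
```

A provider with no thresholds configured never triggers enforcement under this sketch, which matches the opt-in, per-provider shape of the config.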

Relevant issues

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have added testing in the tests/litellm/ directory (adding at least 1 test is a hard requirement - see details)
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests via make test-unit
  • My PR's scope is as isolated as possible; it only solves 1 specific problem

Type

🆕 New Feature
✅ Test

Changes

@vercel vercel bot commented Nov 12, 2025

The latest updates on your projects. Learn more about Vercel for GitHub.

Project: litellm · Deployment: Error · Preview: Error · Updated (UTC): Nov 12, 2025 2:36am

Comment on lines +1553 to +1554
f"[Dynamic Rate Limit] Tracked failure for deployment {deployment_id}, "
f"provider {custom_llm_provider}, error type {error_type}"

Check failure

Code scanning / CodeQL

Clear-text logging of sensitive information High

This expression logs sensitive data (password) as clear text.

Copilot Autofix (AI, 1 day ago)

To fix this problem, avoid logging sensitive identifiers such as API keys, their hashes, deployment IDs (if derived from credentials), or any values that originate from authorization secrets. In this context, the deployment_id logged on line 1553 potentially exposes sensitive data, so we should redact it or replace it with a non-sensitive stand-in (e.g., "[REDACTED]"). Changing only the log output while keeping the rest of the logic intact preserves functionality.

How to fix:

  • On line 1553, when logging, mask or redact the deployment_id to prevent leakage of sensitive information.
  • You may use a general placeholder like "[REDACTED]" or something more descriptive, depending on real use-case knowledge.
  • No new imports are needed, and the edit only affects the single logging line.

Files/regions to change:

  • Edit the logging statement in litellm/proxy/hooks/parallel_request_limiter_v3.py, specifically lines 1553-1555.
Suggested changeset 1
litellm/proxy/hooks/parallel_request_limiter_v3.py

Autofix patch
Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/litellm/proxy/hooks/parallel_request_limiter_v3.py b/litellm/proxy/hooks/parallel_request_limiter_v3.py
--- a/litellm/proxy/hooks/parallel_request_limiter_v3.py
+++ b/litellm/proxy/hooks/parallel_request_limiter_v3.py
@@ -1550,7 +1550,7 @@
                                 error_type=error_type,
                             )
                             verbose_proxy_logger.debug(
-                                f"[Dynamic Rate Limit] Tracked failure for deployment {deployment_id}, "
+                                f"[Dynamic Rate Limit] Tracked failure for deployment [REDACTED], "
                                 f"provider {custom_llm_provider}, error type {error_type}"
                             )
                         
EOF
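
A more reusable alternative to a bare "[REDACTED]" placeholder is to log a short, non-reversible fingerprint of the identifier, so log lines stay correlatable without exposing the raw value. A minimal sketch (the helper name is illustrative, not part of the patch):

```python
# Hypothetical helper for the CodeQL finding above: log a stable
# fingerprint of an identifier instead of the raw value.
import hashlib


def redact_id(value: str, keep: int = 6) -> str:
    """Return a short SHA-256 prefix so repeated log lines for the same
    identifier can still be correlated without leaking the identifier."""
    digest = hashlib.sha256(value.encode()).hexdigest()
    return f"sha256:{digest[:keep]}"


print(redact_id("deployment-abc123"))  # same input always yields same tag
```

The log statement would then emit f"... deployment {redact_id(deployment_id)}, ..." instead of either the raw ID or a fixed placeholder.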

rpm_limit_type = metadata.get("rpm_limit_type")
tpm_limit_type = metadata.get("tpm_limit_type")

# Get dynamic rate limit policy from general_settings
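
The lookup in the snippet above might compose with the policy roughly like this. The settings and metadata shapes here are assumptions for illustration, not the PR's actual code:

```python
# Hedged sketch: resolve the per-provider policy only when a dynamic
# limit type is set. Shapes of metadata/general_settings are assumed.
def resolve_dynamic_policy(metadata: dict, general_settings: dict):
    """Return the dynamic rate limit policy, or None for static limits."""
    rpm_limit_type = metadata.get("rpm_limit_type")
    tpm_limit_type = metadata.get("tpm_limit_type")
    if "dynamic" not in (rpm_limit_type, tpm_limit_type):
        return None  # static limits: skip the policy lookup entirely
    return general_settings.get("dynamic_rate_limit_policy", {})


policy = resolve_dynamic_policy(
    {"rpm_limit_type": "dynamic"},
    {"dynamic_rate_limit_policy": {"openai": {"RateLimitErrorThreshold": 5}}},
)
print(policy)  # {'openai': {'RateLimitErrorThreshold': 5}}
```

Note the PR description places the policy under litellm_settings while this comment mentions general_settings; which settings object is authoritative is left to the PR itself.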
Is this only going to work with rpm_limit_type/tpm_limit_type at the key level? We also have these at the team level.


3 participants