Our approach to API rate limits

Updated over a week ago

Your rate limits depend on your usage tier and are currently measured by three metrics:

  1. Requests per minute (RPM)

  2. Input tokens per minute (ITPM)

  3. Output tokens per minute (OTPM)
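As an illustration of how the three metrics interact, here is a minimal client-side sketch in Python that tracks all three over a rolling 60-second window. The class name `MinuteBudget`, its methods, and the limit values are hypothetical and chosen for illustration only. The API's own accounting may differ from a simple rolling window, so treat this as a rough local estimate rather than a mirror of server behavior.

```python
import time
from collections import deque


class MinuteBudget:
    """Hypothetical client-side tracker for RPM, ITPM, and OTPM.

    Assumes a simple rolling 60-second window; the server may count differently.
    """

    def __init__(self, rpm, itpm, otpm, clock=time.monotonic):
        self.limits = {"requests": rpm, "input_tokens": itpm, "output_tokens": otpm}
        self.events = deque()  # each entry: (timestamp, input_tokens, output_tokens)
        self.clock = clock

    def _prune(self, now):
        # Drop requests that are 60 seconds old or older.
        while self.events and now - self.events[0][0] >= 60.0:
            self.events.popleft()

    def usage(self):
        """Return current usage of each metric within the window."""
        self._prune(self.clock())
        return {
            "requests": len(self.events),
            "input_tokens": sum(e[1] for e in self.events),
            "output_tokens": sum(e[2] for e in self.events),
        }

    def exceeded_by(self, input_tokens, output_tokens):
        """Name the metrics that one more request of this size would push over its limit."""
        used = self.usage()
        extra = {
            "requests": 1,
            "input_tokens": input_tokens,
            "output_tokens": output_tokens,
        }
        return [
            name
            for name, limit in self.limits.items()
            if used[name] + extra[name] > limit
        ]

    def record(self, input_tokens, output_tokens):
        """Record a request that was actually sent."""
        self.events.append((self.clock(), input_tokens, output_tokens))
```

Because the limits are independent, a request can be blocked by any one of them: a few very large prompts can exhaust the input-token budget well before the request count is reached.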

If you exceed any of these rate limits, the API returns a 429 error describing which rate limit was exceeded, along with a retry-after header indicating how long to wait before retrying.
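One way to handle this response is to wait for the duration given in the retry-after header and then try again. The sketch below is a hedged illustration, not an official client: `call_with_retry`, the `send` callable, and the `(status, headers, body)` tuple shape are all assumptions made so that the example stays independent of any particular HTTP library. It also assumes the header carries a number of seconds and falls back to a default wait otherwise.

```python
import time
from typing import Callable, Mapping, Tuple

# Assumed response shape for this sketch: (status code, headers, body).
Response = Tuple[int, Mapping[str, str], str]


def call_with_retry(
    send: Callable[[], Response],
    max_attempts: int = 5,
    default_wait: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call send() and retry on 429, waiting as the retry-after header suggests."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    for attempt in range(1, max_attempts + 1):
        status, headers, body = send()

        # Return anything that is not a rate-limit error, or the last 429 we got.
        if status != 429 or attempt == max_attempts:
            return status, headers, body

        # HTTP header names are case-insensitive, so normalize before lookup.
        lowered = {key.lower(): value for key, value in headers.items()}
        try:
            wait = float(lowered.get("retry-after", default_wait))
        except ValueError:
            # Not a plain number of seconds; fall back to the default wait.
            wait = default_wait

        sleep(max(wait, 0.0))

    raise RuntimeError("unreachable")  # the loop always returns
```

In real use, `send` would wrap whichever HTTP client you already use. Capping the number of attempts keeps a persistently rate-limited caller from looping forever.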

Rate limits are set at the organization level and are defined by usage tiers. Each tier has different spend and rate limits, with automatic tier advancement based on usage thresholds up to Tier 4.

You can view your organization's current tier and limits in the Anthropic Console.

More information on usage tiers and rate limits can be found in our API docs.
