Anthropic Confirms Rapid Depletion of Claude Code Quotas

Anthropic Confirms Rapid Depletion of Claude Code Quotas

Users of Claude Code, the AI-powered coding assistant from Anthropic, are facing significant challenges due to rapid token usage and unexpected depletion of quotas. This situation has disrupted daily workflows for many developers who rely on the service.

Anthropic Acknowledges Quota Issues

Anthropic has officially recognized the challenges users are facing. The company stated, “people are hitting usage limits in Claude Code way faster than expected. We’re actively investigating… it’s the top priority for the team.”

User Experiences on Social Platforms

On platforms such as Discord and Reddit, numerous users have expressed frustration. One user, subscribed to the Claude Pro plan at $200 annually, noted, “it’s maxed out every Monday and resets at Saturday.” They reported limited access, stating they could only use Claude for 12 out of 30 days. Similarly, another developer shared their experience, revealing they used up their Max 5 plan, costing $100 per month, in just one hour of work.

Potential Reasons for Increased Token Usage

  • Quota Reductions: Last week, Anthropic announced a reduction in quotas during peak hours, impacting approximately 7% of users.
  • Promotional Changes: The end of a promotion on March 28, which doubled usage limits outside peak hours, may also have contributed.
  • Software Bugs: Reports suggest bugs within Claude Code could be causing inflated token costs. A user mentioned finding two independent bugs leading to cost increases between 10-20 times.

Strategies to Mitigate Usage

Some users have found that downgrading to an earlier version of Claude Code significantly reduced their token usage. One user confirmed, “Downgrading to 2.1.34 made a very noticeable difference.” Additionally, the documentation states that prompt caching is intended to lower processing times and costs for repetitive tasks.

Caching Limitations

However, the caching system currently has limitations. The cache lasts only five minutes, meaning brief pauses can result in increased costs upon resumption. Developers can extend the cache lifetime to one hour, but this incurs a higher token cost. Specifically, one-hour cache write tokens are twice the price of base input tokens.

Lack of Clarity in Usage Limits

Anthropic does not provide clear usage limits for its plans, which complicates the situation for developers. For instance, the Pro plan offers “at least five times the usage per session” compared to the free service, while the Standard Team plan claims “1.25x more usage per session than the Pro plan.” Consequently, users must frequently check their dashboards to monitor their consumed quotas.

The Broader Context of AI Tool Pricing

These issues highlight an ongoing negotiation between users and providers regarding acceptable pricing and usage models for AI development. As users aim to manage costs, providers strive to maintain profitability. The disconnect between encouraging developers to integrate AI into various workflows and having a quota system that limits functionality is striking. A user pointed out that “rate-limit errors need to be caught explicitly,” noting that unnoticed errors can quickly drain daily budgets.

As these challenges continue, both users and developers hope for improvements that balance functionality and cost, ensuring that tools like Claude Code remain viable for everyday use.

Next