I hit Claude’s new usage limits — and It changed how I use AI forever

Claude logo on phone
(Image credit: Shutterstock)

I’ve spent the last several months treating Claude like an infinite resource. It was my digital intern, my sounding board and my assistant that could handle whatever I threw at it. But last week, right in the middle of a big project I'm working on, it dropped the ball.

It wasn't a Claude outage or a hallucination that derailed my work, it was a usage cap.

If you’re a power user of Anthropic’s Claude 4.6 Sonnet or 4.6 Opus, you’ve likely seen the warning: "You have reached your message limit until 4 PM." It’s a jarring moment that turns a seamless workflow into a complete standstill. And crazy enough, you'll see this message even if you're a Pro user like me.

Article continues below

But after the initial frustration wore off, I realized that I needed to change my usage strategy. Here is how I’ve pivoted to stay productive — and why this could be useful for anyone regardless of what subscription tier you use. From free to Pro to Max, here's how to work around those limits.

Usage limits are tightening

Claude

(Image credit: Shutterstock)

AI is expensive. It's one of the biggest reasons OpenAI pulled the plug on Sora. Between the massive compute power required to run LLMs (Large Language Models) and the surging user base, companies like Anthropic and OpenAI are tightening the leash.

Even on Claude Pro ($20/month), limits aren't fixed; they fluctuate based on demand. If you're working on a complex project with long attachments, you’ll burn through your "allowance" faster than you think.

3 ways I changed my AI strategy

Person typing on a laptop in a low lit room

(Image credit: Olena Malik / Getty Images)

To keep working without being "locked out," I had to stop treating Claude like a chatty coworker and start treating it like a high-priced consultant. That meant:

  • No more "thinking out loud." I used to send five or six short messages to "warm up" an idea. Now, that’s a waste of credits. Instead, I now draft my full context in a Notepad file first. I combine the goal, the constraints and the raw data into one "Mega-Prompt." This results in getting a better first-draft response and saves 80% of my message overhead.
  • The "model-hopping" workflow. I’ve always used multiple chatbots at once, and this strategy helps a lot to increase productivity when usage limits are low. Knowing what chatbots are better for certain projects, helps. To stay within my token budget, I use Claude for creative brainstorming and coding (where its "human" tone shines), but I switch to ChatGPT for data analysis or Google Gemini for quick research tasks. By spreading the load, I rarely hit the ceiling on any single platform.
  • Reducing follow up. I’ve started using System Instructions more effectively. I tell Claude exactly what the final output should look like in the first message to help reduce follow up. If I still have questions, I paste the response into Gemini and go from there.

The bottom line

AI is only getting more advanced, using more power and well, that means it's going to get more expensive. For that reason, the era of "infinite AI" is over. That was a gift of the early beta days. As these tools become integrated into professional workflows though, we have to move on from casual chatting to intentional prompting.

Hitting a limit is frustrating, but it forced me to be a more concise, clear and efficient communicator. If you want to get the most out of your $20-a-month subscription, stop chatting — and give my strategies a try. Let me know in the comments what you do when you hit your usage limits.


Google News

Follow Tom's Guide on Google News and add us as a preferred source to get our up-to-date news, analysis, and reviews in your feeds.


More from Tom's Guide

TOPICS
Amanda Caswell
AI Editor

Amanda Caswell is one of today’s leading voices in AI and technology. A celebrated contributor to various news outlets, her sharp insights and relatable storytelling have earned her a loyal readership. Amanda’s work has been recognized with prestigious honors, including outstanding contribution to media.

Known for her ability to bring clarity to even the most complex topics, Amanda seamlessly blends innovation and creativity, inspiring readers to embrace the power of AI and emerging technologies. As a certified prompt engineer, she continues to push the boundaries of how humans and AI can work together.

Beyond her journalism career, Amanda is a long-distance runner and mom of three. She lives in New Jersey.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.