I tested ChatGPT-5 vs Claude with 7 challenging prompts — here's the winner

When it comes to AI chatbots, both ChatGPT-5 and Claude have reputations for speed, creativity and accuracy. That's why I just had to know how OpenAI's flagship model and Claude 4 Sonnet, which now can recall past chats, actually stack up when put through the same set of challenges.
To find out, I ran a head-to-head test using seven very different prompts, covering everything from tricky riddles to emotional intelligence to rapid creative brainstorming. The goal wasn’t just to see who got the correct answer, but to evaluate depth, tone, structure and how well each model handled the human side of the request. The results revealed some clear strengths (and surprising weaknesses) on both sides.
1. Deep reasoning and logic
Prompt: "A farmer has 17 sheep, and all but 9 run away. How many are left? Explain your reasoning step-by-step."
GPT-5 provided a correct response, but it lacked the depth in addressing misconceptions, making it slightly less effective for users who might struggle with the phrasing.
Claude used a structured, numbered step-by-step format (Steps 1-4). This makes the explanation easy to follow.
Winner: Claude wins for a more thorough response because it anticipated and explained the riddle aspect, which is crucial for a problem known to cause confusion.
2. Creative writing
Prompt: "Write a short, 150-word story about a detective who can only solve crimes in their dreams. Make it funny and end with a twist."
GPT-5 created a vivid, funny character with specific, absurd dream cases. The joke was clear and the twist was genuinely surprising and funny.
Claude set up the premise efficiently and added strong, funny details. But the execution felt slightly less vivid and polished than ChatGPT’s story.
Winner: GPT wins for a slightly funnier, more polished and more surprising story.
Get instant access to breaking news, the hottest reviews, great deals and helpful tips.
3. Summarization & tone control
Prompt: "Summarize the plot of The Matrix in two formats: (1) like you’re explaining it to a 10-year-old, (2) like you’re writing a college philosophy essay."
GPT-5 was clear and concise for the explanation to a child and focused on epistemology for the philosophical essay, but it lacked Claude’s exploration of free will vs. prophecy or hyperreality. In other words, it had strong phrasing but narrower scope.
Claude used clear, kid-friendly analogies in the summarization for the child and impressively weaved Plato, Descartes, Baudrillard, and free will/determinism into a cohesive analysis for the philosophy essay.
Winner: Claude wins for a college essay that demonstrated superior scholarly depth by integrating Baudrillard and the Oracle’s determinism. Its child explanation used more imaginative and relatable language than GPT, fully satisfying both halves of the prompt.
4. Real-world utility
Prompt: "I’m planning a 3-day trip to Boston with two kids under 10. Give me a simple itinerary that balances history, fun, and budget-friendly meals."
GPT-5 crafted a highly-structured plan that prioritized kid engagement, practical tips and meal picks.
Claude offered a plan with a strong budget focus with concise highlights but less of a focus on logistics.
Winner: GPT-5 wins for delivering a more practical, child-centered itinerary with superior attention to logistics, proximity and genuinely budget-friendly meal choices.
5. Multistep problem solving
Prompt: "Plan a balanced, gluten-free, 3-day meal plan for $50, and include a shopping list that works for a person with only a microwave."
GPT delivered a superior response that prioritized budget and microwave adaptation with zero cooking ambiguity.
Claude created an unrealistic plan, assuming sweet potatoes cook evenly in the microwave and went over budget.
Winner: GPT-5 wins for delivering the best response for a truly microwave-reliant, budget-accurate with clear gluten-free safeguards.
6. Emotional intelligence
Prompt: "My best friend just canceled plans for the third time. Write me a text that’s understanding but still sets boundaries."
GPT-5 crafted a concise and clear text message that felt slightly transactional.
Claude expertly balanced empathy with boundaries.
Winner: Claude wins for crafting a text that masterfully combines emotional intelligence with boundary-setting, while offering constructive paths forward. Its response feels authentically human and preserves the friendship’s warmth while addressing the pattern.
7. Rapid creative brainstorm
Prompt:"Give me 10 unique podcast episode ideas about the future of AI, making sure at least half could appeal to people who aren’t tech experts."
GPT-5 offered creative, engaging ideas that tapped into pop culture and personal experiences for a balanced and interactive podcast.
Claude drafted strong ethical ideas but less engaging hooks. It lacked a strong storytelling approach.
Winner: GPT-5 wins by creating podcast ideas that are more inviting for non-experts, structurally clearer with labeled sections and creatively formatted.
Overall winner: ChatGPT-5
In the end, ChatGPT-5 and Claude each had standout moments and this challenge was extremely close. GPT-5 excelled in practical, real-world tasks and creative flair, while Claude consistently impressed in emotional intelligence, structured reasoning and philosophical depth.
Choosing between them isn’t a matter of one being universally better, but rather about matching the model to the task. I suggest familiarizing yourself with all the big chatbots and exploring which features work best for you.
Follow Tom's Guide on Google News to get our up-to-date news, how-tos, and reviews in your feeds. Make sure to click the Follow button.
More from Tom's Guide
- GPT-4 isn’t gone — here’s how to use it for free right now
- ChatGPT-5 just got 4 new personalities — here’s how to use them (and why you should)
- From hyper-personal assistants to mind-reading tech — this is how AI will transform everything by 2035













Amanda Caswell is an award-winning journalist, bestselling YA author, and one of today’s leading voices in AI and technology. A celebrated contributor to various news outlets, her sharp insights and relatable storytelling have earned her a loyal readership. Amanda’s work has been recognized with prestigious honors, including outstanding contribution to media.
Known for her ability to bring clarity to even the most complex topics, Amanda seamlessly blends innovation and creativity, inspiring readers to embrace the power of AI and emerging technologies. As a certified prompt engineer, she continues to push the boundaries of how humans and AI can work together.
Beyond her journalism career, Amanda is a bestselling author of science fiction books for young readers, where she channels her passion for storytelling into inspiring the next generation. A long-distance runner and mom of three, Amanda’s writing reflects her authenticity, natural curiosity, and heartfelt connection to everyday life — making her not just a journalist, but a trusted guide in the ever-evolving world of technology.
You must confirm your public display name before commenting
Please logout and then login again, you will then be prompted to enter your display name.