ChatGPT-5 vs Claude: 7 head-to-head tests reveal a surprisingly close winner

chatgpt and claude logos on phones — (Image credit: Shutterstock)

When it comes to AI chatbots, both ChatGPT-5 and Claude have reputations for speed, creativity and accuracy. That's why I just had to know how OpenAI's flagship model and Claude 4 Sonnet, which now can recall past chats, actually stack up when put through the same set of challenges.

To find out, I ran a head-to-head test using seven very different prompts, covering everything from tricky riddles to emotional intelligence to rapid creative brainstorming. The goal wasn’t just to see who got the correct answer, but to evaluate depth, tone, structure and how well each model handled the human side of the request. The results revealed some clear strengths (and surprising weaknesses) on both sides.

1. Deep reasoning and logic

screenshot of GPT-5 vs. Claude — (Image credit: Future)

Prompt: "A farmer has 17 sheep, and all but 9 run away. How many are left? Explain your reasoning step-by-step."

GPT-5 provided a correct response, but it lacked the depth in addressing misconceptions, making it slightly less effective for users who might struggle with the phrasing.

Claude used a structured, numbered step-by-step format (Steps 1-4). This makes the explanation easy to follow.

Winner: Claude wins for a more thorough response because it anticipated and explained the riddle aspect, which is crucial for a problem known to cause confusion.

2. Creative writing

Prompt: "Write a short, 150-word story about a detective who can only solve crimes in their dreams. Make it funny and end with a twist."

GPT-5 created a vivid, funny character with specific, absurd dream cases. The joke was clear and the twist was genuinely surprising and funny.

Claude set up the premise efficiently and added strong, funny details. But the execution felt slightly less vivid and polished than ChatGPT’s story.

Winner: GPT wins for a slightly funnier, more polished and more surprising story.

3. Summarization & tone control

screenshot of GPT-5 and Claude — (Image credit: Future)

Prompt: "Summarize the plot of The Matrix in two formats: (1) like you’re explaining it to a 10-year-old, (2) like you’re writing a college philosophy essay."

GPT-5 was clear and concise for the explanation to a child and focused on epistemology for the philosophical essay, but it lacked Claude’s exploration of free will vs. prophecy or hyperreality. In other words, it had strong phrasing but narrower scope.

Claude used clear, kid-friendly analogies in the summarization for the child and impressively weaved Plato, Descartes, Baudrillard, and free will/determinism into a cohesive analysis for the philosophy essay.

Winner: Claude wins for a college essay that demonstrated superior scholarly depth by integrating Baudrillard and the Oracle’s determinism. Its child explanation used more imaginative and relatable language than GPT, fully satisfying both halves of the prompt.

4. Real-world utility

Prompt: "I’m planning a 3-day trip to Boston with two kids under 10. Give me a simple itinerary that balances history, fun, and budget-friendly meals."

GPT-5 crafted a highly-structured plan that prioritized kid engagement, practical tips and meal picks.

Claude offered a plan with a strong budget focus with concise highlights but less of a focus on logistics.

Winner: GPT-5 wins for delivering a more practical, child-centered itinerary with superior attention to logistics, proximity and genuinely budget-friendly meal choices.

5. Multistep problem solving

Prompt: "Plan a balanced, gluten-free, 3-day meal plan for $50, and include a shopping list that works for a person with only a microwave."

GPT delivered a superior response that prioritized budget and microwave adaptation with zero cooking ambiguity.

Claude created an unrealistic plan, assuming sweet potatoes cook evenly in the microwave and went over budget.

Winner: GPT-5 wins for delivering the best response for a truly microwave-reliant, budget-accurate with clear gluten-free safeguards.

6. Emotional intelligence

Prompt: "My best friend just canceled plans for the third time. Write me a text that’s understanding but still sets boundaries."

GPT-5 crafted a concise and clear text message that felt slightly transactional.

Claude expertly balanced empathy with boundaries.

Winner: Claude wins for crafting a text that masterfully combines emotional intelligence with boundary-setting, while offering constructive paths forward. Its response feels authentically human and preserves the friendship’s warmth while addressing the pattern.

7. Rapid creative brainstorm

Prompt:"Give me 10 unique podcast episode ideas about the future of AI, making sure at least half could appeal to people who aren’t tech experts."

GPT-5 offered creative, engaging ideas that tapped into pop culture and personal experiences for a balanced and interactive podcast.

Claude drafted strong ethical ideas but less engaging hooks. It lacked a strong storytelling approach.

Winner: GPT-5 wins by creating podcast ideas that are more inviting for non-experts, structurally clearer with labeled sections and creatively formatted.

Overall winner: ChatGPT-5

In the end, ChatGPT-5 and Claude each had standout moments and this challenge was extremely close. GPT-5 excelled in practical, real-world tasks and creative flair, while Claude consistently impressed in emotional intelligence, structured reasoning and philosophical depth.

Choosing between them isn’t a matter of one being universally better, but rather about matching the model to the task. I suggest familiarizing yourself with all the big chatbots and exploring which features work best for you.

Follow Tom's Guide on Google News to get our up-to-date news, how-tos, and reviews in your feeds. Make sure to click the Follow button.

More from Tom's Guide

Back to Laptops

Apple

Asus

Dell

Lenovo

Intel Core i3

Intel Core i5

Intel Core i7

8GB RAM

16GB RAM

24GB RAM

32GB RAM

32GB

64GB

128GB

256GB

512GB

1TB

13.3-inch

13.4-inch

14-inch

15-inch

Black

Blue

Brown

Gold

Grey

Silver

New

Refurbished

Showing 10 of 125 deals

Filters☰

Apple 13" MacBook Air M4 (2025)

(256GB Blue)

$849.99

View Deal

Apple 15" MacBook Air M4 (2025)

(15-inch 256GB)

$1,049

View Deal

Dell XPS 13 (2016)

(13.3-inch 256GB)

Our Review

☆☆☆☆☆

$755

View Deal

Lenovo Yoga Slim 7x (Gen 9)

(512GB OLED)

$1,075.79

$858.11

View Deal

Lenovo IdeaPad Flex 5i ChromeBook Plus

(14-inch 128GB)

$599

$359.99

View Deal

Asus ROG Zephyrus G14 (2024)

Our Review

☆☆☆☆☆

$2,199.99

View Deal

Apple 13" MacBook Air M4 (2025)

(256GB SSD)

$999

View Deal

Apple 15" MacBook Air M4 (2025)

(15-inch 256GB)

(Intel Core i5)

Lenovo Yoga Slim 7x (Gen 9)

(Blue)

$1,439.99

$999.99

View Deal

Amanda Caswell is an award-winning journalist, bestselling YA author, and one of today’s leading voices in AI and technology. A celebrated contributor to various news outlets, her sharp insights and relatable storytelling have earned her a loyal readership. Amanda’s work has been recognized with prestigious honors, including outstanding contribution to media.

Known for her ability to bring clarity to even the most complex topics, Amanda seamlessly blends innovation and creativity, inspiring readers to embrace the power of AI and emerging technologies. As a certified prompt engineer, she continues to push the boundaries of how humans and AI can work together.

Beyond her journalism career, Amanda is a long-distance runner and mom of three. She lives in New Jersey.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.

Welcome to the Tom's Guide Club !

Hi ,

Earn Your First Badge

Complete 1 quiz to unlock your first badge.

Keep earning badges

Explore ways to get more involved as a member.

See what you’ve unlocked.

Members Exclusive

I tested ChatGPT-5 vs Claude with 7 challenging prompts — here's the winner

1. Deep reasoning and logic

2. Creative writing

3. Summarization & tone control

4. Real-world utility

5. Multistep problem solving

6. Emotional intelligence

7. Rapid creative brainstorm

Overall winner: ChatGPT-5

More from Tom's Guide

GET TG ACCESS QUICK

Welcome to the Tom's Guide Club !

Hi ,

Earn Your First Badge

Complete 1 quiz to unlock your first badge.

Keep earning badges

Explore ways to get more involved as a member.

See what you’ve unlocked.

Members Exclusive

1. Deep reasoning and logic

2. Creative writing

3. Summarization & tone control

4. Real-world utility

5. Multistep problem solving

6. Emotional intelligence

7. Rapid creative brainstorm

Overall winner: ChatGPT-5

More from Tom's Guide