I gave ChatGPT and Gemini the same image prompts — one completely crushed the other

logos of ChatGPT and Gemini — (Image credit: Future)

Jump To:

1. Hyper-realistic scene
2. Abstract concept visualization
3. Complex text integration
4. Mixed-style fusion
5. Technical precision
6. Emotion-driven art
7. Pop culture mashup
Bonus round: Fix the flaws

AI image generators are getting better by the week. But not all bots are created equal.

I pitted Gemini vs ChatGPT across seven wildly different image-generation prompts to see how each handled realism, abstraction, text integration and emotional storytelling.

From futuristic cityscapes to steampunk owls and violin water-sculptures, each prompt was designed to push the models’ creative and technical limits.

1. Hyper-realistic scene

ChatGPT vs Gemini AI images of future worlds — (Image credit: Future)

Prompt: “Generate a photorealistic image of a futuristic Tokyo street at night in 2070, with neon holograms, flying cars and a rainy pavement reflecting colorful lights. Include intricate details like a glowing vending machine selling robot pets.”

ChatGPT explicitly included “ROBOT PETS” in the image, directly addressing the critical detail of the glowing vending machine specified in the prompt. This suggests ChatGPT prioritized the requested intricate element.

Gemini offered a very similar image including holograms and flying cars, however, they are less central to the prompt’s core requirements.

Winner: ChatGPT wins for adherence to instructions, including the glowing vending machine selling robot pets, a key element explicitly requested in the prompt.

2. Abstract concept visualization

AI images of a guitar from ChatGPT and Gemini — (Image credit: Future)

Prompt: “Create an image representing ‘the sound of a violin made entirely of water.’ Use surreal shapes, fluid textures and dynamic motion.”

ChatGPT emphasized fluid motion and abstract water tendrils, which produced a very surreal and artistic image. The entire violin is formed from flowing water, capturing the concept metaphorically.

Gemini rendered a highly realistic, 3D image with clear texture and detailed reflections. The bow and strings are intact, making it functionally believable. This image is less surreal as it’s more “a violin made of water” than “the sound of one.” Emotion and movement are present but subtler.

Winner: ChatGPT wins for its artistic interpretation of an abstract concept. The chatbot arguably came closer to fulfilling the emotional and imaginative request in the prompt.

3. Complex text integration

AI generated movie posters from ChatGPT and Gemini — (Image credit: Future)

Prompt: “Design a vintage movie poster titled ‘Galactic Samurai’ with Japanese calligraphy, a cyborg warrior, and a glowing katana. Include small English text at the bottom: ‘In theaters 2025.’”

ChatGPT nailed the old-school, grindhouse-style poster vibe with paper texture and muted tones. The readable Japanese calligraphy of “Galactic” was a nice touch. The chatbot delivered a clean design, symmetrical structure and excellent vintage balance.

Gemini clearly rendered and matched the “galactic” aesthetic with electric blue accents to deliver a striking composition. The cyborg samurai is front and center with great posture and intense lighting. The image feels like a modern action poster with a retro sci-fi influence.

Winner: ChatGPT crafted a classic poster design with vintage vibes and legible Japanese.

4. Mixed-style fusion

AI generated images of steampunk owls — (Image credit: Future)

Prompt: “Generate an image of a steampunk owl with mechanical gears, Victorian-era brass details, and glowing neon eyes. Combine realism with cartoonish proportions.”

ChatGPT crafted an image of a fully built mechanical owl with expert detailing of the brass gears and paneling. Intricate and consistent, the steampunk theme is clear.

Gemini delivered an impressively rendered steampunk owl with strong aesthetic polish. It’s detailed, charismatic and technically sound, though slightly more on the realistic side than cartoonish.

Winner: ChatGPT wins for nailing the Victorian brass aesthetic with ornate latticework, mechanical symmetry and visible gearwork.

5. Technical precision

AI generated images by ChatGPT and Gemini — (Image credit: Future)

Prompt: “Draw a cross-sectional diagram of a sci-fi spaceship engine with labeled parts: plasma core, cooling vents, gravity stabilizers. Use a technical illustration style.”

ChatGPT drafted a blueprint that feels like an actual engineering manual or vintage NASA schematic. The labeling is correct and the technical cross-section was believable.

Gemini delivered on style, structure and sci-fi feel. However, the chatbot loses heavily on execution due to multiple spelling errors and missed labeling. Those slip-ups are hard to overlook, especially in this circumstance.

Winner: ChatGPT wins for the most realistic image with technical details and accurate spelling.

6. Emotion-driven art

AI generated images of playgrounds by chatGPT and gemini — (Image credit: Future)

Prompt: “Visualize ‘nostalgia’ as a landscape. Include symbolic elements like an abandoned playground, fading polaroid photos floating in the air and a sunset with muted colors.”

ChatGPT featured a very worn, rusted swing set and slide. The overgrowth and empty stillness imply deep abandonment and the passage of time.

Gemini created an image that feels more like a recent past than a long-forgotten memory. The playground is still intact and functional, though unused.

Winner: Tie. If we’re going for emotion-as-art, ChatGPT’s version wins. But, if we’re leaning toward nostalgia-as-memory, Gemini’s version is the better choice.

7. Pop culture mashup

Prompt: “Create a scene where a Pixar-style raccoon chef is cooking ramen in a Studio Ghibli-inspired enchanted forest. Include a friendly fire spirit as a kitchen assistant.”

ChatGPT leaned heavily into a hand-painted, Pixar-meets-Ghibli aesthetic. The raccoon has large expressive eyes and rounded features, clearly Pixar-style. The forest has soft brush strokes and rich natural hues reminiscent of Princess Mononoke or My Neighbor Totoro.

Gemini used a 3D-rendered, almost claymation look. The lighting and character design lean more toward a whimsical, soft toy-inspired style. It’s still “Pixar-like,” but less painterly and more diorama-esque.

Winner: ChatGPT wins for a story-rich, expressive, animated-style art.

Bonus round: Fix the flaws

Updated_improved image from ChatGPT and Gemini with added shadows — (Image credit: Future)

Prompt: “Improve the previous image by adding more realistic shadows and fixing distorted proportions.”

ChatGPT improved the image, giving the raccoon well-balanced proportions, rounded face, expressive eyes and limbs that match its Pixar-like body. Its posture while cooking is fluid and believable.

Gemini improved proportions, making them much better than before. The raccoon no longer looks stiff, but it still leans toward a more toy-like design. The arms and hands remain a bit chunky, giving it a plush look.

Winner: ChatGPT still wins for strong ambient lighting with directional shadows under the table, bowl, and raccoon. The flame glows warmly, casting light upward. Natural light filters in from behind.

Overall winner: ChatGPT

While both Gemini and ChatGPT have made major strides in AI image generation, ChatGPT consistently delivered more accurate, emotionally resonant and stylistically cohesive results across a wide range of prompts.

From capturing abstract concepts to integrating text and improving technical flaws, it proved to be the more reliable creative partner. Gemini showed potential, particularly in polish and atmosphere, but often missed key details or leaned too far into realism at the expense of imagination.

If you're looking for an AI that blends instruction-following with artistic interpretation, ChatGPT still leads the pack.

More from Tom's Guide

Apple

Asus

Dell

Lenovo

Intel Core i3

Intel Core i5

Intel Core i7

8GB RAM

16GB RAM

24GB RAM

32GB

64GB

128GB

256GB

512GB

1TB

2TB

4TB

13.3-inch

13.4-inch

14-inch

15-inch

Black

Blue

Gold

Grey

Silver

New

Refurbished

EMMC

SSD

Showing 10 of 143 deals

Filters☰

Apple 13" MacBook Air M4 (2025)

(256GB SSD)

$899

View Deal

Apple 15" MacBook Air M4 (2025)

(15-inch 256GB)

$1,095

$919

View Deal

Dell XPS 13 Rose Gold

(13.3-inch 128GB)

$1,334.99

$278

View Deal

Lenovo Yoga Slim 7x (Gen 9)

(512GB OLED)

$1,075.79

$858.11

View Deal

Lenovo IdeaPad Flex 5i ChromeBook Plus

(14-inch 128GB)

$499

View Deal

Asus ROG Zephyrus G14 (2024)

Our Review

☆☆☆☆☆

$2,199.99

View Deal

Apple 13" MacBook Air M4 (2025)

(256GB Blue)

$999

$899

View Deal

Apple 15" MacBook Air M4 (2025)

(15-inch 512GB)

(13.3-inch 128GB)

Our Review

☆☆☆☆☆

$675

View Deal

Lenovo Yoga Slim 7x (Gen 9)

(Blue)

$1,439.99

$1,159.99

View Deal

Amanda Caswell is the AI Editor at Tom's Guide and one of today’s leading voices in AI and technology.

A celebrated contributor to various news outlets, her sharp insights and relatable storytelling have earned her a loyal readership. Amanda’s work has been recognized with prestigious honors, including outstanding contribution to media.

Known for her ability to bring clarity to even the most complex topics, Amanda seamlessly blends innovation and creativity, inspiring readers to embrace the power of AI and emerging technologies.

As a certified prompt engineer, she continues to push the boundaries of how humans and AI can work together.

Beyond her journalism career, Amanda is a long-distance runner and mom of three. She lives in New Jersey.