Love it or hate it, Grok 4 is crushing it — here’s how

(Image credit: VINCENT FEURAY / Getty Images)

When it comes to chatbots, it's easy to forget about Grok because it seems like other big tech is always in the news. With Google's Nano Banana starting new trends and OpenAI's ChatGPT hyping their latest models, Elon Musk's chatbot simply exists in the background.

I've definitely found myself rolling my eyes at some of Grok's decisions, especially when it comes to image generation. However, it's clear that there are some reasons to sit in awe of what Elon Musk calls “the smartest AI in the world.”

As someone who has spent hours testing it, the truth is, it's not just hype. From near-instant web searches to jaw-dropping results on complex engineering queries, Grok 4 is delivering in ways its predecessors and rivals haven’t quite managed. Whether you love the direction or cringe at the controversies, Grok 4 may always be the underdog that quietly crushes it.

What makes xAI's Grok different

I now think @xAI has a chance of reaching AGI with @Grok 5. Never thought that before. https://t.co/FaBUYegl3DSeptember 17, 2025

Elon Musk posted on X highlighting that Grok 4 is at the top of the ARC-AGI leaderboard. To understand why that's impressive, it's important to become familiar with how models are tracked on it.

Essentially, the ARC-AGI leaderboard is a scoreboard for AI, that not only tracks how many problems a model can solve, but also how efficiently it solves them. In other words, it's measuring both the brain and the resourcefulness of the model. High performance with low cost per task is what matters most.

So, Grok's position at the very top is extrememly significant because it means the xAI model is not only keeping up with rivals like Gemini and ChatGPT, but outpacing them on some of the toughest benchmark criteria possible.

Beating every other chatbot suggests that Grok 4 is powerful and efficient, which is exactly the type of breakthrough that supports true progress in the evolution of artifical general intelligence (AGI).

Where Grok still stumbles

Whether used on X or on the standalone platform, real-time search pulls in fresh infromation from both the web and X, so it can keep up with breaking news at a moment's notice.

However, the accuracy and bias concerns are what critics keep coming back to. Grok has made some claims that turned out false, and there are questions about how its alignment is being guided (e.g. how much Musk’s own views factor in).

The model also struggles with issues of content moderation after xAI scrambled to pull posts and update filters when anitsemitc content popped up.

The takeaway

Despite the model beating it's rivals, questions still remain like, will it stay reliable as usage increases? Will “garbage data” or bias creep back in under pressure? How well will xAI handle moderation long-term? The past controversies suggest it’s an ongoing battle.

There are no doubts that Grok is not perfect. It carries some extremely controversial baggage, but the proof of what it does better in terms of speed, real-time data and flexible thinking makes it a serious contender in the AI race.

More from Tom's Guide

Apple

Asus

Dell

Lenovo

AMD Ryzen

Intel Core i5

Intel Core i7

8GB RAM

16GB RAM

24GB RAM

32GB RAM

32GB

64GB

128GB

256GB

512GB

1TB

2TB

4TB

13.3-inch

13.4-inch

14-inch

15-inch

Black

Blue

Gold

Grey

Silver

New

Refurbished

Showing 10 of 172 deals

Filters☰

Apple 13" MacBook Air M4 (2025)

(256GB SSD)

$899

View

Apple 15" MacBook Air M4 (2025)

(15-inch 512GB)

$1,399

$1,054.95

View

Dell XPS 13 Rose Gold

(13.3-inch 128GB)

$1,334.99

$278

View

Lenovo Yoga Slim 7x (Gen 9)

(512GB OLED)

$1,075.79

$858.11

View

Lenovo Chromebook Plus 14

(Grey)

Our Review

☆☆☆☆☆

$639.99

$419.99

View

Asus ROG Zephyrus G14 (2025)

(14-inch 1TB)

Our Review

☆☆☆☆☆

$1,799.99

View

Apple 13" MacBook Air M4 (2025)

(256GB Blue)

$999

$899

View

Apple 15" MacBook Air M4 (2025)

(15-inch 512GB)

(13.3-inch 128GB)

Our Review

☆☆☆☆☆

$675

View

Lenovo Yoga Slim 7x (Gen 9)

(1TB Blue)

$1,099

View

TOPICS

Amanda Caswell is the AI Editor at Tom's Guide and one of today’s leading voices in AI and technology.

A celebrated contributor to various news outlets, her sharp insights and relatable storytelling have earned her a loyal readership. Amanda’s work has been recognized with prestigious honors, including outstanding contribution to media.

Known for her ability to bring clarity to even the most complex topics, Amanda seamlessly blends innovation and creativity, inspiring readers to embrace the power of AI and emerging technologies.

As a certified prompt engineer, she continues to push the boundaries of how humans and AI can work together.

Beyond her journalism career, Amanda is a long-distance runner and mom of three. She lives in New Jersey.