Grok gets eyes — X-based chatbot can now analyze images

Grok 2
(Image credit: Shutterstock)

Elon Musk's artificial intelligence company, xAI, has unveiled a major new update to its AI assistant called Grok. The latest iteration now incorporates vision capabilities, enabling Grok to analyze and comprehend images, alongside its existing text functionalities.

Grok can already generate images using the Flux model from Black Forest Labs and it was the last of the major AI chat products not to include image analysis, also known as AI vision.

With the introduction of this vision feature, Grok can analyze images linked to posts on the X platform, interpret visual content such as documents, diagrams, and photographs and understand spatial relationships within images to help better describe the contents.

You could use this to come up with recipe ideas based on a photo of ingredients, identify the location of a landmark inside a photo shared on X or even explain the results of a graph. The last part could be particularly useful on a news-heavy platform like Grok.

How vision works in Grok

Grok

(Image credit: xAI Grok)

Users will soon notice a new button on posts containing images on the X platform. When clicked it sends the image to Grok, allowing users to pose questions or request analyses of the visual content. It could also be used to help with describing images for people with sight issues.

We haven’t seen official benchmarks yet but according to xAI Grok's vision capabilities hold their own against established models from OpenAI, Google and Anthropic. To this end, the company has introduced a new benchmark, RealWorldQA, designed to evaluate the model’s proficiency in understanding and reasoning about the physical world through images.

The announcement led to varied reactions from the AI community and users with some enthusiastic about how fast Grok is advancing, while others remained cautious, questioning its performance against established AI models.

What comes next for Grok

Elon Musk-owned xAI has a 200,000 GPU data center built for the sole purpose of training future versions of Grok. I think it's safe to say we’re going to see big things from the model in the future.

Specifically related to vision capabilities, these could find their way into robots. Musk owns Tesla, which also has its own robotics division. In the future, we may also see video and voice analysis from Grok as these are features already in place with Gemini and ChatGPT.

While this update marks a notable advancement for Grok, it's clear that the model is still in development compared to more mature AI models like Gemini or ChatGPT. As with all rapidly evolving AI technologies, we'll need to monitor both the upgraded capabilities and the ethical considerations of these developments in the months ahead.

More from Tom's Guide

Category
Arrow
Arrow
Back to MacBook Air
Brand
Arrow
Processor
Arrow
RAM
Arrow
Storage Size
Arrow
Screen Size
Arrow
Colour
Arrow
Storage Type
Arrow
Condition
Arrow
Price
Arrow
Any Price
Showing 10 of 70 deals
Filters
Arrow
Show more
Ritoban Mukherjee

Ritoban Mukherjee is a freelance journalist from West Bengal, India whose work on cloud storage, web hosting, and a range of other topics has been published on Tom's Guide, TechRadar, Creative Bloq, IT Pro, Gizmodo, Medium, and Mental Floss.

With contributions from
Read more
Grok logo on a phone handset on a keyboard
What is Grok? — everything you need to know about xAI's chatbot
Grok logo on a phone handset on a keyboard
Grok 3 AI model unveiled with '10x more power' than Grok 2 — what you need to know
Grok
Grok is coming to iPhone — Elon's X-based chatbot is getting its own app
Grok Aurora
Grok Aurora image generator is live — here’s how to use it
Grok
xAI's standalone Grok iOS app launches in the US — here's how to find it
xAI
xAI launches and then pulls Aurora image generator in Grok — here’s what happened
Latest in AI
Sam Altman
ChatGPT-4.5 delayed in surprise announcement — and it could launch with a controversial new payment model
AI Mode of google search
Google launches 'AI Mode' for search — here's how to try it now
Project Astra AI agent
Project Astra — everything you need to know about Google's next-gen smart glasses and new AI assistant
The new Gemini app home page vs the old
Forget ChatGPT — Google Gemini can now see the world with live video and screen-sharing
iOS 18 logo on iPhone in person's lap
iOS 18.5 is coming soon with huge Siri upgrades — here’s everything to expect
The DeepSeek logo seen on the silhouette of a smartphone
I have ChatGPT Plus — but here's 7 reasons why I use DeepSeek instead
Latest in News
NYTimes Connections
NYT Connections today hints and answers — Thursday, March 6 (#634)
Galaxy Z Fold 6 shown in hand
Samsung just killed the crease with this breakthrough foldable phone display
Sam Altman
ChatGPT-4.5 delayed in surprise announcement — and it could launch with a controversial new payment model
Green skull on smartphone screen.
Over one million Android devices infected with password-stealing, pre-installed botnet malware — how to stay safe
Switch 2 console and logo
Nintendo Switch 2 — analyst just tipped release window
Apple tvOS 18 new features
New tvOS 18 code hints at Apple's much rumored smart home hub