Grok gets eyes — X-based chatbot can now analyze images

Grok 2
(Image credit: Shutterstock)

Elon Musk's artificial intelligence company, xAI, has unveiled a major new update to its AI assistant called Grok. The latest iteration now incorporates vision capabilities, enabling Grok to analyze and comprehend images, alongside its existing text functionalities.

Grok can already generate images using the Flux model from Black Forest Labs and it was the last of the major AI chat products not to include image analysis, also known as AI vision.

With the introduction of this vision feature, Grok can analyze images linked to posts on the X platform, interpret visual content such as documents, diagrams, and photographs and understand spatial relationships within images to help better describe the contents.

You could use this to come up with recipe ideas based on a photo of ingredients, identify the location of a landmark inside a photo shared on X or even explain the results of a graph. The last part could be particularly useful on a news-heavy platform like Grok.

How vision works in Grok

Grok

(Image credit: xAI Grok)

Users will soon notice a new button on posts containing images on the X platform. When clicked it sends the image to Grok, allowing users to pose questions or request analyses of the visual content. It could also be used to help with describing images for people with sight issues.

We haven’t seen official benchmarks yet but according to xAI Grok's vision capabilities hold their own against established models from OpenAI, Google and Anthropic. To this end, the company has introduced a new benchmark, RealWorldQA, designed to evaluate the model’s proficiency in understanding and reasoning about the physical world through images.

The announcement led to varied reactions from the AI community and users with some enthusiastic about how fast Grok is advancing, while others remained cautious, questioning its performance against established AI models.

What comes next for Grok

Elon Musk-owned xAI has a 200,000 GPU data center built for the sole purpose of training future versions of Grok. I think it's safe to say we’re going to see big things from the model in the future.

Specifically related to vision capabilities, these could find their way into robots. Musk owns Tesla, which also has its own robotics division. In the future, we may also see video and voice analysis from Grok as these are features already in place with Gemini and ChatGPT.

While this update marks a notable advancement for Grok, it's clear that the model is still in development compared to more mature AI models like Gemini or ChatGPT. As with all rapidly evolving AI technologies, we'll need to monitor both the upgraded capabilities and the ethical considerations of these developments in the months ahead.

More from Tom's Guide

Category
Arrow
Arrow
Back to MacBook Air
Brand
Arrow
Processor
Arrow
RAM
Arrow
Storage Size
Arrow
Screen Size
Arrow
Colour
Arrow
Storage Type
Arrow
Condition
Arrow
Price
Arrow
Any Price
Showing 10 of 47 deals
Filters
Arrow
Show more
Ritoban Mukherjee

Ritoban Mukherjee is a freelance journalist from West Bengal, India whose work on cloud storage, web hosting, and a range of other topics has been published on Tom's Guide, TechRadar, Creative Bloq, IT Pro, Gizmodo, Medium, and Mental Floss.

With contributions from
Read more
Grok logo on a phone handset on a keyboard
What is Grok? — everything you need to know about xAI's chatbot
Grok logo on a phone handset on a keyboard
Grok 3 AI model unveiled with '10x more power' than Grok 2 — what you need to know
Grok
Grok is coming to iPhone — Elon's X-based chatbot is getting its own app
Grok
xAI's standalone Grok iOS app launches in the US — here's how to find it
Grok
I just tested the new Grok-3 with 5 prompts — here’s what I like and don’t like about this chatbot
ChatGPT vs Grok
I put ChatGPT vs Grok to the test with 7 prompts — here's the winner
Latest in AI
Google Assistant logo on a smartphone screen
Google Assistant is losing features to make way for Gemini — here's what's just been axed
Siri
Siri 2.0 features reportedly only working ‘two-thirds to 80% of the time’
Apple Intelligence on an iPhone screen
Apple analysts sound alarm on Siri delay — here’s why
Manus vs. DeepSeek logos
I just tested Manus vs. DeepSeek with 7 prompts from Gemini — here's the winner
Shutterstock Sora image
5 must-try Sora prompts for creating incredible AI videos
AI Madness logo
AI Madness: The ultimate chatbot showdown
Latest in News
Galaxy S24
One UI 7 finally gets a stable release from Samsung — here's when it's coming to your phone
Google Assistant logo on a smartphone screen
Google Assistant is losing features to make way for Gemini — here's what's just been axed
3D printed model of alleged iPhone 17 Air design
iPhone 17 Air — these 5 big revelations have me excited for the first truly new iPhone in years
NYTimes Connections
NYT Connections today hints and answers — Tuesday, March 18 (#646)
Heath Ledger and Julia Stiles in 10 Things I Hate About You
My 7 favorite teen comedy movies from the ’90s when I need a dose of nostalgia
Zens Quattro Wireless Charging Pro 4 charging station with 3 iphones and an AirPods case
Double-decker 'AirPower' charger now available from Apple — here's what it costs