Google Gemini: Everything we know about the advanced AI model

Google Gemini logo
(Image credit: Google)

Google unveiled its next-generation artificial intelligence model Gemini on December 6 after months of hinting at its development. This is designed to be a direct competitor to OpenAI's GPT range of models.

There are three versions of Gemini. The biggest is the Ultra model capable of "seeing the world the way humans do" through video, audio, text and images. The second is the Pro model, which is powering Google Bard and is similar in ability to the free ChatGPT.

The most surprising announcement was Google Gemini Nano, a small version of the AI model that can generate text, be used for conversations, and analyze or summarize existing text whilst running entirely on an Android phone.

In February Google announced it was renaming its Bard chatbot to Gemini and would be integrating its functionality into various products including Workspace.

Google Gemini: Latest news (February 23)

What is Google Gemini

In the world of artificial intelligence, the large language model is currently king. Capable of generating a range of content and handling natural language interactions, they power tools like Microsoft Copilot, ChatGPT and Bard.

Gemini is the first product launched from the merger of all Google AI teams including previously arms-length British AI lab DeepMind earlier this year. 

It was trained to be multimodal from the ground up. This means that its training dataset included images, video, audio, code, and text. Other models are trained on different types of data separately and then stitched together.

Gemini is the name of both the family of large language models and the chatbot that was previously known as Bard. It is also the name of the AI tool in Workspace.

Google Gemini versions

The full functionality is only available in Gemini Ultra, the version of the model that requires the most high-end chips and a data center to operate. 

Gemini Ultra is the model that powers the paid-for version of the Gemini chatbot known as Gemini Advanced. This is $19.99 per month.

Running this scale of model is expensive so to combat this, and keep providing a free version of the AI, Google also revealed Pro and Nano, two smaller versions of the AI to run on cheaper chips, run faster and even run on local devices.

Swipe to scroll horizontally
Gemini Ultra vs Pro vs Nano
Model nameUse caseRelease data
Gemini UltraThe most advanced AI model Google makes, able to work across multiple modalities but requiring expensive data processing capabilities.Early 2024
Gemini ProMid-tier model that can power apps like Google Bard and be cheaper to operate for developers requiring more advanced features than in Nano.December 6
Gemini NanoSmall model that can run on Android handsets and allow developers to add generative AI to apps without expensive cloud fees.December 13

How to use Google Gemini

Google Gemini Advanced

(Image credit: Google)

Gemini Pro powers the free version of the Gemini chatbot. Google claims this is on par with GPT-3.5, the previous generation AI model from OpenAI that powers the free version of ChatGPT. 

You can see how they compare in a head-to-head for Tom's Guide.

If you have a Google Pixel 8 Pro you may have already used the Nano version of Gemini without even realizing it as it is built into the phone. Developers will also be able to integrate its capabilities into their apps.

What comes next for Gemini?

Most users are likely to utilize the Nano version. This will power text generation, content analysis, summaries, and more in thousands of apps on the Play Store. It will improve search results on Android and boost translation and transcription capabilities.

Google has faced criticism for the way it promoted Gemini, showing a video that made it appear the Ultra version of the model could real-time footage and analyze changes as they happen. This turned out to be a re-creation of still images for the sake of marketing.

Despite these concerns, Google is likely to achieve some remarkable advances with Gemini Ultra once it is released. It is also making all of its models available through its cloud platform for developers to use in their own products so we will see applications and services released that utilize Gemini as an alternative to GPT-4.


More from Tom's Guide

Ryan Morrison
AI Editor

Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover. When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?