Google unveiled its next-generation artificial intelligence model Gemini on December 6 after months of hinting at its development. This is designed to be a direct competitor to OpenAI's GPT range of models.
There are three versions of Gemini. The biggest is the Ultra model capable of "seeing the world the way humans do" through video, audio, text and images. The second is the Pro model, which is powering Google Bard and is similar in ability to the free ChatGPT.
The most surprising announcement was Google Gemini Nano, a small version of the AI model that can generate text, be used for conversations, and analyze or summarize existing text whilst running entirely on an Android phone.
In February Google announced it was renaming its Bard chatbot to Gemini and would be integrating its functionality into various products including Workspace.
Google Gemini: Latest news (February 23)
- Google made Gemini an 'over cautious' middle manager
- ChatGPT finally has competition — Google Bard with Gemini just matched it with a huge upgrade
- How to use Google Gemini to create images from text
What is Google Gemini
In the world of artificial intelligence, the large language model is currently king. Capable of generating a range of content and handling natural language interactions, they power tools like Microsoft Copilot, ChatGPT and Bard.
Gemini is the first product launched from the merger of all Google AI teams including previously arms-length British AI lab DeepMind earlier this year.
It was trained to be multimodal from the ground up. This means that its training dataset included images, video, audio, code, and text. Other models are trained on different types of data separately and then stitched together.
Gemini is the name of both the family of large language models and the chatbot that was previously known as Bard. It is also the name of the AI tool in Workspace.
Google Gemini versions
The full functionality is only available in Gemini Ultra, the version of the model that requires the most high-end chips and a data center to operate.
Gemini Ultra is the model that powers the paid-for version of the Gemini chatbot known as Gemini Advanced. This is $19.99 per month.
Running this scale of model is expensive so to combat this, and keep providing a free version of the AI, Google also revealed Pro and Nano, two smaller versions of the AI to run on cheaper chips, run faster and even run on local devices.
|The most advanced AI model Google makes, able to work across multiple modalities but requiring expensive data processing capabilities.
|Mid-tier model that can power apps like Google Bard and be cheaper to operate for developers requiring more advanced features than in Nano.
|Small model that can run on Android handsets and allow developers to add generative AI to apps without expensive cloud fees.
How to use Google Gemini
Gemini Pro powers the free version of the Gemini chatbot. Google claims this is on par with GPT-3.5, the previous generation AI model from OpenAI that powers the free version of ChatGPT.
You can see how they compare in a head-to-head for Tom's Guide.
If you have a Google Pixel 8 Pro you may have already used the Nano version of Gemini without even realizing it as it is built into the phone. Developers will also be able to integrate its capabilities into their apps.
What comes next for Gemini?
Most users are likely to utilize the Nano version. This will power text generation, content analysis, summaries, and more in thousands of apps on the Play Store. It will improve search results on Android and boost translation and transcription capabilities.
Google has faced criticism for the way it promoted Gemini, showing a video that made it appear the Ultra version of the model could real-time footage and analyze changes as they happen. This turned out to be a re-creation of still images for the sake of marketing.
Despite these concerns, Google is likely to achieve some remarkable advances with Gemini Ultra once it is released. It is also making all of its models available through its cloud platform for developers to use in their own products so we will see applications and services released that utilize Gemini as an alternative to GPT-4.
More from Tom's Guide
Get the BEST of Tom’s Guide daily right in your inbox: Sign up now!
Upgrade your life with the Tom’s Guide newsletter. Subscribe now for a daily dose of the biggest tech news, lifestyle hacks and hottest deals. Elevate your everyday with our curated analysis and be the first to know about cutting-edge gadgets.
Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover.
When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?