Meet Alter3 — the creepy new humanoid robot powered by OpenAI GPT-4

The robot responds to voice commands
(Image credit: University of Tokyo)

In a move that is equal parts creepy and impressive, a team of researchers connected a humanoid robot to GPT-4, the artificial intelligence model that powers ChatGPT.

In the year since the launch of ChatGPT by OpenAI, the underlying technology has been used to generate everything from corny jokes to novels. It has been upgraded to understand video, audio, images, and now — movement.

Alter3 is a humanoid robot built by a team from the University of Tokyo. With the help of GPT-4 it can now adopt a range of poses including a selfie stance or pretending to be a ghost and does so without those poses having first to be programmed into its database.

Why is this an important development?

See more

Previously low-level robot control, such as movement, has fallen outside the scope of large language models as it is hardware-dependent and not covered in training data.

To overcome this, the Japanese team found a way to translate the expression of human movement into code the android could understand.

This allowed Alter3 to generate spontaneous motion. It won't carry your bags or run a race anytime soon as its motion is restricted to its upper body. Its lower body is fixed to a stand.

How does Alter3 work?

The robot can create a range of movements on the fly

(Image credit: University of Tokyo)

This means that the robot can generate sequences of actions over time without developers having to program for each body part. It can learn to move, much like a toddler would when first moving from shuffling, to crawling and on to walking.

Users can also adjust the pose or help the android understand the difference between a dab and an infinite dab. Part of me wonders whether the researchers just wanted a way to test new Fortnite emotes outside of the game.

A human talking to Alter3 can say something like "take a selfie with your iPhone" and the robot will run a series of calls to OpenAI's GPT-4, including asking the AI what is involved in taking a selfie. GPT-4 will then turn that into Python code that the robot can understand and use to make the necessary motions.

What happens next?

Alter3 can respond to human commands

(Image credit: University of Tokyo)

The team behind Alter3 and the new GPT-4-based text-to-movement discovery plan to make their findings available for use in other humanoid robots. This, they say, is a significant advancement in robot development.

“Remarkably, Alter3 can mimic ghosts, animals, and human-like expressions of emotions. Its response to conversational content through facial expressions and gestures represents a significant advancement in humanoid robotics, easily adaptable to other androids with minimal modifications,” they wrote in the research paper."

While a robot doing a dab or pretending to be a ghost might not be much of a worry for the future of humanity today, it only takes one to go rogue and we’ve got a Terminator on our hands.

More from Tom's Guide

Ryan Morrison
AI Editor

Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover.
When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?