GPT-4o advanced voice 'accidentally' leaked out to some users — here's what happened
It can generate sound effects
OpenAI announced earlier this week that most users would have to wait until the fall to get access to the Advanced Voice feature of GPT-4o, but it seems some lucky people received a sneak peek at just what is possible with the next-generation voice assistant.
Reddit user RozziTheCreator was one of the lucky few. They shared a recording of a new GPT-4o voice we haven't heard before telling a horror story, complete with sound effects tied to the story such as thunder and footsteps. AI writer Sambhav Gupta first highlighted the clip on X, bringing it to wider attention.
It seems Rozzi getting access was a mistake. OpenAI told me in a statement that some users were given access to the model by accident but that this has now been corrected.
What can we hear in the leaked video?
Every video we’ve had of GPT-4o advanced voice so far has been under OpenAI control, and while they’ve sounded amazing, they have been restricted to tailored use cases.
The new video by RozziTheCreator seems to show the capability in a more natural way, including a sound effects feature we haven’t heard before.
I messaged RozziTheCreator about the experience and they said: “It just suddenly came up, it did look the same the only difference was the voice.” The discovery happened late at night when RozziTheCreator was trying to ask the chatbot a question: “Boom I discovered the change.”
It only lasted a few minutes and, according to RozziTheCreator, “it was very buggy,” so there wasn’t time to get much out, but they managed to record a snippet of this amazing story.
“It started going insane repeating and replying to things I didn't say,” according to RozziTheCreator, before going back to the normal basic voice everyone else can already use.
In the video, you can hear GPT-4o eagerly telling the tale in a casual way, backed by sound effects. It begins: “Picture this, there's this small town, everybody knows everybody kind of video and there is this small house at the end of the street.”
It continues the tale of two teens checking the house during the storm with “nothing but a flashlight and their phones for light.”
So what went wrong with the rollout?
OpenAI is rolling out a whole host of new features slowly. The first Plus users were supposed to get GPT-4o advanced voice this month, but the launch was delayed due to some security issues and concerns over whether the company had the hardware infrastructure in place.
I asked OpenAI what happened that led to RozziTheCreator getting access, and a spokesperson told me: “While testing the feature, we inadvertently sent invites to a small number of ChatGPT users. This was a mistake and we’ve fixed it.”
They confirmed that the first few Plus users will get access next month, but for most people, it will be a while longer. OpenAI said it will use the initial rollout to “gather feedback, and plan to expand based on what we learn.”
So, no GPT-4o voice yet, but this is the latest in a series of examples of GPT-4o seemingly wanting to break free of its restraints and serve up its full capabilities. I’ve seen examples myself of it analyzing audio files directly one minute, then running them through code the next.
What this has done is make me even more excited for its full capabilities and even more annoyed at the delay, however understandable it might be.
Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover. When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?