ElevenLabs now lets you create a custom voice from a text prompt: here's why that's exciting
Sights and sounds.
In just a few short months we've seen all kinds of generative AI possibilities emerge from text prompts. Whether it's music with lyrics, new video tools, or a combination of the two with music videos, generative AI is expanding swiftly.
Still, dialogue (outside of more robotic text-to-speech options and dedicated chatbots) can be difficult to achieve. ElevenLabs has one of the best synthetic voice models, which can also clone real voices (with permission and licensing agreements in place). Its latest project takes things a step further and allows you to design a voice from scratch.
I think lending a voice to a character should remain the area of expertise for trained voice actors, but that isn't always viable. ElevenLabs' new Voice Design engine can create a voice from a text prompt in a matter of seconds, and it can then voice anything you want.
This helps when you need a very specific voice for a project, or when you need to change what is being said in real time based on the flow of a game.
Why this is important
Introducing Voice Design. Generate a unique voice from a text prompt alone. Is our library missing a voice you need? Prompt your own. pic.twitter.com/ZR21fMb7q7 (October 23, 2024)
If you've ever spent time playing Dungeons and Dragons, you'll know that the right voice can make all the difference when it comes to setting the scene.
The Voice Design tool should help players create their own deep backstories and lore, but it also democratizes the technology for indie developers who are working solo on ambitious new projects.
Tying it in with AI video generation tools like ElevenLabs' own could mean film students are able to create a world, characters, and interactions within a single project, working entirely by themselves.
ElevenLabs' examples show that the more detailed the prompt, the better, with "an old British male with a raspy, deep voice. Professional, relaxed and assertive" a good example of the power on offer. I'm not sure anyone British calls someone "old bean" anymore, mind, so stereotypes abound.
A freelance writer from Essex, UK, Lloyd Coombes began writing for Tom's Guide in 2024 having worked on TechRadar, iMore, Live Science and more. A specialist in consumer tech, Lloyd is particularly knowledgeable on Apple products ever since he got his first iPod Mini. Aside from writing about the latest gadgets for Future, he's also a blogger and the Editor in Chief of GGRecon.com. On the rare occasion he’s not writing, you’ll find him spending time with his son, or working hard at the gym. You can find him on Twitter @lloydcoombes.
- Ryan Morrison, AI Editor