Meta's MusicGen AI makes high-quality music clips from text — here's how to try it

A robot playing music
(Image credit: Shutterstock)

Meta may not have a ChatGPT clone available for the public yet, but it did just announce a new AI tool you can try today. MusicGen is a new, open-source AI tool that allows you to combine text prompts and clips of existing music to create entirely new sounds.

Meta announced the news via tweet and showed off a couple of examples of MusicGen in action. First, they took a Bach organ melody and gave it the prompt “An 80s driving pop song with heavy drums and synth pads in the background.” 

And sure enough — it works. The AI tool spits out an entirely new clip that sounded an awful lot like an 80s synth-pop track. Ditto for when they took Boléro and transformed it into “An energetic hip-hop music piece, with synth sounds and strong bass. There is a rhythmic hi-hat pattern in the drums.” The AI tool doesn’t deviate far from the melody, but it produces new clips just with text and audio context.

So after seeing it in action, I decided to put it to the test.

Meta MusicGen AI: What it's like to use 

First, I thought I’d play it safe and recreate the first example Meta provided in its tweet. Thankfully, that was easy since the MusicGen Hugging Face page — where you can try the AI tool for free — provides you with a couple of examples you can test out. And the first one is that same Bach melody turned into a synth-pop track from the Meta demo.

Unfortunately, that Meta demo left out a crucial detail. If you use MusicGen, you’re going to be waiting for a while. Just generating a 12-second clip took me several minutes.

And if you think it’ll be easier just to set up the tool on your own machine and run it locally — think again. The AI tool requires a GPU with at least 16GB of memory, which means your laptop probably won’t cut it.

One final note: MusicGen works best if you provide it with a melody to work off of. I tested it on text only, asking it to produce a clip of “80s hardcore punk in the style of Bad Religion” and the results were mixed. Tonally, it sounded like punk, but definitely didn’t sound like Bad Religion and wasn’t fast-paced enough. In general, I found that MusicGen struggles a bit with tempo unless you give it specific beats per minute to follow.

How to use Meta MusicGen AI

Want to try out MusicGen for yourself? Here’s how to check it out.

Meta MusicGen AI text-to-audio tool

(Image credit: Future)

First, go to Meta’s Hugging Face page for MusicGen. This allows you to try the AI tool as a free demo that will create 12-second clips of AI-generated music.

Meta MusicGen AI text-to-audio tool

(Image credit: Future)

Next, enter in a description of your music in the first text field, then upload a melody for the AI tool to follow.  

Meta MusicGen AI text-to-audio tool

(Image credit: Future)

If you want to make things simple, you can use one of the examples above just by clicking on it. Click on an example to automatically fill the two input fields. 

Meta MusicGen AI text-to-audio tool

(Image credit: Future)

Once your Describe your music and Condition and the melody fields are filled (the latter is optional) click Generate. This will start the AI tool and put your text-to-music request in a queue. 

Meta MusicGen AI text-to-audio tool

(Image credit: Future)

Once your music clip is generated, the generated music box will show a graphic resembling sound waves and become a media player. Click play and enjoy your AI music! 

More from Tom's Guide

Malcolm McMillan
Senior Streaming Writer

Malcolm McMillan is a senior writer for Tom's Guide, covering all the latest in streaming TV shows and movies. That means news, analysis, recommendations, reviews and more for just about anything you can watch, including sports! If it can be seen on a screen, he can write about it. Previously, Malcolm had been a staff writer for Tom's Guide for over a year, with a focus on artificial intelligence (AI), A/V tech and VR headsets.

Before writing for Tom's Guide, Malcolm worked as a fantasy football analyst writing for several sites and also had a brief stint working for Microsoft selling laptops, Xbox products and even the ill-fated Windows phone. He is passionate about video games and sports, though both cause him to yell at the TV frequently. He proudly sports many tattoos, including an Arsenal tattoo, in honor of the team that causes him to yell at the TV the most.