I used Google’s Veo 3 to create AI ASMR food videos

(Image credit: Veo 3 / Ryan Morrison)

Google’s Veo 3 AI video model is a league above any of its competitors for one key reason — sound. You can prompt not just what you see on screen, but also what you hear.

Built by Google's DeepMind lab, the first Veo model debuted in May 2024, and each new generation has added more functionality. It has always excelled in motion accuracy and physics understanding compared to competitors, but the addition of sound was a game-changer.

You can use it to prompt a short commercial, a scene from a movie you’re writing, or even a music video. But there’s one use I’ve seen more than any other — ASMR (autonomous sensory meridian response): those gentle tapping, whispering, and ambient sounds that trigger a tingling sensation for some people.

To see just how far this could go, I created a series of ASMR food prompts — each designed to generate a matching video and sound around something culinary.

Prompting Veo 3 in the Gemini app

Veo 3 is now available in the Gemini app. Just select the Video option when starting a new prompt, type what you want, and an 8-second clip is generated.

While Gemini isn’t necessarily the best way to access Veo 3 — I’d recommend Freepik, Fal, Higgsfield, or Google Flow — it’s easy to use and gets the job done.

A key advantage of using Gemini directly is that it automatically interprets and enhances your prompts. So if you ask for “a cool ASMR video featuring lasagna,” that’s what you’ll get.

You can also be more specific using something called structured prompting — labeling each moment with timestamps and scene descriptions. But unless you need precise control, a simple paragraph (aka narrative prompting) is usually more effective.
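As a rough illustration of what a structured prompt can look like, the sketch below assembles timestamped lines from a list of moments. The `Beat` class, the `structured_prompt` helper, and the bracketed timestamp layout are all assumptions made for this example; Veo 3 does not require this exact format, and the fizzy-drink wording is illustrative rather than the prompt used for this article.

```python
from dataclasses import dataclass


@dataclass
class Beat:
    """One timestamped moment in a clip (hypothetical helper, not a Veo type)."""
    start: float  # seconds from the start of the clip
    end: float    # seconds from the start of the clip
    visual: str
    sound: str


def structured_prompt(beats, clip_seconds=8.0):
    """Render beats as labeled, timestamped lines of plain prompt text."""
    lines = []
    previous_end = 0.0
    for beat in beats:
        # Beats must run in order, must not overlap, and must fit in the clip.
        if beat.start < previous_end or beat.end <= beat.start:
            raise ValueError(f"bad timing for beat starting at {beat.start:g}s")
        if beat.end > clip_seconds:
            raise ValueError(f"beat ends after the {clip_seconds:g}-second clip")
        lines.append(
            f"[{beat.start:g}s-{beat.end:g}s] "
            f"Visual: {beat.visual} Sound: {beat.sound}"
        )
        previous_end = beat.end
    return "\n".join(lines)


# Illustrative beats for a fizzy drink with ice (assumed wording).
fizzy_drink = [
    Beat(0, 3, "Ice cubes drop into a tall glass.", "Sharp clinks of ice on glass."),
    Beat(3, 8, "Soda pours over the ice in close-up.", "Steady fizz and crackle, no music."),
]
prompt = structured_prompt(fizzy_drink)
print(prompt)
```

The resulting text can be pasted into the prompt box as-is. The timing check is only there to catch beats that overlap or run past the 8-second clip length mentioned above.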

Creating the prompts

The first task in any AI project is thinking about your prompt. Models are getting better at interpreting intent, but it’s still better to be specific if you know what you want.

I knew I wanted ASMR food videos, so I started with a test: “ASMR food video with sound.”

The result? Decent. It essentially gave me the lasagna I had in mind. Then I refined it — outlining specific food types, adding sound descriptions, and even trying a structured prompt for a fizzy drink with ice.

Most of the time, narrative prompts work best. Just describe what you want to see, the flow of the video, and how sound should come through.

1. Lasagna sizzling from the pan

(Video: Google Veo 3 lasagne video, YouTube)

The first prompt, “ASMR food video with sound,” produced a stunning clip of someone sliding a fork into a slice of lasagna. You hear the squish as the fork enters, then the clunk as it hits the plate. This is one case where I wish Veo 3 had an “extend clip” button.

There was no other prompting involved, so I had no way of knowing in advance what the food would be, how the sound would come out, or even whether the sound would work. This is why it's important to be specific when prompting AI models, even ones in chatbots like Gemini.

2. Cooking and eating

(Video: Google Veo 3 cooking video, YouTube)

Next, I went more specific — a longer, narrative-style prompt asking Veo 3 to generate a close-up of a chef preparing and eating satisfying food in a well-lit kitchen.

I asked for slow-motion visuals of ingredients being chopped, the sizzling sound of butter melting in a pan, and a crunch as the chef takes a bite.

I also added this line: “Emphasize audio quality: clean, layered ASMR soundscape without music” to direct not just the sound, but also the style of sound and what I didn't want to hear.
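One way to keep those pieces organized is to build the paragraph from separate parts: the scene, the visuals, the sounds, and a closing audio directive. The sketch below does that with a hypothetical `narrative_prompt` helper. It is a reconstruction from the description above, not the exact prompt used for the video, and the "Show" and "We hear" sentence openers are assumptions.

```python
def narrative_prompt(scene, visuals, sounds, audio_directive=""):
    """Join a scene, visual details, and sound cues into one prompt paragraph."""
    sentences = [scene.rstrip(".") + "."]
    if visuals:
        sentences.append("Show " + ", ".join(visuals) + ".")
    if sounds:
        # Sound cues are listed in the order they should be heard.
        sentences.append("We hear " + ", then ".join(sounds) + ".")
    if audio_directive:
        # The directive covers the style of sound and what to leave out.
        sentences.append(audio_directive.rstrip(".") + ".")
    return " ".join(sentences)


prompt = narrative_prompt(
    scene="Close-up of a chef preparing and eating satisfying food in a well-lit kitchen",
    visuals=["slow-motion shots of ingredients being chopped"],
    sounds=["butter sizzling as it melts in a pan", "a crunch as the chef takes a bite"],
    audio_directive="Emphasize audio quality: clean, layered ASMR soundscape without music",
)
print(prompt)
```

Keeping the audio directive as its own argument makes it easy to reuse the same "no music" instruction across different food prompts.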

3. Popcorn popping

(Video: Google Veo 3 popcorn video, YouTube)

For the final prompt, I started with an image. I used Midjourney v7 to create a picture of a woman looking at rainbow popcorn, then added the prompt “ASMR food” in Gemini.

Visually, the result was stunning — but for some reason, the woman says in a voiceover, “This is delicious, this rainbow popcorn.” That’s on me — I didn’t specify whether she should speak, or what she should say.

A simple fix: put any speech you want in quotes. For example, I could have prompted her to say “I love to watch popcorn pop,” and emphasized the word pop. I also could’ve specified that she was speaking on camera — and Veo 3 would have synced the lip movement to match.
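The quoting rule is easy to apply consistently with a small helper. The sketch below uses a hypothetical `speech_clause` function that wraps the spoken line in double quotes and states whether it is delivered on camera or as a voiceover. The exact phrasing ("says on camera", "stressing the word") is an assumption for illustration, not wording Veo 3 is known to require.

```python
def speech_clause(speaker, line, on_camera=True, emphasize=None):
    """Build a prompt sentence that puts the spoken line in double quotes."""
    if '"' in line:
        raise ValueError("spoken line must not contain double quotes")
    delivery = "says on camera" if on_camera else "says in a voiceover"
    if emphasize:
        # Only ask for emphasis on a word that is actually spoken.
        if emphasize.lower() not in line.lower():
            raise ValueError(f"{emphasize!r} does not appear in the spoken line")
        delivery += f', stressing the word "{emphasize}"'
    return f'{speaker} {delivery}: "{line}"'


prompt = speech_clause(
    "The woman",
    "I love to watch popcorn pop.",
    on_camera=True,
    emphasize="pop",
)
print(prompt)
```

The returned sentence can be appended to the rest of the prompt, such as the short “ASMR food” text used with the popcorn image.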

Conclusion

Overall, Veo 3 delivers impressive results, especially when it comes to generating high-quality sound that accurately reflects the visuals. While there are a few quirks to navigate, like unintended voiceovers or slightly underbaked-looking lasagna, these are easily addressed with more specific prompting.

Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on AI and technology speak for him than engage in this self-aggrandising exercise. As the former AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover.
When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing.
