Claude AI has been continously playing Pokémon Red for over a month — it still can't beat it
Can't catch 'em all.
Here at Tom’s Guide our expert editors are committed to bringing you the best news, reviews and guides to help you stay informed and ahead of the curve!
You are now subscribed
Your newsletter sign-up was successful
Want to add more newsletters?
Daily (Mon-Sun)
Tom's Guide Daily
Sign up to get the latest updates on all of your favorite content! From cutting-edge tech news and the hottest streaming buzz to unbeatable deals on the best products and in-depth reviews, we’ve got you covered.
Weekly on Thursday
Tom's AI Guide
Be AI savvy with your weekly newsletter summing up all the biggest AI news you need to know. Plus, analysis from our AI editor and tips on how to use the latest AI tools!
Weekly on Friday
Tom's iGuide
Unlock the vast world of Apple news straight to your inbox. With coverage on everything from exciting product launches to essential software updates, this is your go-to source for the latest updates on all the best Apple content.
Weekly on Monday
Tom's Streaming Guide
Our weekly newsletter is expertly crafted to immerse you in the world of streaming. Stay updated on the latest releases and our top recommendations across your favorite streaming platforms.
Join the club
Get full access to premium articles, exclusive features and a growing list of member rewards.
Artificial Intelligence is likely to lead to huge advances in human technology in the coming years, but if you're wondering what it can't do, the answer is beat a classic RPG that's nearly 30 years old.
Pokémon Red and Blue debuted in Japan in 1996, coming to the rest of the world in 1998, and while it led many of us into a lifetime of card collecting and monster battling, for AI model Claude it's led to no end of frustration.
Anthropic's latest reasoning model — Claude 3.7 Sonnet — has been tasked with beating the iconic title on a livestream hosted on Twitch. It's billed as "a passion project made by a person who loves Claude and loves Pokémon" and has been going for over a month at this point.
Why AI can't beat Pokemon Red?
The model is taking notes as it progresses, updating its knowledge base of Pokémon, their types, weaknesses, and attacks.
It's also explaining its decisions in real-time through a text box, which is really neat — but at the time of writing, it was constantly encountering Zubats in Mt. Moon.
Despite having Pokémon at Levels 15. 24, 36, and even 39, it hadn't made it through the dungeon area — but at least it's racking up the XP.
There's an argument that because there's no "golden path" to get from A to Z throughout Pokémon, with random encounters and Pokémon trainers popping up regularly and no easy way for an AI to stick with a set "team" of critters.
Get instant access to breaking news, the hottest reviews, great deals and helpful tips.
Will it ever beat it?
Given infinite time, it's possible that Claude could beat Pokémon, but there's no telling how long it'll take. At least the current version is apparently better than prior versions, according to a Reddit thread following the action.
Anthropic has said the agent just kept trying to run away from battles originally, but it's getting better.
"Within hours, Claude defeated Brock [the game's first gym leader],"
"Days later, it trounced Misty [the second]" Anthropic said, describing the victories as "Progress that older models had little hope of achieving."
More from Tom's Guide
- I just ‘met my younger self for coffee’ with mind-blowing AI video — now this is creepy
- I wasn’t sold on ChatGPT — until I tried these 5 prompts
- I'm a long-time iPhone owner, but I rarely use Apple Intelligence - here's why

Lloyd Coombes is a freelance tech and fitness writer. He's an expert in all things Apple as well as in computer and gaming tech, with previous works published on TechRadar, Tom's Guide, Live Science and more. You'll find him regularly testing the latest MacBook or iPhone, but he spends most of his time writing about video games as Gaming Editor for the Daily Star. He also covers board games and virtual reality, just to round out the nerdy pursuits.
You must confirm your public display name before commenting
Please logout and then login again, you will then be prompted to enter your display name.
