OpenAI's smartest model could cost up to $30,000 per task according to estimates
OpenAI's o3 was put to the test and while it passed, it did so at a huge cost
Back in December, OpenAI unveiled a reasoning model known as 'o3'. Smarter and more deliberate in its decision-making, it marked OpenAI's push toward AI that thinks through problems step by step. But there's just one problem.
When the model was first unveiled, OpenAI teamed up with the Arc Prize Foundation. This group runs ARC-AGI, a benchmark designed to test highly capable AI models. Its puzzles are simple enough that most members of the public could solve them easily. But that's not the case for AI.
The foundation originally estimated that it cost around $3,000 for o3 to solve a single complex problem. That's no longer the case, with re-estimates putting the figure nearer $30,000 per task.
So, while o3 scored highly on the test, it did so at an astronomical cost, one that shows just how expensive cutting-edge AI models are to run.
What does this all mean?
OpenAI's o3 is separate from the models that power ChatGPT.
Specifically, o3 is a reasoning model: it is designed for complex problem-solving and is therefore used for tasks that demand a higher level of computing power.
When developers use this model, they can select from three levels of reasoning effort: low, medium, and high. This is essentially asking the model to think harder. Increase the reasoning effort and you'll get a better result, but that comes with an increase in computing power and cost.
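To make that concrete, here is a minimal sketch of how a developer might build such a request. It assumes OpenAI's Chat Completions API, where o-series reasoning models accept a `reasoning_effort` parameter ("low", "medium", or "high"); the prompt text and the helper function are illustrative, not from the article.

```python
def build_request(prompt: str, effort: str = "medium") -> dict:
    """Build Chat Completions parameters for an o-series reasoning model.

    Higher reasoning effort means more reasoning tokens, and therefore
    more compute and a higher bill per request.
    """
    if effort not in ("low", "medium", "high"):
        raise ValueError(f"unknown reasoning effort: {effort}")
    return {
        "model": "o3-mini",          # the o-series model publicly priced today
        "reasoning_effort": effort,  # the knob discussed above
        "messages": [{"role": "user", "content": prompt}],
    }

# A developer would pass this to client.chat.completions.create(**request).
request = build_request("Solve this ARC-style puzzle...", effort="high")
```

The trade-off lives entirely in that one parameter: the same prompt at "high" effort can consume many times the compute of the same prompt at "low".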
In the ARC-AGI test, it was o3 (high) that came with such a massive price tag. In other words, this is OpenAI's most capable model, pushed to the absolute limits of its computing power.
Arguably, in a test like this, the model overdid it. Toby Ord, a senior researcher at Oxford University, highlighted on X that it took o3 (high) 1,024 attempts at each task, writing roughly 137 pages of text per attempt, or around 43 million words in total.
"Finally, I want to note how preposterous the o3-high attempt was. It took 1,024 attempts at each task, writing about 137 pages of text for each attempt, or about 43 million words total. That's writing an Encyclopedia Brittanica (44 million words) per task!" — Toby Ord on X, April 2, 2025
According to the Arc Prize Foundation, o3 (high) used 172 times more computing power than o3 (low). While the low-effort version didn’t score as highly, it could end up being a more cost-effective version.
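A quick back-of-the-envelope calculation shows why the low-effort setting could be attractive. Assuming cost scales roughly linearly with compute (an assumption, not a figure from the foundation), the article's two numbers imply a far smaller per-task price for o3 (low):

```python
# Figures from the article: o3 (high) reportedly used 172x the compute of
# o3 (low), and re-estimates put o3 (high) near $30,000 per ARC-AGI task.
HIGH_COST_PER_TASK = 30_000   # revised estimate, USD
COMPUTE_MULTIPLIER = 172      # o3 (high) vs o3 (low)

# If cost tracks compute linearly, the low-effort run would cost about $174.
implied_low_cost = HIGH_COST_PER_TASK / COMPUTE_MULTIPLIER
print(f"Implied o3 (low) cost per task: ~${implied_low_cost:,.0f}")
```

Roughly $174 versus $30,000 per task, for a score that was lower but not 172 times lower.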
For now, OpenAI hasn't announced how much o3 will cost for developers to use. A more cost-efficient version known as o3-mini is available, which, while cheaper, is nowhere near as powerful.
For the average person, this pricing will have no effect. ChatGPT remains one of OpenAI's more cost-effective offerings, and o3, when it becomes available, will only be used by those at the cutting edge of AI.
However, as the likes of DeepSeek and Anthropic's Claude start to cut costs and develop more affordable complex models, OpenAI could quickly find itself as the most expensive provider of the bunch.
Alex is the AI editor at Tom's Guide. Dialed into all things artificial intelligence in the world right now, he knows the best chatbots, the weirdest AI image generators, and the ins and outs of one of tech's biggest topics.
Before joining the Tom’s Guide team, Alex worked for the brands TechRadar and BBC Science Focus.
He was highly commended in the Specialist Writer category at the 2023 BSMEs and was part of the team that won best podcast at the 2025 BSMEs.
In his time as a journalist, he has covered the latest in AI and robotics, broadband deals, the potential for alien life, the science of being slapped, and just about everything in between.
When he’s not trying to wrap his head around the latest AI whitepaper, Alex pretends to be a capable runner, cook, and climber.