Is GPT-5 a "phenomenal" success or an "underwhelming" failure?

Maybe it's a bit of both.

Aug 14, 2025

∙ Paid

It was inevitable that people would be disappointed with last week’s release of GPT-5. That’s not because OpenAI did a poor job, and it’s not even because OpenAI did anything in particular to hype up the new version. The problem was simply that OpenAI’s previous “major” model releases—GPT-2, GPT-3, and GPT-4—have been so consequential:

GPT-2 was the first language model that could write coherent sentences and paragraphs across a wide range of topics.
GPT-3 was the first language model that could be prompted to perform a wide range of tasks without retraining.
GPT-4 delivered such a dramatic performance gain that it took competitors a year to catch up.

So of course people had high expectations for GPT-5. And OpenAI seems to have worked hard to meet those expectations.

After OpenAI released GPT-4.5 back in February, I argued that you could think of it as the model everyone was expecting to be called GPT-5. It was a much larger model than GPT-4 and was trained with a lot more compute. Unfortunately, its performance was so disappointing that OpenAI called it GPT-4.5 instead. Sam Altman gave it a distinctly half-hearted introduction, calling it a “giant, expensive model” that “won’t crush benchmarks.”

OpenAI probably should have given the GPT-5 name to o1, the reasoning model OpenAI announced last September. That model really did deliver a dramatic performance improvement over previous models. It was followed by o3, which pushed this paradigm—based on reinforcement learning and long chains of thought—to new heights. But we haven't seen another big jump in performance over the last six months, suggesting that the reasoning paradigm may also be reaching a point of diminishing returns (though it’s hard to know for certain).

Regardless, OpenAI found itself in a tough spot in early 2025. It needed to release something it could call GPT-5, but it didn’t have anything that could meet the sky-high expectations that had developed around that name. So rather than using the GPT-5 name for a dramatically better model, it decided to use it to signal a reboot of ChatGPT as a product.

Sam Altman explained the new approach back in February. “We realize how complicated our model and product offerings have gotten,” Altman tweeted. “We hate the model picker as much as you do and want to return to magic unified intelligence.”

Altman added that “we will release GPT-5 as a system that integrates a lot of our technology, including o3. We will no longer ship o3 as a standalone model.”

This background helps to explain why the reactions to GPT-5 have been so varied. Smart, in-the-trenches technologists have praised the release, with Nathan Lambert calling it “phenomenal.” On the other hand, many AI pundits—especially those with a skeptical bent—have panned it. Gary Marcus, for example, called it “overdue, overhyped and underwhelming.”

The reality is that GPT-5 is a solid model (or technically suite of models—we’ll get to that) that performs as well or better than anything else on the market today. In my own testing over the last week, I found GPT-5 to be the most capable model I’ve ever used. But it’s not the kind of dramatic breakthrough people expected from the GPT-5 name. And it has some rough edges that OpenAI is still working to sand down.

A new product, not just a new model

When OpenAI released ChatGPT back in 2022, the organization was truly a research lab. It did have a commercial product—a version of GPT-3 developers could access via an API—but that product had not yet gained much traction. In 2022, OpenAI’s overwhelming focus was on cutting-edge research that advanced the capabilities of its models.

Things are different now. OpenAI still does a lot of research, of course. But it also runs the world’s leading AI chatbot—one with hundreds of millions of weekly average users. Indeed, some rankings show ChatGPT as the fifth most popular website in the world, beating out Wikipedia, Amazon, Reddit, and X.com.

So in building GPT-5, OpenAI executives were thinking not only about how to advance the model’s raw capabilities, but also how to make ChatGPT a more compelling product.

Keep reading with a 7-day free trial

Subscribe to Understanding AI to keep reading this post and get 7 days of free access to the full post archives.