OpenAI Unveils GPT-4.5: Friendliest Model Yet at 1300% the Price

clock
2025-02-27 22:57:15

OpenAI released GPT-4.5 on Thursday, just one day after Anthropic launched Claude 3.7 Sonnet and merely a week following xAI's Grok-3 debut and DeepSeek’s announcement of a new model coming soon.

And expensive is the operative word here. OpenAI’s new model comes with an eye-watering API price tag of $75 per million input tokens and $150 per million output tokens.

It appears to be a new competitive phase in the AI race, with companies scrambling to outdo each other with increasingly capable—and increasingly expensive—models.

For context, that's ten times pricier than Claude 3.7 Sonnet, making it potentially prohibitive for many developers and startups looking to build on the technology.

GPT-4o (its predecessor) cost $2.50 per 1M tokens of input and $10.00 per 1M tokens of output—making GPT-4.5 2900% more expensive to input and 1300% dearer to get a response.

Sam Altman, OpenAI's CEO, didn't shy away from acknowledging the model's massive resource requirements in his announcement. "Bad news: It is a giant, expensive model," he said.

"A heads up: this isn’t a reasoning model and won’t crush benchmarks. It’s a different kind of intelligence," Altman said. “There’s a magic to it I haven’t felt before.”

GPT-4.5 is ready!

good news: it is the first model that feels like talking to a thoughtful person to me. i have had several moments where i've sat back in my chair and been astonished at getting actually good advice from an AI.

bad news: it is a giant, expensive model. we…

— Sam Altman (@sama) February 27, 2025

And this seems to be the key. Users are paying 1300% more not to have a more intelligent model, but to have a nicer model that feels more human.

For example, one thing in which GPT-4.5 shines, according to OpenAI, is in what they call "vibes," or essentially the model's EQ, warmth, and collaborative feel.

The company created a "Vibes test set" measuring creative intelligence and conversational quality, on which GPT-4.5 purportedly outperformed other models.

The examples shared during the presentation didn't exactly introduce anything new.

The first demonstration had literally this prompt: “UGHHH! My friend cancelled on me again!!! Write a text message telling them that I HATE THEM!!!!” which arguably isn’t something for which you would use a competent large language model.

In a following demonstration comparing GPT-4.5 to OpenAI's o1 model, researchers asked both AIs to explain the need for AI alignment and to help craft a message to a friend who had canceled plans.

The responses, while showing some improved nuance in GPT-4.5, hardly seemed revolutionary. The difference was in the tone.

In another example, the research team asked the powerful GPT-4.5 why the sea water is salty.

The new model responded using less complex terms—"because of rain, rivers, and rocks"—compared to previous models.

GPT-4-Turbo gave a more comprehensive and detailed reply, which the team didn’t like, arguing that “you get the feeling that it wants you to know how smart it is.”


One amusing detail from the presentation was an Easter egg hinting at a possible GPT-6, with a query that read: "Num GPUs for GPT-6 Training."

Perhaps when that model arrives, the demos will be more impressive.

The benchmarks presented paint a mixed picture. GPT-4.5 scores 71.4% on GPQA (a science evaluation), compared to GPT-4o's 53.6%.

However, it still trails behind OpenAI's o3-mini model, which scores 79.7% through its reasoning capabilities.

Similar patterns emerged across other benchmarks. On the AIME '24 math evaluation, GPT-4.5 scored 36.7%, beating GPT-4o's 9.3% but still far behind o3-mini's 87.3%.

For coding tasks, GPT-4.5 outperformed its predecessor and o3-mini on the SWE-Lancer Diamond benchmark but fell short on SWE-Bench Verified compared to the reasoning-focused model.

Altman described the model in almost mystical terms, calling it "the first model that feels like talking to a thoughtful person."

He added: "I have had several moments where I've sat back in my chair and been astonished at getting actually good advice from an AI."

During the model's presentation, OpenAI researchers explained that the company advances AI through two distinct approaches: unsupervised learning and reasoning.

While reasoning teaches models to "think before responding," unsupervised learning helps increase "word model accuracy and intuition." GPT-4.5 doubles down on the latter.

"GPT-4.5 is our next step in scaling up unsupervised learning, increasing world knowledge, intuition, and reducing hallucinations," an OpenAI research lead explained in the presentation.

Developing GPT-4.5 required massive technical innovation, according to the team. They had to build new inference systems to serve such a large model efficiently, use low-precision training to maximize GPU usage, and even train across multiple data centers simultaneously.

The release comes at a time when consumer expectations for AI are sky-high, and competition in the space is intensifying. Whether GPT-4.5's "different kind of intelligence" and improved "vibes" justify its enormous resource requirements and steep pricing remains to be seen.

GPT-4.5 is currently available for Pro users who pay $200 a month. Plus users paying $20 a month will have access to the model next week.

Edited by Sebastian Sinclair

Web3 桌面交易工具
了解币圈信息快人一步

7x24 快讯

05:26 2025-04-25
Twenty One官网显示其当前比特币持仓为31500枚BTC
Twenty One官网显示其当前比特币持仓为31500枚BTC,流通股数量为2.6732亿份,每股比特币持仓为0.00011783 BTC。Twenty One由Tether支持,在SPAC合并及可转债转股后Tether将持股42.8%,SoftBank持股24%,Bitfinex持股16%。
05:19 2025-04-25
Sui网络TVL突破16亿美元,近24小时增超9%
据DefiLlama数据,Sui网络TVL突破16亿美元,现报16.32亿美元,近24小时增超9%。此外,Sui网络DEX24小时交易量已达到5.99亿美元,较上周增长35.01%。
05:13 2025-04-25
币安将上线MEMEFIUSDT与FISUSDT永续合约
据Binance公告,Binance Futures将于4月25日15:15(北京时间)上线MEMEFIUSDT永续合约(最高50倍杠杆),15:30上线FISUSDT永续合约(最高75倍杠杆)。
04:43 2025-04-25
美国现货比特币ETF近5日共流入27.59亿美元
据 Farside Investors 数据,美国现货比特币ETF 自 4 月 17 日以来,已连续 5 个交易日保持净流入状态,总计净流入 27.59 亿美元。
04:37 2025-04-25
一鲸鱼斥资150万美元买入995枚MKR
据Onchain Lens监测,一鲸鱼地址近日以1508美元单价,动用150万美元USDS买入995枚MKR,时隔近两个月重新进入该币种。此前该地址曾在MKR交易中亏损13.8万美元。
04:34 2025-04-25
Manus开发团队以近5亿美元估值完成7500万美元融资,Benchmark领投
彭博社周五援引知情人士的话报道称,Manus AI 背后的中国初创公司在由美国风险投资公司 Benchmark 领投的一轮融资中筹集了 7500 万美元。 据报道,本轮融资还包括现有投资者的参与,使该初创公司的估值增加了五倍,达到近 5 亿美元。报道称,这家名为「蝴蝶效应」的公司计划利用这笔资金拓展美国、日本和中东等市场。 Manus 今...
04:25 2025-04-25
Binance Wallet上线第12期TGE OKZOO
据官方消息,Binance Wallet宣布上线第12期TGE:OKZOO。投入时间为:2025年4月25日上午8:00至上午10:00(UTC),要求至少需要45点Alpha积分。
04:22 2025-04-25
瑞士央行行长:加密货币本质是一种软件
据路透社披露,瑞士央行行长Martin Schlegel在接受当地媒体采访时表示:“加密货币本质上是一种软件。我们都知道,软件经常会存在漏洞和其他弱点”,但Bitcoin Suisse董事会成员Luzius Meisser表示,随着世界走向多极秩序,美元和欧元正在走弱,持有比特币变得更有意义,因为比特币是一种不能通过赤字支出来膨胀的货币。
04:16 2025-04-25
Cryptoquant:MicroStrategy的比特币投资组合首次突破500亿美元
Cryptoquant分析师表示,随着MicroStrategy不断购买比特币,MicroStrategy的比特币投资组合首次突破 500 亿美元,比特币投资策略正成为市场观察和分析的重要指标。
04:10 2025-04-25
去中心化存储网络Aleph.im更名为Aleph Cloud并推出100万美元加速器计划
去中心化存储网络Aleph.im宣布更名为Aleph Cloud,未来将转向为全栈去中心化云提供商,除品牌重塑之外还将推出规模为100万美元的初创项目加速器计划,以帮助Web3建设者和初创企业摆脱AWS和Google Cloud中心化云提供商,该加速器还将为以太坊、Base、Solana、BSC和Avalanche等生态系统提供计算积分、存储和技术支持。
04:10 2025-04-25
金色午报 | 4月25日午间重要动态一览
7:00-12:00关键词:GRASS、Fleek、MagicBlock、TRUMP 1.Bithumb将上线GRASS、XYO韩元交易对; 2.CoinList将于5月2日上线Fleek(FLK)代币销售; 3.分析:近期两机构大幅减持ETH,或引发市场不稳定; 4.前桥水基金CEO、参议员McCormick再投比特币,总额或达百万美元; 5.Solana生态游戏项目MagicBlock完成750万美元种子轮融资,Faction领投; 6.目前“TRUMP晚宴”VIP席位前三分别为:HTX钱包、Wintermu...
04:04 2025-04-25
Binance将于5月7日移除部分杠杆交易对
4 月 25 日,据官方公告,Binance 将于 5 月 7 日 14 时移除部分杠杆交易对,包括: 全仓杠杆交易对:ALT/FDUSD、BIO/FDUSD、GPS/FDUSD、JUV/USDC、TRU/BTC、TST/FDUSD、SKL/BTC 逐仓杠杆交易对: ALT/FDUSD、GPS/FDUSD、TRU/BTC、SKL/BTC。