Home > AI > Body

New Open Source AI Model Rivals DeepSeek's Performance—With Far Less Training Data

2025-02-13 22:46:12

class font meta serif scene font noto sans desk3 cryptocurrency desktop Crypto News

A team of international researchers from leading academic institutions and tech companies upended the AI reasoning landscape on Wednesday with a new model that matched—and occasionally surpassed—one of China's most sophisticated AI systems: DeepSeek.

OpenThinker-32B, developed by the Open Thoughts consortium, achieved a 90.6% accuracy score on the MATH500 benchmark, edging past DeepSeek's 89.4%.

The model also outperformed DeepSeek on general problem-solving tasks, scoring 61.6 on the GPQA-Diamond benchmark compared to DeepSeek's 57.6. On the LCBv2 benchmark, it hit a solid 68.9, showing strong performance across diverse testing scenarios.

In other words, it’s better than a similarly-sized version of DeepSeek R1 at general scientific knowledge (GPQA-Diamond). It also beat DeepSeek at MATH500 while losing at the AIME benchmarks—both of which try to measure math proficiency.

It’s also a bit worse than DeepSeek at coding, scoring 68.9 points vs 71.2, but since the model is open source, all these scores can drastically get better once people start improving upon it.

What set this achievement apart was its efficiency: OpenThinker required only 114,000 training examples to reach these results, while DeepSeek used 800,000.

The OpenThoughts-114k dataset came packed with detailed metadata for each problem: ground truth solutions, test cases for code problems, starter code where needed, and domain-specific information.

Its custom Curator framework validated code solutions against test cases, while an AI judge handled math verification.

The team reported it used four nodes equipped with eight H100 GPUs, completing in approximately 90 hours. A separate dataset with 137,000 unverified samples, trained on Italy's Leonardo Supercomputer, burned through 11,520 A100 hours in just 30 hours.

"Verification serves to maintain quality while scaling up diversity and size of training prompts," the team noted in their documentation. The research indicated that even unverified versions performed well, though they did not match the verified model's peak results.

The model was built on top of Alibaba’s Qwen2.5-32B-Instruct LLM and supports a modest 16,000-token context window—enough to handle complex mathematical proofs and lengthy coding problems but a lot less than the current standards.

This release arrives amid intensifying competition in AI reasoning capabilities, which seems to be happening at the speed of thought. OpenAI announced on February 12 that all models following GPT-5 would feature reasoning capabilities. One day later, Elon Musk hyped up xAI’s Grok-3's enhanced problem-solving capabilities, promising it would be the best reasoning model to date, and just a few hours ago, Nous Research released another open-source reasoning model, DeepHermes, based on Meta’s Llama 3.1.

The field gained momentum after DeepSeek demonstrated comparable performance to OpenAI's o1 at significantly reduced costs. DeepSeek R1 is free to download, use, and modify, with the training techniques also revealed.

However, unlike Open Thoughts, which decided to open source everything, the DeepSeek development team kept its training data private.

This key difference means developers may have an easier time understanding OpenThinker and reproducing its results from scratch than they would have with DeepSeek because they have access to all the pieces of the puzzle.

For the broader AI community, this release demonstrates once again the viability of building competitive models without massive proprietary datasets. Also, it may be a more trusty competitor for Western developers who are still unsure about using a Chinese model—open source or not.

OpenThinker is available for download at HuggingFace. A smaller, less powerful 7B parameter model is also available for lower-end devices.

The Open Thoughts team pulled together researchers from different American universities, including Stanford, Berkeley, and UCLA, alongside Germany’s Juelich Supercomputing Center. The US-based Toyota Research Institute and other players in the EU AI scene also back it.

Edited by Josh Quittner and Sebastian Sinclair

Web3 Desktop Trading Tool

Stay ahead of the game in the cryptocurrency space.

Elon Musk Says His New AI Chatbot Is 'Scary Smart'—And Arriving in Weeks

Grok-3 Launch Ignites AI Arms Race as OpenAI Readies Counterpunch With GPT-4.5

7x24 Newsflash

21:59 2025-04-09

The US House of Representatives hearing pushes for progress on crypto market regulation bill

The U.S. House of Representatives held a hearing to discuss cryptocurrency market structure legislation, marking an important step forward for the bill. Lawmakers debated a regulatory framework for digital assets aimed at establishing clear rules for cryptocurrency exchanges and token offerings. The hearing paves the way for final legislation that could change the regulatory landscape of the U.S. crypto industry. Supporters of the bill believe it would bring more certainty to the market and inve...

21:50 2025-04-09

Bitcoin DeFi network Arch is backed by venture capital, the specific investment amount and participating institutions have not been disclosed

Bitcoin DeFi network Arch has announced that it has received support from venture capital institutions, which will be used to incubate early-stage projects and promote the development of decentralized finance in the Bitcoin ecosystem. Arch aims to provide developers with tools and resources through its platform to promote Bitcoin Layer2 innovation. This move may further accelerate the application of Bitcoin in the field of DeFi and challenge the dominance of traditional DeFi such as Ethereum. Th...

21:47 2025-04-09

Federal Reserve Kashkari: If tariff suspension continues, inflation impact is expected to weaken

Federal Reserve Kashkari: Dramatic changes took place this afternoon. If the tariff pause continues, the inflationary impact is expected to weaken. Tariffs could lead to inflation and we need to monitor. The threshold for rate cuts remains high.

21:44 2025-04-09

The US's 90-day suspension of reciprocal tariffs "small composition" has come true, a brief description of the timeline

On the evening of April 7, Beijing time, the White House refuted rumors that it was "fake news" to consider suspending tariffs for 90 days. On the evening of April 9, Trump: Calm down! Now is an excellent time to buy (the stock market). In the early morning of April 10, Trump announced that he would suspend the implementation of reciprocal tariffs on most economies (only in effect for 13 hours) for a period of 90 days for negotiation. Then the US stock index soared 12%. Senate Democratic leader ...

21:32 2025-04-09

Ledn Lianchuang: Global bitcoin mortgage rates may drop significantly

The co-founder of Ledn has revealed that the cost of bitcoin mortgages will be significantly reduced globally, in order to enhance the competitiveness of the crypto lending market. This trend may promote more institutions and individuals to adopt BTC mortgage financing, while promoting further integration of DeFi and traditional finance. The specific interest rate adjustment plan and implementation time have not been announced.

21:26 2025-04-09

The Federal Reserve is ready to intervene in the money market if necessary

Federal Reserve Hamak: Markets look nervous, but are still functioning. Ready to intervene in money markets if needed. We may act more quickly when the Fed adjusts interest rates. Monetary policy is moderately restrictive at the moment. It is unclear whether removing the Supplemental Leverage Ratio (SLR) limit will increase risk tolerance. We have been seeing markets adjust themselves. I would rather wait than go in the wrong direction on interest rates.

21:26 2025-04-09

Clarifying the latest US tariff policy: suspension of reciprocal measures in most economies, 10% and industry tariffs

In the early hours of Thursday morning Beijing time, US President Donald Trump suspended the full reciprocal tariffs that had come into effect on April 9. The suspension period is now 90 days to allow for trade negotiations, but he still maintained the 10% benchmark tariff on all goods entering the United States globally (which has been in effect since April 5), and continued to impose tariffs on specific industries, while hinting that more may be imposed in the future. As a result, dozens of tr...

21:11 2025-04-09

Brazilian President: Trump's arbitrary imposition of tariffs will destabilize the international economy

On the 9th local time, Brazilian President Lula delivered a speech at the CELAC summit in Honduras, saying that Trump's "arbitrary imposition of tariffs" will undermine international economic stability. History tells us that "no winner in a trade war". Lula said that "regional autonomy is once again threatened. Attempts to restore the old hegemony are hanging over the entire region". Lula called on Latin America to put aside differences and strengthen cooperation.

21:02 2025-04-09

Trump: Reaffirms that Iran cannot possess nuclear weapons

US President Trump: Reaffirming that Iran cannot have nuclear weapons. Russia and Ukraine need to make a deal. (on Iran) If military action is required, we will take military measures, and Israel will be involved.

20:59 2025-04-09

Trump: I've been considering suspending tariffs for the past few days

US President Trump: Everybody wants to make a deal on tariffs. A lot of times, it's not really starting negotiations until the last minute. Industry-specific tariffs are still coming. I'm going to put tariffs on pharmaceutical companies. Been thinking about suspending tariffs for the past few days. I didn't know a tariff suspension would have that kind of impact.

20:59 2025-04-09

BTC breaks through $83,000

The market shows that BTC has broken through $83,000 and is now reported at $83,000.92, with a 24-hour increase of 7.68%. The market is volatile, so please do a good job in risk control.

20:53 2025-04-09

U.S. Commerce Secretary Lutnik: U.S. expects European Union to delay planned tariff retaliation

The United States expects the European Union to delay the planned tariff retaliation.

Hot News

Vitalik: "DAO" means "project", "official" means "scam"Backpack Exchange已面向英国用户开放其服务派盾：NIBI同名代币发生Rug Pull，损失约31.39万美元香港金管局推出稳定币发行人沙盒 CIAN与Lido合作，在Base上推出wstETH Hyper-Staking Vault Gate.io 3月储备金总额突破60亿美元，额外储备金超8亿美元 Polyhedra Network已于3月12日16时完成ZK空投快照英FCA：不会反对加密资产相关ETN上市请求 BTC流通市值突破1.4万亿美元，续创新高 Space Nation将于3月底启动OIK代币空投

Related Recommendations

Backpack Exchange已面向英国用户开放其服务派盾：NIBI同名代币发生Rug Pull，损失约31.39万美元香港金管局推出稳定币发行人沙盒 CIAN与Lido合作，在Base上推出wstETH Hyper-Staking Vault Gate.io 3月储备金总额突破60亿美元，额外储备金超8亿美元 Polyhedra Network已于3月12日16时完成ZK空投快照英FCA：不会反对加密资产相关ETN上市请求 BTC流通市值突破1.4万亿美元，续创新高 Space Nation将于3月底启动OIK代币空投

About DESK3

About Us Terms of Service Privacy protection Disclaimer

Products

News Swap Bridge Cloud charts Inscription Wallet

Service

Help center Announcement Business support

Sociality