New Open-Source AI Model Matches China’s DeepSeek with 85% Less Training Data

OpenThinker-32B Outperforms DeepSeek in Math Reasoning with Just One-Seventh of Training Data

  • OpenThinker-32B achieves 90.6% accuracy on MATH500, surpassing DeepSeek’s 89.4% with significantly less training data.
  • The model demonstrates superior efficiency, requiring only 114,000 training examples compared to DeepSeek’s 800,000.
  • The open-source release includes both verified and unverified datasets, enabling broader community development.
  • Training completed in 90 hours on four nodes, each with eight H100 GPUs, showing practical implementation potential.
  • Built on Alibaba’s Qwen2.5-32B-Instruct LLM, supporting a 16,000-token context window for complex operations.

A breakthrough in AI reasoning emerged Wednesday as international researchers unveiled OpenThinker-32B, a model that challenges DeepSeek’s dominance in mathematical and problem-solving capabilities while using just one-seventh of the training data.

The model, developed by the Open Thoughts consortium, posted higher scores than DeepSeek on multiple benchmarks despite the smaller training set. On the MATH500 assessment, OpenThinker-32B scored 90.6% accuracy, exceeding DeepSeek’s 89.4%. It also came out ahead in general problem-solving, with a GPQA-Diamond score of 61.6 versus DeepSeek’s 57.6.
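The headline figures can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers quoted in this article; the constant and function names are illustrative and do not come from the Open Thoughts release.

```python
# Figures as quoted in this article; all names here are illustrative only.
OPENTHINKER_EXAMPLES = 114_000
DEEPSEEK_EXAMPLES = 800_000

SCORES = {
    # benchmark: (OpenThinker-32B, DeepSeek)
    "MATH500": (90.6, 89.4),
    "GPQA-Diamond": (61.6, 57.6),
}


def data_fraction(ours: int, theirs: int) -> float:
    """Fraction of the comparison model's training examples that were used."""
    return ours / theirs


def margins(scores: dict) -> dict:
    """Point difference (OpenThinker minus DeepSeek) for each benchmark."""
    return {name: round(a - b, 1) for name, (a, b) in scores.items()}


if __name__ == "__main__":
    frac = data_fraction(OPENTHINKER_EXAMPLES, DEEPSEEK_EXAMPLES)
    print(f"Share of DeepSeek's data: {frac:.1%} (about 1/{round(1 / frac)})")
    print(f"Reduction in training data: {1 - frac:.1%}")
    for name, diff in margins(SCORES).items():
        print(f"{name}: {diff:+.1f} points")
```

On these numbers, 114,000 examples is roughly 14% of 800,000, which is where both the "one-seventh" and the "85% less" descriptions come from. The benchmark margins work out to 1.2 points on MATH500 and 4.0 points on GPQA-Diamond.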

The project’s efficiency stems from its OpenThoughts-114k dataset, which includes comprehensive metadata, ground truth solutions, and domain-specific information. A separate unverified dataset containing 137,000 samples was processed using Italy’s Leonardo Supercomputer, consuming 11,520 A100 GPU-hours over about 30 hours of wall-clock time.
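For a rough sense of the compute involved, the sketch below converts the figures reported in this article between GPU-hours and GPU counts. It assumes every GPU was busy for the full wall-clock duration and that the H100 run used four nodes of eight GPUs each; both are simplifying assumptions, and the function names are hypothetical.

```python
def gpu_hours(nodes: int, gpus_per_node: int, wall_clock_hours: float) -> float:
    """Total GPU-hours, assuming all GPUs run for the whole duration."""
    return nodes * gpus_per_node * wall_clock_hours


def implied_gpu_count(total_gpu_hours: float, wall_clock_hours: float) -> float:
    """Number of GPUs running in parallel implied by a GPU-hour total."""
    return total_gpu_hours / wall_clock_hours


if __name__ == "__main__":
    # H100 run: 90 hours on four nodes of eight GPUs (assumed layout).
    print("H100 GPU-hours:", gpu_hours(nodes=4, gpus_per_node=8, wall_clock_hours=90))

    # Leonardo run: 11,520 A100 GPU-hours spent over 30 hours.
    print("A100 GPUs in parallel:", implied_gpu_count(11_520, 30))
```

Under those assumptions, the H100 run comes to 2,880 GPU-hours, while the Leonardo figure implies 384 A100s working in parallel. H100 and A100 hours are not directly comparable, so the two totals should be read as orders of magnitude rather than as a like-for-like cost comparison.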

This development arrives amid intensifying competition in AI reasoning capabilities. OpenAI recently announced reasoning features for post-GPT-5 models, while xAI’s Grok-3 and Nous Research’s DeepHermes join the race.

The model’s availability on Hugging Face, including a smaller 7B-parameter version, represents a significant shift toward open-source AI development. Unlike DeepSeek, which keeps its training data private, OpenThinker’s complete transparency enables easier reproduction and improvement by the developer community.

Backed by researchers from Stanford, Berkeley, UCLA, and the Juelich Supercomputing Center, along with the Toyota Research Institute, OpenThinker-32B demonstrates how international collaboration can produce competitive AI models without relying on massive proprietary datasets.
