While tech giants spend billions on computing power to train cutting-edge AI models, China’s DeepSeek achieved comparable results by working smarter, not harder. Its DeepSeek V3.2 models match OpenAI’s GPT-5 on reasoning benchmarks despite “lower total training FLOPs” — a result with the potential to reshape how the industry thinks about building advanced artificial intelligence.
For enterprises, this release shows that frontier AI capabilities don’t require frontier-scale compute budgets. DeepSeek V3.2 is available as open source, allowing organizations to evaluate advanced reasoning and agent capabilities while maintaining control over their deployment architecture. As cost efficiency becomes increasingly central to AI deployment strategies, that combination makes the release a practical option rather than a research curiosity.
The Hangzhou-based company released two versions on Monday: the base DeepSeek V3.2 and DeepSeek-V3.2-Speciale. The latter achieved gold medal performances at the 2025 International Mathematics Olympiad and International Olympiad in Informatics, a standard previously reached only by unpublished internal models from major US AI companies.
This accomplishment is particularly significant given that DeepSeek has limited access to advanced semiconductor chips due to export regulations.
Resource efficiency as a competitive advantage
DeepSeek’s results contradict the common industry assumption that frontier AI performance requires ever-larger computational resources. The company attributes this efficiency to architectural innovations, particularly DeepSeek Sparse Attention (DSA), which significantly reduces computational complexity while maintaining model performance.
The base DeepSeek V3.2 model achieved 93.1% accuracy on AIME 2025 math problems, reached a rating of 2386 on Codeforces, and matched GPT-5 on reasoning benchmarks.
The Speciale variant went further, scoring 96.0% on the 2025 American Invitational Mathematics Examination (AIME), 99.2% on the February 2025 Harvard-MIT Mathematics Tournament (HMMT), and achieving gold medal performances at both the 2025 International Mathematics Olympiad and the International Olympiad in Informatics.
These results are all the more notable given the export restrictions limiting DeepSeek’s access to advanced chips. The technical report revealed that the company allocated a post-training compute budget exceeding 10% of pre-training costs — a significant investment that enabled advanced capabilities through reinforcement learning optimization rather than brute-force scaling.
Innovations that increase efficiency
The DSA mechanism represents a departure from traditional attention architectures. Rather than processing all tokens with the same computational intensity, DSA employs a “lightning indexer” and a fine-grained token selection mechanism that identifies and processes only the most relevant information for each query.
This approach reduces the core attention complexity from O(L²) to O(Lk), where k is the number of selected tokens (a small fraction of the total sequence length L). During continued pre-training from the DeepSeek-V3.1-Terminus checkpoint, the company trained DSA on 943.7 billion tokens, using 480 sequences of 128,000 tokens per training step.
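The complexity reduction can be illustrated with a minimal NumPy sketch of the general top-k idea: a cheap indexer scores every cached token, and full attention runs only over the k highest-scoring ones. The function name and the simple dot-product indexer here are illustrative assumptions, not DeepSeek’s actual implementation.

```python
import numpy as np

def sparse_attention(q, K, V, k=4):
    """Top-k sparse attention for a single query vector.

    An indexer scores all L cached tokens cheaply, then the core
    (softmax) attention is computed only over the k highest-scoring
    tokens: O(L*k) over a sequence instead of O(L^2).
    """
    L, d = K.shape
    # Indexer stage: cheap relevance score for every token.
    index_scores = K @ q                    # shape (L,)
    top = np.argsort(index_scores)[-k:]     # indices of the k most relevant tokens
    # Core attention restricted to the selected tokens only.
    logits = (K[top] @ q) / np.sqrt(d)      # shape (k,)
    weights = np.exp(logits - logits.max())
    weights /= weights.sum()
    return weights @ V[top]                 # weighted sum of k value vectors, shape (d,)

rng = np.random.default_rng(0)
L, d = 16, 8
out = sparse_attention(rng.normal(size=d),
                       rng.normal(size=(L, d)),
                       rng.normal(size=(L, d)), k=4)
print(out.shape)  # (8,)
```

Because the softmax only ever sees k logits, the per-query cost stops growing with the context length once the indexer has done its (much cheaper) pass.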
The architecture also introduces context management tailored to tool-invocation scenarios. Unlike previous reasoning models that discarded thought content after each user message, DeepSeek V3.2 models preserve reasoning traces when only tool-related messages are added, improving token efficiency in multi-turn agent workflows by eliminating redundant re-reasoning.
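The policy described above can be sketched as a small cache-pruning function: reasoning traces produced since the most recent user message (i.e. during an ongoing tool-calling turn) are kept, while traces from earlier turns are dropped. The message schema and field names here are hypothetical, not DeepSeek’s actual API format.

```python
def prune_reasoning(messages):
    """Drop reasoning traces from turns that precede the most recent
    user message; keep traces produced since then, so an ongoing
    tool-calling loop never re-derives its own thought content.
    """
    # Index of the last user message (-1 if there is none).
    last_user = max((i for i, m in enumerate(messages) if m["role"] == "user"),
                    default=-1)
    pruned = []
    for i, m in enumerate(messages):
        m = dict(m)  # copy so the caller's history is untouched
        if m["role"] == "assistant" and i < last_user:
            m.pop("reasoning", None)  # older turn: discard thought content
        pruned.append(m)
    return pruned
```

With this policy, appending a tool result and calling the model again leaves all reasoning from the current turn in context, while a fresh user message invalidates the old traces on the next call.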
Enterprise applications and practical performance
For organizations evaluating AI implementations, DeepSeek’s approach offers tangible benefits beyond benchmark scores. On Terminal-Bench 2.0, which evaluates coding workflows in a command-line environment, DeepSeek V3.2 achieved 46.4% accuracy.
The model scored 73.1% on SWE-bench Verified and 70.2% on SWE-bench Multilingual, software engineering benchmarks that demonstrate its practicality in real development environments.
The model showed significant improvements over previous open-source systems on agent tasks requiring autonomous tool use and multi-step reasoning. The company built an extensive agent-task synthesis pipeline that generates over 1,800 distinct environments and 85,000 complex prompts, allowing the model to generalize its reasoning strategies to unfamiliar tool-use scenarios.
DeepSeek has open-sourced the base V3.2 model on Hugging Face, allowing enterprises to deploy and customize it without vendor lock-in. The Speciale variant remains accessible only via the API, reflecting its higher token usage and the trade-off between maximum performance and deployment efficiency.
Industry impact and reception
This release caused a great deal of discussion in the AI research community. Susan Zhang, principal research engineer at Google DeepMind, praised DeepSeek’s detailed technical documentation, particularly highlighting the company’s work on post-training model stabilization and agent enhancements.
The timing amplified attention, coming just ahead of the Neural Information Processing Systems (NeurIPS) conference. Florian Brand, an expert on the Chinese open-source AI ecosystem attending NeurIPS in San Diego, described the immediate reaction: “After the DeepSeek announcement, all the group chats were buzzing today.”
Recognized limitations and development path
DeepSeek’s technical report is candid about the remaining gap to frontier models. Token efficiency is still a challenge: DeepSeek V3.2 models typically require longer generation trajectories to match the output quality of systems like Gemini 3 Pro. The company also acknowledges that the model’s breadth of world knowledge lags leading proprietary models, a consequence of its lower total training compute.
Future development priorities include scaling pre-training compute to expand world knowledge, streamlining reasoning chains to improve token efficiency, and refining the underlying architecture for complex problem-solving tasks.


