Hopp til hovedinnhold
 AI-nyheter, ferdig filtrert for ledere
SISTE:

Anthropic: AI fant over 10.000 alvorlige sårbarheter • Reuters: AI-feil i retten gir advokater karriererisiko • CNBC: GitHub svikter under presset fra AI-koding

DeepSeek makes 75% price cut permanent on flagship V4-Pro
CIOCFOBoardDeepSeekAI-pricingV4-ProHuaweiAscendAI InfrastructureAI GovernanceVendor RiskEnterprise AIModel Cost

DeepSeek makes 75% price cut permanent on flagship V4-Pro

JH
Joachim Hogby
23. mai 202623. mai 20264 min lesingKilde: Reuters

DeepSeek confirmed on May 23 that its 75% discount on the flagship V4-Pro model will become permanent. The promotional pricing set to expire on May 31 will not revert — the lower price is now the base price indefinitely.

This means V4-Pro, a model designed to compete directly with OpenAI, Anthropic, and Google on quality, now costs roughly 20 to 35 times less than Western frontier models for comparable workloads.

New permanent pricing:

  • Input: $0.435 per million tokens (down from $1.74)
  • Output: $0.87 per million tokens (down from $3.48)
  • Cache-hit: in some cases one-tenth of the original price
  • In yuan: 0.025–6 yuan per million tokens, down from 0.1–24 yuan

For an enterprise processing billions of tokens monthly, the annual savings could run into millions of dollars.

Not a budget model

V4-Pro is not a lightweight system. Its Mixture-of-Experts architecture uses an estimated 1.6 trillion total parameters with around 49 billion activated during inference. The model supports a one-million-token context window and can output up to 384,000 tokens in a single request.

By comparison, GPT-5.5 is estimated at $8–15 per million input tokens and $30–50 per million output, with Anthropic's Claude Opus series even more expensive for heavy reasoning and long-context tasks.

Huawei chips underneath

A key factor behind DeepSeek's ability to make the cut permanent is infrastructure. The V4 family is the company's first major AI model line optimized to run on Huawei's Ascend AI accelerators rather than primarily NVIDIA hardware.

Chinese tech giants — Tencent, Alibaba, and ByteDance — are reportedly racing to secure Ascend 950 and 950PR chips following the V4 launch, according to Reuters. Production remains constrained because US export controls continue limiting China's access to advanced chipmaking. Huawei aims to ship approximately 750,000 Ascend 950PR units during 2026.

What this means for leaders

The permanent pricing structure changes the equation for any organization paying full price for Western AI models. CIOs and CFOs should consider several things:

Cost opportunity: If DeepSeek delivers comparable quality on relevant tasks, the savings are real. Test V4-Pro against your own workloads before drawing conclusions.

Vendor risk: As Chinese models permanently pressure Western pricing, the case for locking all AI traffic to a single supplier weakens. Multi-model sourcing provides negotiating leverage and flexibility.

Data governance and regulation: EU and Norway have stricter data-handling requirements than China. DeepSeek access in Europe is not as simple as an API call from the US. Assess whether data sovereignty, privacy, and audit requirements prevent adoption.

Infrastructure fragmentation: China's AI stack now running on Huawei chips rather than NVIDIA means the global AI ecosystem is fragmenting. For organizations building their own AI infrastructure — or dependent on NVIDIA supply — this changes vendor power dynamics and pricing expectations.

This is no longer a promotion. It is a permanent pricing structure from a model claiming to be in the top tier — powered by a semiconductor stack outside the Western ecosystem.

Sources and media

📬 Likte du denne?

AI-nyheter for ledere. Kuratert av en CIO som bygger det selv. Daglig i innboksen.