Futures
Access hundreds of perpetual contracts
TradFi
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
Pre-IPOs
Unlock full access to global stock IPOs
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
Promotions
AI
Gate AI
Your all-in-one conversational AI partner
Gate AI Bot
Use Gate AI directly in your social App
GateClaw
Gate Blue Lobster, ready to go
Gate for AI Agent
AI infrastructure, Gate MCP, Skills, and CLI
Gate Skills Hub
10K+ Skills
From office tasks to trading, the all-in-one skill hub makes AI even more useful.
GateRouter
Smartly choose from 30+ AI models, with 0% extra fees
Goldman Sachs: What does DeepSeek V4 mean for China's AI?
Author: Bao Yilong, Wall Street Insights
Goldman Sachs believes that the core significance of DeepSeek V4 lies in supporting more complex intelligent agent applications at lower costs, thereby opening new space for AI application scaling.
On April 24, Goldman Sachs’ Ronald Keung team published a research report stating that the all-new open-source V4 model continues the efficiency-first, open-source approach of DeepSeek.
At the technical level, V4 achieves significant cost reductions in long context windows through architectural upgrades and explicitly bets on Huawei’s domestically produced chips. At the market level, this release accelerates the intense competition among Chinese AI models, with programming ability, task completion rate, and multimodality becoming the core dividing lines for pricing power.
Goldman Sachs maintains its recommended ratings for cloud computing and data center sectors, as continuous improvements in computing power cost efficiency will drive faster AI application penetration. The dual growth of enterprise AI agents and consumer AI assistants will support ongoing enhancement of cloud service pricing capabilities.
V4 Architecture Upgrade, Supporting Longer Contexts with Less Memory
DeepSeek V4 is released in two versions: Pro and Flash.
The Pro version is flagship-scale, with 1.6 trillion parameters (16k active parameters); the Flash version is relatively lightweight, with 284 billion parameters (130 billion active parameters). Both models support an ultra-long context window of 1 million tokens, comparable to top US models (SOTA), but require significantly less memory and KV cache.
According to Goldman Sachs’ report, V4 Pro requires only 27% of the FLOPs for inference on 1 million tokens compared to DeepSeek V3.2, with KV cache usage at just 10%; V4 Flash is even more aggressive, reducing FLOPs to 10% and KV cache to 7%.
This efficiency leap is achieved through three key architectural innovations:
In terms of hybrid attention mechanisms, V4 introduces a hybrid architecture combining compressed sparse attention (CSA) and heavily compressed attention (HCA). CSA compresses the KV cache along the sequence dimension before sparse attention computation, while HCA adopts more aggressive compression but retains dense attention. Together, they greatly reduce the temporary memory needed for long inputs.
For training stability, V4 introduces the mHC mechanism, enhancing the stability of information transfer across multiple network layers;
Simultaneously, Muon is used as the primary training optimizer (with some modules retained as AdamW) to accommodate more complex network architectures than V3, improving convergence quality during training.
Goldman Sachs points out that these efficiency gains are especially significant for long-term task scenarios, such as long-cycle agent tasks requiring processing large amounts of context.
It is worth noting that DeepSeek currently remains focused on foundational text models, while internet giants like Alibaba, ByteDance, MiniMax, and independent model players tend to pursue multimodal or all-modal routes, showing a clear divergence in the path toward AGI.
Domestic Chips Accelerate Deployment, Huawei Ascend 950 Paves the Way for Price Reductions
Another important signal from the V4 release is that DeepSeek explicitly includes the mass production of Huawei’s Ascend 950 supernodes in its commercial roadmap.
DeepSeek expects that as Huawei’s Ascend 950 supernodes achieve large-scale supply in the second half of 2026, the API pricing for V4 Pro will see a significant decrease.
Goldman Sachs’ report indicates that this statement has dual implications:
Based on current pricing, Goldman Sachs data shows that V4 Pro’s pricing on mainstream API platforms is already competitive, and as domestic computing power supply expands, this advantage is expected to further grow in the second half of 2026.
Chinese AI Model Competition Enters a Differentiated Stage
The open-source release of DeepSeek V4 has quickly triggered a new round of intensive follow-up from Chinese AI model players.
According to Goldman Sachs’ overview, recent players launching new models include: Kimi K2.6, Alibaba Qwen3.6-Max, Tencent Hy3 preview, Xiaomi V2.5, and the MiniMax M3/Hailuo expected to launch in May.
In Goldman Sachs’ view, the key differentiating factors that will determine future pricing power of these models will focus on two dimensions:
The report points out that the advantages and disadvantages of these two types of players are clear:
It is also noteworthy that Goldman Sachs’ report cites news reports indicating that Tencent and Alibaba are in talks to invest in DeepSeek at valuations exceeding $20 billion, while Zhipu and MiniMax have latest market caps of approximately $53 billion and $31 billion, respectively. This potential deal reflects the giants’ strategic competition for scarce top-tier AI capabilities.
Unchanged Track Logic: Cloud Computing and Data Centers
Goldman Sachs maintains its view that cloud computing and data centers remain the top sub-sectors for China’s internet sector, based on:
In the B2B enterprise cloud market, Alibaba leads with the largest external AI cloud revenue; in the B2C consumer market, ByteDance currently has the highest daily token usage for AI chatbots. China’s AIGC applications’ DAU continues to grow strongly, with a 36% month-over-month increase in March 2026.
Key recommended targets, Goldman Sachs emphasizes, include GDS (Global Data Service), Century Internet, Alibaba, and Kingsoft Cloud, as core allocations to capture the dividends of China’s AI infrastructure expansion.
Additionally, the second tier includes e-commerce and mobility sectors, the third tier covers AI model-related stocks, and the fourth tier involves gaming and entertainment sectors.