Futures
Access hundreds of perpetual contracts
CFD
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
CFD
U.S. stock CFD derivatives
US Stocks
Access real US stocks and ETFs
HK Stocks
Trade quality Hong Kong-listed stocks
Korean Stocks
SK Hynix
Real Korean stocks and top assets
Stock Futures
High leverage, 24/7 trading
Tokenized Stocks
Backed by real stock assets
IPO Access
Unlock full access to global stock IPOs
GUSD
Mint GUSD for Treasury RWA yields
Stocks Activities
Trade Popular Stocks and Unlock Generous Airdrops
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
IPO Access
Unlock full access to global stock IPOs
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
Promotions
AI
Gate AI
Your all-in-one conversational AI partner
Gate AI Bot
Use Gate AI directly in your social App
GateClaw
Gate Blue Lobster, ready to go
Gate for AI Agent
AI infrastructure, Gate MCP, Skills, and CLI
Gate Skills Hub
10K+ Skills
From office tasks to trading, the all-in-one skill hub makes AI even more useful.
DeepSeek V4 Released: 1.6T Parameter Flagship Supports 1M Context, Inference Computing Power Only 27% of V3.2
ME News message. On April 24 (UTC+8), according to Dongcha Beating monitoring, DeepSeek has open-sourced a V4 series preview version under the MIT license, and the weights have been uploaded to Hugging Face and ModelScope.
The series includes two MoE models: V4-Pro, with total parameters of 1.6T and 49B (49 billion) activated per token; and V4-Flash, with total parameters of 284B (284 billion), activating 13B (130 billion).
Both support 1M token context windows.
The architecture features three upgrades: a hybrid attention mechanism (Compressed Sparse Attention CSA + Heavy Compression Attention HCA) that significantly reduces long-context overhead. Under a 1M-context window, V4-Pro’s single-token inference FLOPs are only 27% of V3.2, and the KV cache (GPU memory usage for storing historical information during inference) is only 10% of V3.2. Manifold-constrained hyperconnection mHC replaces traditional residual connections, improving the stability of signal propagation across layers. Training has switched to the Muon optimizer to accelerate convergence.
Pretraining data exceeds 32T tokens.
Post-training is carried out in two stages: first, use SFT and GRPO reinforcement learning to train experts in each domain separately; then use online distillation to unify and merge them into a single model.
V4-Pro-Max (the maximum inference effort mode) claims to be the strongest current open-source model, achieving top-tier performance on coding benchmarks, with a significantly narrowed gap versus closed-source leaders in reasoning and agent tasks.
V4-Flash-Max, after being given a sufficient thinking budget, delivers reasoning performance close to Pro, but is limited by its parameter scale in pure knowledge and complex agent tasks.
The weights are stored using FP4+FP8 mixed precision.
(Source: BlockBeats)