Futures
Access hundreds of perpetual contracts
TradFi
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
Pre-IPOs
Unlock full access to global stock IPOs
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
Promotions
AI
Gate AI
Your all-in-one conversational AI partner
Gate AI Bot
Use Gate AI directly in your social App
GateClaw
Gate Blue Lobster, ready to go
Gate for AI Agent
AI infrastructure, Gate MCP, Skills, and CLI
Gate Skills Hub
10K+ Skills
From office tasks to trading, the all-in-one skill hub makes AI even more useful.
GateRouter
Smartly choose from 30+ AI models, with 0% extra fees
Xiaomi Reveals Training Details of 1T Model MiMo-V2-Pro: Thousands of GPUs Used, No Job Levels or Deadlines
According to monitoring by Dongcha Beating, Luo Fuli, head of Xiaomi’s large model team, disclosed in her first in-depth interview that the MiMo-V2-Pro model base has a total parameter count of 1T, utilizing thousands of GPUs for training. She believes that a scale of 1T is the baseline for achieving performance close to Claude Opus 4.6 and securing entry into the next phase of agent competition. On a technical level, the Pro version pushes the ratio of global attention to sliding window attention to an extreme sparse ratio of 7:1, controlling the reasoning cost for long texts while expanding the parameter count, and continues to use the MTP (Multi-Token Prediction) architecture to leverage excess computing power for accelerated inference. On the management side, only about 30 to 40 out of the hundred-member MiMo team are directly involved in core iterations, with no established job levels, clear group divisions, or delivery deadlines. When encountering unstable numerical issues such as sudden changes in training loss, the team opts to halt training for troubleshooting, even if it means stopping for one or two weeks and incurring millions in computing costs.